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Preface 


Combinatorics belongs to those areas of mathematics having experienced a most 
impressive growth in recent years. This growth has been fuelled in large part by the 
increasing importance of computers, the needs of computer science and demands 
from applications where discrete models play more and more important roles. But 
also more classical branches of mathematics have come to recognize that combi- 
natorial structures are essential components of many mathematical theories. 

Despite the dynamic state of this development, we feel that the time is ripe 
for summarizing the current status of the field and for surveying those major re- 
sults that in our opinion will be of long-term importance. We approached leading 
experts in all areas of combinatorics to write chapters for this Handbook. The 
response was overwhelmingly enthusiastic and the result is what you see here. 

The intention of the Handbook is to provide the working mathematician or com- 
puter scientist with a good overview of basic methods and paradigms, as well as 
important results and current issues and trends across the broad spectrum of com- 
binatorics. However, our hope is that even specialists can benefit from reading this 
Handbook, by learning a leading expert’s coherent and individual view of the topic. 

As the reader will notice by looking at the table of contents, we have structured 
the Handbook into five sections: Structures, Aspects, Methods, Applications, and 
Horizons. We feel that viewing the whole field from different perspectives and 
taking different cross-sections will help to understand the underlying framework 
of the subject and to see the interrelationships more clearly. As a consequence of 
this approach, a number of the fundamental results occur in more than one chap- 
ter. We believe that this is an asset rather than a shortcoming, since it illustrates 
different viewpoints and interpretations of thé results. 

We thank the authors not only for writing the chapters but also for many helpful 
suggestions on the organization of the book and the presentation of the material. 
Many colleagues have contributed to the Handbook by reading the initial versions 
of the chapters and by making proposals with respect to the inclusion of topics and 
results as well as the structuring of the chapters. We are grateful for the significant 
help we received. 

Even though this Handbook is quite voluminous, it was inevitable that some 
areas of combinatorics had to be left out or were not covered in the depth they 
deserved. Nevertheless, we believe that the Handbook of Combinatorics presents 
a comprehensive and accessible view of the present state of the field and that it 
will prove to be of lasting value. 


Ronald Graham 
Martin Grdtschel 
Laszl6 Lovasz 
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Our purpose in this chapter is twofold: to introduce the basic concepts and tech- 
niques of graph theory, and to develop that part of the theory concerned with 
paths and circuits. We have attempted to weave these two strands together, pro- 
ceeding from fundamental notions and elementary observations to more sophisti- 
cated concepts and methods, much as one does when engaged in research. One of 
the attractive features of the subject is the wealth of simply stated but challenging 
open problems, and we have included a large number of these. It must be empha- 
sized that the literature on paths and circuits is extensive. Thus what is presented 
here is necessarily a selection. Nevertheless, our hope is that, after studying this 
chapter, the reader will be well equipped to embark on the chapters dealing with 
particular aspects of graph theory. Circuits in graphs are treated in depth in the 
books of Walther and Voss (1974) and Voss (1991). There are also many survey 
articles on particular aspects of the topic. We recommend, in particular, the arti- 
cles of Carsten Thomassen. Besides their substantial contributions to the theory 
of graphs, each includes an informed and stimulating discussion of the question 
under consideration and its links to related matters. 


1. Basic concepts 


1.1. Graphs 


A graph G consists of a nonempty set V(G) of elements, called vertices or nodes, 
and a set E(G) of elements called edges, together with a relation of incidence that 
associates with each edge two vertices, called its ends. Two graphs G and H are 
isomorphic if there are bijections 0: V(G) > V(H) and ¢: E(G) — E(H) such 
that vertex v and edge e are incident in G if and only if vertex 6(v) and edge ¢(e) 
are incident in H. The pair of mappings (8, @) is an isomorphism from G to H. 

An edge with identical ends is a Joop and one with distinct ends is a link. Two 
or more edges with the same pair of ends are multiple edges. The two ends of an 
edge are said to be joined by the edge and to be adjacent to one another; adjacent 
vertices are also referred to as neighbours. An edge with ends x and y is denoted 
by the unordered pair xy. The number of vertices of a graph G is called its order, 
the number of edges its size; these parameters are denoted by u(G) and e(G), 
respectively. 

A multigraph is one with no loops, a simple graph one with neither loops nor 
multiple edges. A graph is finite if both its vertex set and its edge set are finite, and 
infinite otherwise. This chapter is devoted entirely to finite graphs. Quite different 
techniques are needed for dealing with infinite graphs. These are discussed in the 
survey by Thomassen (1983e) and also in chapter 42 by Hajnal. 

Graphs are so named because they can be represented pictorially, a vertex being 
indicated by a point and an edge by a line joining the points representing its ends; 
fig. 1 depicts the five Platonic graphs, the graphs of the Platonic solids. The graph 
of the tetrahedron in fig. 1 is an example of a complete graph: any two of its vertices 
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A A 


Tetrahedron Octahedron 


Cube Icosahedron Dodecahedron 
Figure 1. The Platonic graphs. 


are joined by an edge. An empty graph, by contrast, is one with no edges. The 
simple complete graph on n vertices is denoted by Ky. 

A graph is bipartite if its vertex set can be partitioned into subsets ¥ and Y 
so that each edge has one end in X and one end in Y; such a partition (X, Y) 
is called a bipartition of the graph. A bipartite graph G with bipartition (X, Y) is 
denoted by G(X, Y). Such a graph is balanced if |X|=|Y| and complete if each 
vertex of X is joined to each vertex of Y; the praph of the cube in fig. 1 is a 
balanced bipartite graph. The simple complete bipartite graph G(X, Y) in which 
|X| =m and |Y|=7n is denoted by Kn; if m=1, such a graph is called a star. 
The graphs Ks and K33 are known as the Kuratowski graphs because they feature 
in a fundamental theorem of K. Kuratowski (Theorem 5.4); K33 is also sometimes 
referred to as the Thomsen graph. 

It is sometimes convenient to represent the relations of adjacency and incidence 
in a graph by matrices. The adjacency matrix A(G) is the matrix (a,,) whose rows 
and columns are indexed by the vertices of G, a,, being the number of edges of 
G joining vertices u and v. The incidence matrix B(G) is the matrix (b,.) whose 
rows are indexed by the vertices of G and columns by the edges of G, b,. being 
the number of times vertex v and edge e are incident; so by, := 2 if e is a loop at 
v, Due ‘= 1 if e is a link at uv, and b,, := 0 otherwise. 

As a general rule, the symbol specifying the graph (typically G) is dropped from 
notation if the context makes clear which graph is being discussed; vertex and edge 
sets are then denoted simply by V and E, and adjacency and incidence matrices 
by A and B. By the same tokgn, the notation G(V, E) specifies a graph G with 
vertex set V and edge set E. An exception to this rule is the frequent use of n 
for the order, and occasionally m for the size, the letters v and e always denoting 
vertices and edges, respectively. 

The set of neighbours of a vertex v in a graph G is denoted by N;,(v). In a simple 
graph, the number of neighbours of a vertex is called its degree. More generally, 
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the degree of a vertex v in a graph G is the number of edges of G incident with 
v, a loop counting as two edges. A graph is d-regular if each vertex has degree 
d, and regular if it is d-regular for some d. A graph is even if each vertex is of 
even degree. The degree of a vertex v in a graph G is denoted by dg(v), the 
minimum, maximum, and average degrees of the vertices of a graph G by 8(G), 
A(G), and d(G), respectively. An isolated vertex is a vertex of degree zero, a leaf 
one of degree one. 

The number of edges in a graph and the degrees of its vertices are linked by a 
simple identity due to Euler (1736). 


Proposition 1.1. For any graph G, 
S 5d) = 2e(G). 


veVv 
' 


Proof (by counting in two ways). We consider the incidence matrix B := (b,,) of 
G and compute the sum of its entries in two ways: 


S-d(v) = 0° bve = D> Yo ve = D2 = 2e(G). 


veVv veV ecE ec€E veV ecE 


We have drawn attention to the proof technique used here, counting in two 
ways, because it is one that is frequently employed in combinatorial arguments. 
Further techniques will be highlighted as they are encountered; some of these are 
of general applicability, while others are designed specifically to study paths and 
circuits in graphs. 


Corollary 1.2. in any graph, the number of vertices of odd degree is even. 


1.2. Subgraphs and minors 


Let G and H be graphs. If V(H) C V(G) and E(#) C E(G), and if every edge of 
H has the same pair of ends in H as it has in G, the graph H is called a subgraph of 
G and the graph G a supergraph of H; these relationships are denoted by H CG 
and G 2 H, respectively. We also say that H is contained in G and that G contains 
H. A subgraph or supergraph H of G spans G if V(H) = V(G). 

The union of two subgraphs G, and G, of G is the subgraph G, UG, with 
vertex set V(G,) UV(G2) and edge set E(G,) U E(G»). Likewise the intersection 
of G, and G, is the subgraph G, M G2 with vertex set V(G,) N V(G2) and edge 
set E(G,)N E(G)). 

If S is a set of edges of a graph G(V, E), the graph whose vertex set is V and 
whose edge set is E \S is a subgraph of G: Likewise, if S is a set of vertices of 
G, the subgraph of G whose vertex set is V \S and whose edge set consists of 
those edges of G with both ends in V \S is a subgraph of G. In both instances, 
this subgraph is denoted by G \ S, and is said to be obtained by deleting S from 
G. If H is a subgraph of G, we denote the subgraph G \ V(#2) simply by G \ H. 


8 J.A. Bondy 


If S is a set of unordered pairs of vertices of G(V, EZ), the graph whose vertex 
set is V and whose edge set is E US is a spanning supergraph of G. This graph 
is denoted by G+5, and is said to be obtained by adding S to G; addition and 
deletion of edges are inverse operations. One may also add a set S of vertices to 
a graph. In this case, the resulting graph depends on which edges incident to the 
vertices of § are also added. If all pairs uv with wu € S and v € V are added, and 
no others, the resulting graph is denoted by G+S. 

If S is a set of edges of G, the graph whose vertex set consists of all ends of 
edges in S and whose edge set is S is a subgraph of G. Likewise, if S is a set of 
vertices of G, the graph whose vertex set is 5 and whose edge set consists of those 
edges of G with both ends in S is a subgraph of G. In both instances, this subgraph 
is denoted by G[S] and is called the subgraph of G induced by S. 

Any subgraph of a graph may be obtained by deleting appropriate vertices and 
edges. Similarly, any spanning subgraph may be obtained by deleting edges, any 
vertex-induced subgraph by deleting vertices, and any edge-induced subgraph by 
deleting edges and isolated vertices. 

The subgraphs of a graph can be represented algebraically by their incidence 
matrices, but it turns out to be more fruitful to identify a subgraph with the char- 
acteristic function of its edge set. Let E be a set. The characteristic function of a 
subset § of E is the function ys : EF — GF(2) defined by 


(ix {1 fees, 
WN") 0 ifegS. 


The functions y; form a vector space over GF(2). When E is the edge set of a 
graph G, this vector space is called the edge space of G, and may be regarded as the 
set of edge-induced subgraphs of G, the sum of two such subgraphs G[S] and G(T] 
being G{SAT], where A denotes symmetric difference. Families of subgraphs that 
are closed under this operation are thus of particular interest. The even subgraphs 
of G constitute such a family, and form a vector space €(G) over GF(2) called the 
cycle space of G. (Originally, even graphs were called cycles. To avoid confusion, 
we have chosen not to adopt this name, because it is also commonly used for what 
we shall call a circuit.) 

As noted earlier, deletion and addition of edges are inverse operations. Another 
operation closely related to deletion, but in a more subtle way, is that of contrac- 
tion. If S is a set of edges of a graph G, the graph derived from G by deleting 
S and identifying each pair of ends of the edges of S is said to be obtained by 
contracting S, and is denoted by G/S. If H is a subgraph of G, we denote the 
contraction G/E(H) simply by G/H. A minor of a graph G is any graph obtain- 
able from G by means of a sequence of vertex deletions, edge deletions and edge 
contractions. Minors play a very important role in the structural theory of graphs. 
In particular, the absence of a particular graph as a minor has a profound effect 
on the properties of a graph. The survey by Robertson and Seymour (1985) gives 
a good overview of this topic. Minors of graphs are discussed also in chapter 5 by 
Thomassen. 
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1.3. Walks 


A walk in a graph G is a sequence W := upe11e2 --- gz, where U9, U1,...,U¢ are 
vertices of G, e;,€2,...,¢€¢ are edges of G, and v,;_, and vu; are the ends of e;, 1< 
i < . In a simple graph, a walk is completely specified by its sequence of vertices, 
Uv) © Ve. The vertex vg is the tail of W, the vertices v,,...,U¢_1 its internal vertices, 
and the vertex v; its head; the tail and head are often referred to as the ends of W. 
The walk W is described as a ug-walk or a UgU;-walk if, as is often the case, one 
wishes to specify explicitly one or both of its ends; one also says that W connects 
Up and u;. The walk W is open if ug # uz and closed if vo = u,. The length of W is 
its number of edges, namely ¢; the parity of W is the parity of its length. 

We say that vertex vu; precedes vertex u; on W if i < j, and indicate this rela- 
tionship by the notation v; < u;. If v; precedes v,, the walk vj¢,,1;41 ---vj_1e;0, is 
referred to as the v,v;-segment of W, and may be denoted by W{v;,u,], W(ui-1,u;], 
W([v;,¥j41) or W(v;-1,0;41), aS convenient; or by W[v;,u;], to indicate traversal of 
the segment in the opposite sense. 

The walk W is a trail if its edges e,,€2,...,e, are distinct, and a path if its vertices 
Up, V1,---,U~ are distinct. Note that a vouv,-walk of minimum length is necessarily 
a uov,-path. A closed trail of positive length whose vertices (apart from its ends) 
are distinct is called a circuit. Dirac (1952) found a link between the length of a 
longest path or circuit in a graph and the minimum degree. 


Theorem 1.3. Let G be a simple graph of minimum degree 6. Then 
(i) G contains a path P of length at least 8; 
(ii) if 6 > 2, G contains a circuit C of length at least 6 +1. 


Proof (by choosing a maximal path). Let P := uu; ---u, be a maximal path of G 
(one that cannot be extended from either end). Then all the neighbours of uo lie 
on P. Setting k := max {i: vov; € E}, we have k > d(up) 2 6. Thus P is of length 
at least 5 and, if 5 > 2, C := ugv, ---Ugbo is a circuit of length at least 6+1. O 


A circuit decomposition of a graph is a partition of its edge set into circuits. 
For a graph to have a circuit decomposition, each vertex must clearly be of even 
degree. Veblen (1912) proved the converse statement. 


Theorem 1.4. A graph is even if and only if it admits a circuit decomposition. 


Proof. A graph with a circuit decomposition is even because each vertex is inci- 
dent with either zero or two edges of any circuit. On the other hand, a nonempty 
even graph has at least one circuit, by Theorem 1.3(ii), and the subgraph obtained 
on deleting the edges of this circuit is even. Sufficiency now follows by induction 
on e(G). O 


1.4. Connection 


A graph is connected if any two of its vertices are connected by a path. A com- 
ponent of a graph G is a maximal connected subgraph of G (that is, a connected 
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subgraph contained in no other connected subgraph). The number of components 
of a graph G is denoted by c(G). The vertices and edges of an xy-path form a 
connected subgraph in which the ends x and y have degree one and the internal 
vertices have degree two (unless the path is of length zero, in which case x = y 
has degree zero); similarly, the vertices and edges of a circuit form a connected 
2-regular subgraph. The terms path and circuit will frequently be used to refer to 
such subgraphs. Circuits are very often called cycles, they are also sometimes re- 
ferred to as polygons, short ones being given suggestive geometrical names: digon, 
triangle, quadrilateral, and so on. 

The distance d(x, y) between two vertices x and y in a graph is the length of 
a shortest xy-path in G, if there is one; otherwise their distance is defined to be 
infinite. The diameter of a graph is the maximum distance between its vertices. 
The girth of a graph is the length of a shortest circuit, if there is one, otherwise 


the girth is defined to b 

A connected graph G is separable if it contains subgraphs G, and G», and a 
vertex v, such that (a) G; UG, =G, (b) G,; NG, = {v}, and (c) G; ¥ {v}, i= 
1,2. Otherwise, G is nonseparable. A block of a connected graph is a maximal 
nonseparable subgraph; the blocks of a disconnected graph are the blocks of its 
components. 

A cut vertex of a connected graph G is any vertex v satisfying conditions (b) and 
(c) above for some pair G,, G2 of subgraphs of G satisfying (a). The cut vertices 
of a disconnected graph are defined to be the cut vertices of its components. Thus 
v is a cut vertex of G if and only if G\u has more components than G, unless 
uv is incident to a loop and at least one other edge. Analogously, we define a cut 
edge of a graph G to be an edge e such that G\ e has more components than G; 
equivalently, a cut edge is one that belongs to no circuit. 

The circuits of a graph lie entirely within its blocks. Thus, where circuits are con- 
cerned, nonseparability is a natural condition to impose. Hartman (1983) strength- 
ened Theorem 1.3 (ii) for nonseparable graphs by showing that the circuits of 
length at least 6 +1 in a simple nonseparable graph G of minimum degree 6 gen- 
erate the cycle space of G, unless 5 is odd and G = K;,,. If such a graph has an 
odd circuit, it therefore has an odd circuit of length at least 6 +1, unless & is odd 
and the graph is K5,;. 

A graph is acyclic, or a forest, if it contains no circuit or, equivalently, if each of 
its edges is a cut edge. The following basic property of forests was first noted by 
Listing (1862). 


Proposition 15. Let G be a forest. Then 

e(G) — v(G) + c(G) = 0. 
Proof (by induction). The assertion holds if e(G) = 0, since in this case each com- 
ponent is a single vertex. Assume that e(G) > 0 and that the assertion holds for all 


forests on fewer than e(G) edges. Let G’ be a subgraph of G obtained by deleting 
any one edge. Then G’ is a forest with e(G) — 1 edges, v(G) vertices and c(G)+1 


Basic graph theory: Paths and circuits 11 
components. By the induction hypothesis, applied to G’, 


e(G) — v(G) +c(G) =e(G) — 1—u(G)+c(G) +1 
=e(G’) —u(G’)+c(G’) =0. 


Thus the assertion holds for all forests. oO 


A tree is a connected acyclic graph. It follows from Propositions 1.1 and 1.5 that 
every tree on two or more vertices has at least two vertices of degree one; such 
vertices are called Jeaves. Any connected graph, when viewed as an assemblage 
of its blocks, has the structure of a tree. More precisely, the block cut-vertex tree 
of a connected graph G is the tree H whose vertices are the blocks and the cut 
vertices of G, a block and cut vertex being adjacent in A if the block includes 
the cut vertex in G: H is connected because G is, and acyclic because a block is a 
maximal nonseparable subgraph. The blocks of G corresponding to the leaves of 
H are the endblocks of G. 

Every connected graph contains a spanning tree, a spanning subgraph which is 
a tree. In fact, the spanning trees are precisely the spanning subgraphs that are 
connected and are minimal with respect to the property. They are also the spanning 
subgraphs that are acyclic and are maximal with respect to the property, and thus 
can be “grown” by starting with one vertex and adding edges and vertices one by 
one. This latter viewpoint is of fundamental importance in both theoretical and 
applied contexts, as we shall see shortly in the section on search trees. 

Each spanning tree T of a connected graph G determines a basis 7 of its cycle 
space @ in a natural way. For each edge e of G not in 7, the subgraph T +e 
contains a unique circuit C,. The set €7 of all such circuits is a basis for ©: it is 
independent because C, is the only circuit in @; that includes e, and it generates 
€ because any even subgraph H of G can be expressed in the form 


H= > C.. 
eck(H)\E(T) 


To see this, it suffices to observe that H + >>,. E(H)\E(T) Ce 18 an even subgraph 
contained in 7, and hence is empty by Theorem 1.4. 

Because the cycle space of a disconnected graph is the direct sum of the cycle 
spaces of its components, Proposition 1.5 shows that the dimension of the cycle 
space of a graph G is given by the formula 


dim€(G) = e(G) — v(G) + c(G). 
1.5. Extremal graphs 


Proposition 1.5 has the following immediate consequence. 


Corollary 1.6. Let G be a graph on n vertices and m edges, where m >n. Then G 
contains a circuit. 
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Corollary 1.6 is a very simple instance of a result in extremal graph theory. It 
asserts that every graph on ” vertices and sufficiently many edges (at least n) 
contains a circuit. Furthermore, the bound on the number of edges is best possible 
because there exist graphs on 7 vertices and n — 1 edges that do not contain circuits 
(namely trees); these graphs (trees) are referred to as extremal graphs for the 
property in question (containing a circuit). 

While a graph on n vertices and at least n edges is guaranteed to contain a 
circuit, the length of the circuit may take on any value between 1 and n. The 
following theorem, due to Mantel (1907), specifies how many edges are needed to 
guarantee the existence of a triangle in a simple graph. 


Theorem 1.7. Let G be a simple graph on n vertices and m edges, where m > n*/4. 
Then G contains a triangle. 


Proof (by contradiction). Suppose that G contains no triangle. Then adjacent ver- 
tices have disjoint sets of neighbours, so 


d(u)+d(v) <n, for each edge wv € E. 


Summing over all edges of G, we have 


> d(vy < mn. 
veV 
On the other hand, by Chebyshev’s inequality and Proposition 1.1, 
2 2 
Sd > Qroev d(v)) = 4m” 
veV 7 " 


These two inequalities imply that m < n7/4, contradicting the hypothesis. © 


Equality holds in the above inequalities only if G is regular of degree n/2. It 
follows readily that the extremal graphs in this case are the complete bipartite 
graphs K,,/7,,/2- Theorem 1.7 is a special case of Turén’s Theorem, the cornerstone 
of extremal graph theory. This topic is the subject of chapter 23 by Bollobds. 


1.6. Search trees 


Theorem 1.3, Corollary 1.6 and Theorem 1.7 assert only that certain types of 
paths and circuits may be found in graphs with appropriate properties. They do 
not address the question, of paramount importance in practical applications, of 
how to find such subgraphs. Of course, an exhaustive search of a finite graph 
can always, in theory, be completed in finite time. In practice, however, this is 
not good enough — time is of the essence; to be effective, an algorithm must be 
efficient. The running time of an algorithm is measured by the total number of basic 
operations (arithmetic operations, comparisons, and so on) required to implement 
it, and depends on the size and nature of the input. The complexity of a graph- 
theoretical algorithm is its running time, in the worst case, on graphs of a given 


Basic graph theory: Paths and circuits 13 


size (meaning, usually, their number of vertices and/or edges). A polynomial-time 
algorithm is one whose complexity is bounded above by a polynomial in the size of 
the input graph, an exponential-time algorithm one whose complexity is bounded 
below by an exponential function in the size of the input graph. Polynomial-time 
algorithms are regarded as efficient because their running time grows relatively 
slowly with the size of the input; this means that fairly large graphs (perhaps with 
several hundred vertices) can be handled effectively. Exponential-time algorithms, 
on the other hand, may take inordinately long to process graphs of very modest 
size (say, twenty vertices). 

Spanning trees provide an efficient way to search graphs, and are the key to many 
polynomial-time algorithms. A search tree in a connected graph is a spanning tree 
whose vertex set {uo,U1,..-,Un-1} and edge set {e1,€2,.-.,€n-1} are ordered in such 
a way that, for each j, 1 < j <-—1, the subgraph with vertex set {¥0,U1,--.,v;} 
and edge set {e,,@2,...,e;} is a tree; the vertex vo is the root of the search tree. 
In other words, a search tree is the growth process of a spanning tree, not just the 
tree itself. Starting at the root uo, the tree is grown by adding a vertex v,; and edge 
e; = uj;v; at each stage of the process; we call v; a child of u;, and vu; the parent 
of v;, in the tree. In general, there is some freedom as to the choice of u; and e;, 
but if they are chosen appropriately, important structural information about the 
graph is encoded in the resulting search tree. The most basic type of tree search, 
breadth-first search, was developed by Dijkstra (1959). 

A breadth-first search tree is a search tree with vertex set {vo,01,..-,Un—1} and 
edge set {e),€2,...,€,-1} where, for each j, 1 <j <n—1, uv; and e; are chosen in 
such a way that the parent of uv; is the vertex v; with smallest possible index i. Such 
a tree can be prown in linear time. The key property of breadth-first search is that 
the distance in a breadth-first search tree between its root and any other vertex 
is the same as their distance in the graph itself. Thus, by growing a breadth-first 
search tree from each vertex, shortest paths between all pairs of vertices in a graph 
can be found. In particular, the diameter can be determined. 

Breadth-first search also enables one to decide easily whether or not a graph 
is bipartite. Let T be a breadth-first search tree with root x in a graph G, which 
we assume to be connected. (If it is not, the procedure may be applied to each 
component in turn.) Let X be the set of vertices whose distance from x is even, 
and Y the set of vertices whose distance from x is odd. If no two vertices of X and 
no two vertices of Y are adjacent, (X, Y) is a bipartition of G and so G is bipartite. 
If, on the other hand, two of these vertices, wu and v, are adjacent, consider the 
ux-path P and vx-path Q in T. Let w denote the first vertex common to P and Q. 
Then P[u,w] and Q[v,w] are paths of the same parity and so, together with the 
edge uv, form an odd circuit of G. We conclude that G is not bipartite in this case, 
Since a circuit in a bipartite graph alternates between the sets of its bipartition and 
thus is necessarily even. 

The above algorithm outputs a bipartition if the input graph is bipartite and an 
odd circuit if it is not. Thus it supplies, as a byproduct, the following characteriza- 
tion of bipartite graphs, due to Konig (1916). 
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Theorem 1.8, A graph is bipartite if and only if it contains no odd circuit. 


If a graph is not bipartite, a shortest odd circuit can be found by growing a 
breadth-first search tree with root x from each vertex x. It is less easy to find 
a shortest even circuit, if there is one; an algorithm for doing so is described in 
section 4 (at the end of the discussion of stability number). 

The above examples illustrate the power of breadth-first search. Another highly 
effective tree search procedure, depth-first search, was developed by Tarjan (1972). 
A depth-first search tree is a search tree with vertex set {v9,v1,...,U,-1} and edge 
set {€,,€2,...,€,-1} where, for each j, 1 < j <n—1, uv; and e; are chosen in such 
a way that the parent of u; is the vertex vu; with largest possible index i. Depth-first 
search also takes linear time. It can be used to find the blocks and cut vertices of a 
graph, and has many other important applications. These are described in chapter 
28 by Grétschel, and in the book by Aho et al. (1974). 

In applications, the edges of a graph often have associated weights, or costs. An 
edge-weighted graph is a graph G together with a weight function w that assigns to 
each edge of G a weight w(e). Usually, the range of w is the set of nonnegative real 
numbers; in some instances, however, a different choice is appropriate: the set of 
all real numbers, the set of integers, or the set of integers modulo m, for example. 
The weight of an edge-weighted graph is the sum of the weights on its edges. An 
unweighted graph can be regarded as an edge-weighted graph in which each edge 
has weight one; its weight is then simply its size. In this way, results about graphs 
can sometimes be generalized to weighted graphs. Dijkstra (1959) showed, for 
instance, not only how to find shortest paths in graphs but, more generally, how 
to find minimum-weight paths in edge-weighted graphs with nonnegative weights. 

Most of the theorems to be mentioned in this chapter have proofs that can be 
adapted fairly easily to yield polynomial-time algorithms. However, there are many 
basic problems for which polynomial-time algorithms have yet to be found, and 
indeed might well not exist. In fact, determining which problems are solvable in 
polynomial time is one of the main preoccupations of theoretical computer scien- 
tists. An important class of problems in this regard is the class YY (standing for 
nondeterministic polynomial-time). We give here an informal definition; a precise 
treatment can be found in chapter 29, or in the book by Garey and Johnson (1979). 

A decision problem is a question whose answer is either “yes” or “no”; any 
object to which the question is addressed is an instance of the problem. A decision 
problem belongs to the class # if there is a polynomial-time algorithm that solves 
any instance of the problem in polynomial time. A decision problem belongs to 
the class W if, given any instance of the problem whose answer is ‘yes’, there 
is a certificate validating this fact which can be checked in polynomial time, and 
one, moreover, whose length is bounded above by a polynomial in the size of the 
input; such a certificate is said to be succinct. It follows from these definitions that 
P CN, since a polynomial-time algorithm is, in itself, a succinct certificate. 

The problem of determining whether a graph is bipartite belongs to ¥ and hence 
also to WP; likewise, the problem of determining whether a graph is nonbipartite 
belongs to both Y and VW. On the other hand, even though no polynomial-time 
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algorithm is known for determining whether a graph contains a spanning circuit, 
this problem belongs to YW, an appropriate certificate being the spanning circuit 
itself. Verifying that a graph contains no spanning circuit, by contrast, appears to 
be much harder; indeed, its status as regards membership in N¥ is unresolved. 

As noted above, ? C NY. Whether P = NF or not is the most fundamental 
open question in theoretical computer science. The so-called WP-complete prob- 
lems, which form a subclass of WY, are of much interest in this regard. These 
problems, which include the problem of determining whether a graph contains a 
spanning circuit, are algorithmically the most difficult in V9, in the sense that if 
any one of them can be solved in polynomial time, then so can every problem 
in NP (that is, P = NP). Several 4 P-complete problems will be encountered 
in this chapter. The 4 -complete problems belong to a larger class of seem- 
ingly intractable problems, the class of VW P-hard problems. The concept of WP- 
completeness was set on a sound theoretical footing by Cook (1971) and further 
developed by Karp (1972). 


1.7. Directed graphs 


A directed graph, or digraph, D is a graph G in which each edge is assigned a 
direction, one end being designated its tai! and the other its head. We call D an 
orientation of G, and write D := G.An edge with tail x and head y is denoted by 
the ordered pair (x,y); we say that x dominates y and write x — y. In diagrams, 
the direction of an edge is indicated by an arrow pointing towards the head of the 
edge. Two or more edges with the same tail and head are multiple edges. A strict 
digraph is one with neither loops nor multiple edges. 

The associated digraph D(G) of a graph G is the digraph obtained from G 
by replacing each edge by two oppositely directed edges with the same ends. 
An example is the strict complete digraph, the associated digraph D(K,) of the 
simple complete graph K,,. The underlying graph G(D) of a digraph D is the graph 
obtained from D by ignoring the orientations of its edges. An oriented graph is 
a digraph whose underlying graph is simple. A tournament is a digraph whose 
underlying graph is simple and complete. 

The adjacency matrix A(D) of a digraph D is the matrix (a,,) whose rows and 
columns are indexed by the vertices of D, a,, being the number of edges of D 
with tail wu and head v. 

Let v be a vertex of a digraph D. The outdegree d},(v) of v in D is the number 
of edges of D whose tail is v, the indegree d;,(v) the number of edges of D whose 
head is v. A sink is a vertex of outdegree zero, a source a vertex of indegree zero. 
The analogue for digraphs of Proposition 1.1 asserts that, for any digraph D, 


dod (v) = e(D) =} d*(v). 


vev veV 
A directed walk in a digraph D is a sequence W := (uo, e1,U1,---,€2,U¢), Where 
U0, U,,--.,U, are vertices of D, e;,...,e, are edges of D, and v;_; and vu; are the tail 


and head, respectively, of e;, 1 < i < 2. The vertices up and vu; are the tail and head 
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of W; when one wishes to refer to them explicitly, W is described as a (v9, v,)-walk. 
The notions of directed trail, directed path, and directed circuit are defined in an 
analogous manner, as are directed girth and directed diameter. A digraph is strong 
(or strongly connected) if any two vertices x and y are connected by an (x, y)-path. 
A strong component of a digraph D is a maximal strong subdigraph of D. 

Graphs can be regarded as a subclass of digraphs, a graph being identified with 
its associated digraph. This leads to a basic question, one that we shall examine 
with respect to paths and circuits: which properties of graphs extend to digraphs? 
Another fundamental question has to do with the relationship between a graph 
and its orientations: how are the properties of a graph reflected in the properties of 
its orientations? This latter question was studied by Chvatal and Thomassen (1978) 
with regard to girth and diameter. They established the existence of an integer- 
valued function f and an orientation function — such that, for any graph G and 
any edge e of G that belongs to a circuit of length at most k, the image @ ofe 
belongs to a directed circuit of length at most f(x) in G. This implies, in particular, 
that a connected graph with no cut edge has a strongly connected orientation, a 
result due to Robbins (1939). Chvdtal and Thomassen (1978) obtained a similar 
but sharper result for the diameter. 


Theorem 1.9. Let G be a graph of diameter d with no cut edge. Then G admits 
an orientation of directed diameter at most 2d? + 2d. 


A digraph is acyclic if it contains no directed circuit. An acyclic tournament is 
called a transitive tournament, the one of order n being denoted 7 7,,. The digraph 
D* obtained from D by contracting each strong component to a single vertex is 
an acyclic digraph, the condensation of D; if D is a tournament, D* is a transitive 
tournament. A strong component of D is called maximal if it has indegree zero in 
D* and minimal if it has outdegree zero in D*. 

An arborescence is an orientation of a tree in which one vertex, called the 
root, has indegree zero and every other vertex has indegree one. The analogue for 
digraphs of a breadth-first search tree is a breadth-first search arborescence. Shortest 
directed paths between pairs of vertices in a digraph can be found efficiently by 
growing such arborescences. In a strong digraph, a directed odd circuit can also be 
found in this way, if there is one. Consider a breadth-first search arborescence with 
root x. Let X be the set of vertices whose distance from x is even, and Y the set 
of vertices whose distance from x is odd. If some edge e joins two vertices in the 
same set, x is connected to the head y of e by directed paths of both parities. The 
concatenation of one of these paths with a (y, x)-path is a directed closed walk of 
odd length, which necessarily contains a directed odd circuit. On the other hand, 
if each edge has one end in X and the other end in Y, the digraph is bipartite. 
Thus a digraph has a directed odd circuit if and only if some strong component is 
not bipartite. 

This characterization leads directly to a theorem of Richardson (1953) on the 
existence of kernels in digraphs. A kernel in a digraph D is a set S of pairwise 
nonadjacent vertices that together dominate every vertex of V \ S. Many digraphs 
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fail to have kernels, the simplest being the directed odd circuits. What Richardson 
proved is that a digraph without a kernel necessarily contains a directed odd circuit. 


Theorem 1.10. Let D be a digraph with no directed odd circuit. Then D has a 
kernel. 


Proof. By induction on the order of D. If D is strong, then D is bipartite and 
each class of the bipartition is a kernel of D. If D is not strong, let D, be a maximal 
strong component of D, with vertex set V,. By the induction hypothesis, D, has 
a kernel, S,. Let Vz be the set of vertices of D dominated by vertices of §,;, and 
define D2 := D \ (V; U V2). Again by induction, D2 has a kernel, S). The set S$, U S, 
is akernelof D. O 


Kernels originated in the analysis of games on graphs, a kernel representing a 
set of winning positions. But they have other applications. We refer the reader to 
Berge (1985, chapter 14) for further information. 

A shortest directed odd circuit is a shortest directed closed odd walk, and such 
a walk, if one exists, can be found in polynomial time by taking odd powers of the 
adjacency matrix until a nonzero diagonal entry is encountered. Finding a directed 
even circuit appears to be a much harder proposition. Friedland (1989) gave a 
simple necessary and sufficient condition in terms of the adjacency matrix for a 
digraph to contain such a circuit. 


Theorem 1.11. Let D be a digraph, with adjacency matrix A. Then D has a directed 
even circuit if and only if 


per (1 +A) Adet (J +A), 


where per denotes the permanent and det the determinant. 


However, no polynomial-time algorithm has been developed to determine when 
this condition is satisfied. On the other hand, the problem of finding a directed 
even circuit in a digraph (posed by D.H. Younger - see Bermond and Thomassen 
1981) has not been proved W¥-complete. 


Problem 1.12. Is there a polynomial-time algorithm for finding a directed even 
circuit in a digraph? 


Thomassen (1993) has described such an algorithm for planar digraphs. Besides 
being a fundamental question, Problem 1.12 has an intriguing knife-edge quality: 
the slightly harder question of deciding whether there is a directed odd (or even) 
circuit through a given edge in a digraph is already W%-complete (Thomassen 
1985a). 

Problem 1.12 turns out to be equivalent to a question about a special class of 
digraphs considered by Seymour and Thomassen (1987). To subdivide an edge 
€ := xy in a graph is to replace e by a path P := xvy, where v is a new vertex. A 
subdivision of a graph is any graph derivable from it by recursively subdividing 
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Figure 2. The Koh digraph. 


edges. Analogous definitions apply to digraphs, the path P being a directed path 
in this case. Seymour and Thomassen (1987) characterized the digraphs with the 
property that every subdivision contains a directed even circuit. This characteriza- 
tion does not, however, lead to a polynomial-time recognition algorithm. Indeed, 
Seymour and Thomassen showed that the existence of such an algorithm is equiv- 
alent to the existence of a polynomial-time algorithm for finding a directed even 
circuit in a digraph. 

Thomassen (1992) found a sufficient condition in terms of the minimum indegree 
and outdegree for the existence of a directed even circuit in a strong digraph. 


Theorem 1.13. Let D be a strict strong digraph of minimum indegree 5- and 
minimum outdegree 5*, where &- > 3 and 5* > 3. Then D contains a directed even 
circuit. 


The bound on the degrees in Theorem 1.13 is sharp. Figure 2 shows a strong 
2-diregular digraph on seven vertices without directed even circuits, found by Koh 
(1976). 


1.8. Hypergraphs 


A hypergraph H consists of a set V(H) of elements, called vertices, and a family 
E(#) of subsets of V(H), called hyperedges (or simply edges, when there is no 
ambiguity). An example is the Fano hypergraph, shown in fig. 3, which has as 
vertices and edges the seven points and seven lines of the Fano plane, the finite 
projective plane PG(2, 2) (see, for example, Biggs and White 1979). 

Note that the seven 3-subsets defined by the vertices of the Koh digraph (fig. 2) 
and their outneighbours constitute the edges of the Fano hypergraph; this is an 
instance of a connection established by Seymour (1974) between strong digraphs 
without directed even circuits and a certain class of hypergraphs of which the Fano 
hypergraph is prototypical. 

A vertex v and edge e of a hypergraph are incident if v € e. The-incidence graph 
of a hypergraph H is the bipartite graph with bipartition (V(H), E(H)) in which 


ae ee 
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v= {1,2,3,4,5,6,7} 


E = {124,235,346,457, 
561,672,713} 


5 4 4 


Figure 3. The Fano hypergraph. 


Figure 4. The Heawood graph. 


v € V(H) and e¢ E(H) are adjacent if and only if they are incident in H. The 
incidence graph of the Fano hypergraph is a 3-regular graph on fourteen vertices 
known as the Heawood graph, shown in fig. 4. 

With each hypergraph, we may thus associate a bipartite graph. Conversely, to 
each bipartite graph G(X, Y), there corresponds a hypergraph H whose vertex 
set is X and whose hyperedges are the sets of neighbours of the vertices of Y. 
In this way, we can recover a hypergraph from its incidence graph, so the two 
structures are, in a sense, one and the same. Many other hypergraphs of interest 
can be associated with a graph G. We have already encountered one, the cycle 
space €(G), which may be regarded as the hypergraph whose hyperedges are 
the edge sets of the even subgraphs of G. Retaining only the minimal non-null 
elements of €(G), we obtain the hypergraph whose hyperedges are the edge sets 
of the circuits of G. This hypergraph has two special properties: (a) no proper 
subset of a hyperedge is a hyperedge, and (b) if C, and C, are hyperedges and e 
is an element of C; MC, then (C; U C2) \e contains a hyperedge. A hypergraph 
with property (a) is termed a clutter, one with properties (a) and (b) a matroid. 
The matroid defined by the circuits of a graph G is called the cycle matroid of G. 
Although they are a substantial abstraction of graphs, matroids capture perfectly 
certain graph-theoretical concepts and properties; indeed, they are the appropriate 
framework for the study of graph minors and for the treatment of a number of 
Significant questions about circuits, forests and spanning trees. They are also of 
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fundamental importance in the related field of combinatorial optimization. Their 
properties are discussed in chapters 9-11 and, more fully, in the book by Welsh 
(1976). Hypergraphs appear in various guises throughout combinatorics. They are 
treated in several chapters of this handbook, in particular in chapter 7, as well as 
in the book by Berge (1989). 


2. Hamilton paths and circuits in graphs 


2.1. Dirac’s theorem 


For a simple graph G, the minimum degree 6 is a parameter of some importance. 
The larger it is, as a function of n, the more likely it is that G contains a particular 
subgraph. For instance, Theorem 1.3(ii) implies that if 6 > 2 then G contains a 
circuit, and it follows from Proposition 1.1 and Theorem 1.7 that if 6 > 7/2 then 
G contains a triangle; yet another illustration is provided by Theorem 1.13. The 
theorem that initiated the study of long circuits (Theorem 2.1 below) is of this 
type. 

A path or circuit that includes every vertex of the graph is called a Hamilton path 
or circuit. A hamiltonian graph is one that contains a Hamilton circuit, a traceable 
graph one that contains a Hamilton path. As we noted in section 1, the problem 
of determining whether a graph contains a Hamilton circuit is ’9%-complete. The 
more general problem of finding a minimum-weight Hamilton circuit in an edge- 
weighted complete graph is also W#-complete. Known as the Travelling Salesman 
Problem, this latter question has been the subject of much study. Its many aspects 
are surveyed in chapters 28 and 35, and in the book edited by Lawler et al. (1985). 
Whether it is W9-hard to determine the parity of the number of Hamilton circuits, 
a question raised by K.A. Berman (personal communication 1986b), is not known. 

Dirac (1952) determined how large the minimum degree must be to guarantee 
the existence of a Hamilton circuit. 


Theorem 2.1 (Dirac’s Theorem). Let G be a simple graph of minimum degree 5 
on n vertices, where 6 > n/2 and n > 3. Then G contains a Hamilton circuit. 


We present three proofs of this fundamental result. All are based on an ob- 
servation that we shall refer to as the Lollipop Lemma (2.2). An (x, y)-dollipop 
is a graph CUP, where C is a circuit and P an xy-path such that CN P = {y}; 
lollipops have also been called loops and lassos. 

Let C be a circuit in a graph G, with a specified orientation, and let x be a 
vertex of C. The predecessor of x on C is denoted by x~, the successor of x on C 
by x*. This notation is extended to a set S of vertices of C in the natural way: 


So = {x |x€S} and St := {x*|x € 5S}. 


Lemma 2.2 (The Lollipop Lemma). Let CU P be an (x, y)-lollipop in a graph G, 
where C is a circuit of length £ and P a path of length m. Then G contains the xy~- 
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Figure 5S. Two paths in a lollipop. 


path PCly,y7] and the xy*-path PC [y,y*], both of length £+m—1. In particular, 
if C is a longest circuit of G and P is nontrivial, x is nonadjacent to both y~ and 
vs 

The Lollipop Lemma (2.2) is illustrated in fig. 5. Simple though it is, this lemma 
is the basis of many theorems on paths and circuits. We shall appeal to it frequently, 
often without explicit reference. 

The first proof that we give of Dirac’s Theorem (2.1) is essentially the one given 
by Dirac himself. 


Proof 1 (by choosing a longest circuit). Let C be a longest circuit in G, of length 
é. By Theorem 1.3(ii), 2 > 6+1. If C is a Hamilton circuit, there is nothing to 
prove, so suppose that G\ C #0. Let C UP be an (x, y)-lollipop, where P is as 
long as possible. Let m be the length of P. Note that since 


8>n—5>n-L=v(G\O), 


each vertex of G\C is adjacent to some vertex of C, and so m > 1. Each vertex 
of P is a potential neighbour of x. On the other hand, no vertex of C \ y whose 
distance from y along C is m or less can be adjacent to x; and of the remaining 
vertices of C, no two consecutive ones can be adjacent to x. We deduce that 


£—2m _ 


d(x) < 
(x) m+—, 


£ oon 
a 
a contradiction. oO 
The second proof of Dirac’s Theorem (2.1) is due to Newman (1958). 


Proof 2 (by choosing a longest path). Let P be a longest path in G, of length 2. 
We first show that G has a circuit of length @ +1. If the ends, x and y, of P are 
adjacent, C := P +xy is such a circuit. So we may assume that x and y are not 
adjacent. By the choice of P, all the neighbours of x and y lie on P. Set 


P:= UU, ++ Us, 
where up := x and uz := y, and define 


X := {u;: Uovis, € E}, Y := {v;: oye € E}. 
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We shall prove that XM Y #9. Since & > n/2, 
|X|+|¥| = d(vo) + d(vy) > 28 > n, 
and, since y ¢ XUY, 
IXUY|<é<n-1. 
Therefore 
|XNY|=|X|+|¥|-|XUY| 21. 
Let vu, E XN Y. Then 


_ 
C := Plug, vg|oyv¢ P [ve, Vis1]¥x41¥0 


is a circuit of length 2 + 1. 

We now show that C is a Hamilton circuit of G. If not, let CUP be a lollipop, 
where P has length m and, as in Proof 1, m > 1. By the Lollipop Lemma (2.2), 
G contains a path of length @ + m, contradicting the choice of P. Therefore C is 
indeed a Hamilton circuit of G. O 


The third proof that we give of Dirac’s Theorem, due to Bondy (1981), incor- 
porates elements of each of the other two. 


Proof 3. Let C be a longest circuit in G. If C is a Hamilton circuit, there is 


nothing to prove, so suppose that G\ C #9 and let CUP be an (x, y)-lollipop. 
Set 


VY = V(C), V2 = V(P)\y, V3 — V(G)\ V(CUP), 


and let d;(x) and d;(y*) denote the numbers of neighbours of x and y*, respectively, 
in V;, i=1,2,3. Since C is a longest circuit, we deduce that 
(i) x and y* cannot be adjacent to consecutive vertices v and v*, respectively, 
on C; 
(ii) y* cannot be adjacent to any vertex of V2; 
(iii) x and y* cannot be adjacent to a common vertex of V3. 


d(x) + dy (y") < Vi], 
by (ii), 

a,(y*) =0, 
and, by (iii), 

ds(x) + ds(y*) < |Vsl. 
Since 


d,(x) <[V2|—1, 
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we have 


3 
28 < d(x) +d(y*) = D7 (di(x) + d)(y*)) < [Vil + [Vol + [V3] —1 = —-1, 


i=1 
contradicting the hypothesis. Thus C is indeed a Hamilton circuit of G. 


No inductive proof of Dirac’s Theorem (2.1) is known. A false one was concocted 
by Woodall (1975) in order to illustrate the potential pitfalls of this fundamental 
technique. 

By the simple idea of transforming one problem into another, further results 
can be derived very easily. We give one small illustration; others can be found in 
chapter 29 by Shmoys and Tardos. 


Corollary 2.3. Let G be a simple graph of minimum degree & on n vertices, where 
5 > (n—1)/2. Then G contains a Hamilton path. 


Proof. For n=1, the result is trivial. For n > 2, we form the graph H := G+u. 
This graph H has n + 1 vertices and minimum degree at least ( + 1)/2. By Dirac’s 
Theorem (2.1), H contains a Hamilton circuit C. The subgraph C \ v is a Hamilton 
pathofG. @O 


2.2. Generalizations of Dirac’s theorem 


Dirac’s Theorem (2.1) is important because it is amenable to generalization in a 
great variety of ways. To start with, the hypothesis can be weakened. Proof 2 does 
not use the full strength of the hypothesis 6 > 1/2, but just the weaker condition 
that the ends x and y of a longest path satisfy the inequality d(x) +d(y) 2 n. 
Moreover, this latter condition is applied only to nonadjacent vertices x and y. 
Thus the very same proof yields the following stronger theorem, due to Ore (1960). 


Theorem 2.4 (Ore’s Theorem). Let G be a simple graph on n vertices, where n > 
3, in which the degree sum of any two nonadjacent vertices is at least n. Then G 
contains a Hamilton circuit. 


In fact, if one examines the proof more carefully still, one sees that it is based 
on the following observation. 


Proposition 2.5. Let G be a simple graph on n vertices, where n > 3, and let x and 


y be nonadjacent vertices of G with degree sum at least n. If G+ xy is hamiltonian, 
then so is G. 


This observation leads naturally to the definition of a closure operation on 
graphs. The closure of a graph G on n vertices is the graph derived from G by 
recursively joining pairs of nonadjacent vertices having degree sum at least n. 


Repeated application of Proposition 2.5 yields the following result of Bondy and — 
Chvatal (1976). 
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Lemma 2.6 (The Closure Lemma). A simple graph G is hamiltonian if and only 
if its closure is hamiltonian. 


In particular, a graph is hamiltonian if its closure is complete: the graphs sat- 
isfying the hypothesis of Dirac’s Theorem (2.1) are hamiltonian for this reason. 
Chvétal (1972) found a wider class of graphs defined in terms of their degree se- 
quences (the widest, in a certain sense) whose closures are complete and which 
are consequently hamiltonian. 


Theorem 2.7. Let G be a simple graph on n vertices with degree sequence 
(d;,d2,...,d,), where n>3 and d| < d) <--:<dy,. If there is no value of k < 
(n —1)/2 such that dy, < k and d,_, <n -—k, then G contains a Hamilton circuit. 


Proof. We shall show that the closure of G is complete; the conclusion will then 
follow directly from the Closure Lemma (2.6). Suppose, to the contrary, that this 
is not so. Let H be the closure of G. Then the degree sum of any two nonadjacent 
vertices of H is at most n—1. Let x and y be two such vertices of H whose 
degree sum is maximum. Without loss of generality, we may assume that dj, (x) < 
d(y). Set dy(x) := k. Then k < (n—1)/2 and dy(y) <n—k—-1. Denote by X 
the set of vertices nonadjacent to y in H. Then |X| =n—1-—-dy(y) 2k. Also, by 
the choice of x and y, do({v) < dy(v) <k for all v € X. Similarly, if Y denotes 
the set of vertices that are nonadjacent to x in H, we have |Y|=n-—k—-1 and 
d;;(v) < dy(v) <n —k —1 for all v € Y. Thus G has at least k vertices of degree 
at most & (namely those in the set X) and at least n — k vertices of degree at most 
n—k —1 (namely those in the set Y U {x}). This contradicts the hypothesis on G. 
We deduce that the closure of G is indeed complete, as claimed. 0 


Tian and Shi (1986) observed that the Closure Lemma (2.6) can also be used to 
give an easy proof of a generalization of Ore’s Theorem (2.4) due to Fan (1984): 
if Gis a simple graph on n vertices, where n > 3, in which no two vertices of degree 
less than n/2 have a common neighbour, then G contains a Hamilton circuit. 

Shi (1992) and Bollobds and Brightwell (1993) gave the following elegant gen- 
eralization of Dirac’s Theorem (2.1). 


Theorem 2.8, Let G be a simple graph on n vertices, and let S denote the set of 
vertices of degree at least n/2 in G. If |S| > 3, G has a circuit that includes every 
vertex of S. 


Proof. Let C be a circuit that includes as many vertices of S as possible. Suppose 
that $\ V(C) 4@, and let CUP be an (x, y)-lollipop, where x € S. Note that C 
includes at least two vertices of S because any two nonadjacent vertices of S lie 
in a 4-circuit of G. Let z be the first vertex of S following y on C. Then, as in 
Proof 3 of Dirac’s Theorem (2.1), the choice of C implies that d(x) + d(z) <n —1, 
a contradiction. O 


Another way in which Dirac’s Theorem (2.1) can be strengthened is by refining 
its conclusion. Corollary 2.10 below is such a result. A graph on a vertices is 
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pancyciic if it contains circuits of every length 2,3 < & <n. In particular, a pancyclic 
graph is hamiltonian. The following theorem of Bondy (1971a) gives a condition 
under which the converse holds. The proof presented here is due to C. Thomassen 
(see Bollobds 1978). 


Theorem 2.9. Let G be a simple hamiltonian graph on n vertices and at least n*/4 
edges. Then either G is pancyclic or n is even and G = Kyj2n/2- 


Proof (by induction). There is nothing to prove when 7 = 3, so assume that n > 4. 
Suppose, first, that G has a circuit of length n — 1, 


C 1 Uy U2 °° + Un__1U}. 
Let v:= V(G) \ V(C). If d(v) < (n — 1)/2, then 


n n—-1_ (n-1/ 
y emai Sai eee as 
By the induction hypothesis, G \ v is pancyclic. Since G is hamiltonian, by hypoth- 
esis, G is pancyclic too. If d(v) > (m — 1)/2, there exists for each £, 3 < é <n, an 
index i such that both vu; and vu,,,_2 are edges of G, and G contains the circuit 
vu; C[v;, 0;,¢-2]¥j4¢_20, Of length 2. Thus G is pancyclic in this case too. 

Suppose, now, that G has no circuit of length n — 1. Let v,v2---v,v, be a Hamil- 
ton circuit of G. Then, for all i, j such that 1 <i, j <n and j i — 1,i, at most one 
of the pairs uju; and v;,;0;,2 can be an edge of G. Thus 


d(v;) + d(¥}41) <n. 


e(G \v) =e(G) — d(v) 2 


Summing over all i, 1 <i <7, we have 


4e(G) =2 Salo.) <n’, 


i=1 
whence 
nt 
a" 
Equality holds only if n is even and, for all i, j such that 1 < i, j < mn and j #i —1,i, 


exactly one of the pairs u,v; and v;,10;.2 is an edge of G. It follows easily that 
G=K,/2,,/2 in this case. O 


e(G)< 


Corollary 2.10. Let G be a simple graph of minimum degree 3 on n vertices, where 
5 2n/2. Then either G is pancyclic or n is even and G = Ky) n/2- 


Corollary 2.10 was refined substantially by Amar et al. (1991), who proved that 
a simple hamiltonian graph of minimum degree 6 on n vertices, where 5 > 2n/5 
and n is sufficiently large, is either pancyclic or bipartite. Nash-Williams (1971a) 
Strengthened the conclusion of Dirac’s Theorem (2.1) in a very different manner. 
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Theorem 2.11. Let G be a simple graph of minimum degree 5 on n vertices, where 
& pn/2 and n>3. Then G contains at least |cn| edge-disjoint Hamilton circuits, 
where c = 5/224 = 0.0223. 


Weakening its hypothesis and strengthening its conclusion are two ways to gen- 
eralize a theorem. A more common way is simply to extend the domain of the 
hypothesis, including the theorem as a special case. There are many such general- 
izations of Dirac’s Theorem (2.1). We begin with one due to Pésa (1963). A path 
system is a graph each of whose components is a path. 


Theorem 2.12. Let s be a nonnegative integer, G a simple graph of minimum 
degree & on n vertices, where 5 > (n+s)/2 and n > 3, and S a set of s edges of G 
that induce a path system. Then G contains a Hamilton circuit that includes every 
edge of S. 


Proof (by considering a maximal counterexample). Suppose the theorem is false 
for some value of n, and let G be a maximal graph on n vertices satisfying the 
hypotheses but not the conclusion (for some set S of edges). Since a complete 
graph clearly satisfies the conclusion (for any set S), G is not complete. Let x and 
y be two nonadjacent vertices of G. Then G + xy also satisfies the hypotheses of the 
theorem (for the set 5). By the maximality of G, there is a Hamilton circuit C in 
G+xy that includes S. Because G does not contain such a circuit, xy € E(C). Let 
P :=C\xy. Then P is a Hamilton path in G that connects x and y and includes 
S. Let P := vjb2---v,, where v) := x and vu, := y. A counting argument similar to 
the one used in Proof 2 of Dirac’s Theorem (2.1) shows that, for some kK1<k< 
n—1, xox € E, yy € E and u,zv,,, ¢ S. But now P[v,,vgjugvn P fon, Ves Pes1¥1 
is a Hamilton circuit in G that includes S$, contradicting our initial supposition and 
establishing the theorem. 0 


Note that the above proof, restricted to the case s = 0, closely resembles Proof 2 
of Dirac’s Theorem (2.1) but is more succinct. In the case s = 1, a slightly stronger 
conclusion can be deduced. A graph is Hamilton-connected if any two of its vertices 
are connected by a Hamilton path. Implicit in the work of Erdés and Gallai (1959, 
Lemma 1.6) is the following theorem. . 


Corollary 2.13. Let G be a simple graph of minimum degree 5 on n vertices, where 
6 2 (n+1)/2 and n 23. Then G is Hamilton-connected. 


Proof. That any two adjacent vertices in G are connected by a Hamilton path 
follows directly from Theorem 2.12. For a nonadjacent pair, it suffices to add an 
edge e joining them and apply Theorem 2.12 to the graph G + e and the set S := {e}. 

0 


A Hamilton circuit is one that is as long as possible, passing through each vertex 
of the graph. Dirac’s Theorem (2.1) has been generalized to results asserting the 
existence of long (but not necessarily Hamilton) circuits. We mention two instances, 
the first due to Egawa and Miyamoto (1989) and Bollobds and Brightwell (1993), 
the second to Dirac himself (1952). 


Basic graph theory: Paths and circuits 27 


Theorem 2.14. Let k be a positive integer and G a simple graph of minimum 
degree 5 on n vertices, where 6 > n/(k +1) and n > 3. Then G contains a circuit of 
length at least n/k. 


Theorem 2.15. Let G be a simple nonseparable graph of minimum degree 5 on 
n vertices, where n > 3. Then G contains either a circuit of length at least 26 or a 
Hamilton circuit. : 


Strictly speaking, Theorem 2.15 is not a generalization of Dirac’s Theorem (2.1) 
because of the additional requirement that G be nonseparable. However, the latter 
property is an easy consequence of the hypothesis of Dirac’s Theorem (2.1). Thus 
we may indeed regard the theorem as a generalization. A similar comment applies 
to the following result of Bondy and Nash-Williams (see Nash-Williams 1971a, p. 
159), where the degree condition of Dirac’s Theorem (2.1) is relaxed considerably. 
This result is a useful adjunct to the Hopping Lemma (6.7), a powerful proof 
technique developed by Woodall (1973), and will be employed in section 6. 


Theorem 2.16. Let G be a simple nonseparable graph of minimum degree 6 on n 
vertices, where 5 > (n+2)/3 and n > 3, and let C be a longest circuit of G. Then 
no two vertices of G\ C are adjacent. 


Theorems 2.15 and 2.16 can both be derived by the longest-circuit technique, 
the strategy being to examine the structure of the components of G\C, where 
C is a longest circuit. A proof of Theorem 2.15 will be given in section 4 (see 
Theorem 4.11). 

A Hamilton circuit can also be viewed as a spanning walk that is as short as 
possible, passing through each vertex just once. Jackson and Wormald (1990a) 
and Pruesse (1990) generalized Dirac’s Theorem (2.1) to spanning walks that pass 
through each vertex no more than k times; such walks are called k-walks. 


Theorem 2.17. Let k be a positive integer and G a simple connected graph of 
minimum degree & on n vertices, where 5 > n/(k +1) and n 23. Then G has a 
k-walk. 


A closely related concept is that of a k-tree, a spanning tree in which each vertex 
is of degree at most k; 2-trees are thus Hamilton paths. It is easy to see that a 
k-tree admits a k-walk in which each edge of the tree is traversed twice. Jackson 
and Wormald (1990a) showed, conversely, that a graph which admits a k-walk 
necessarily contains a (k+1)-tree. Theorem 2.17 thus implies that, for k 22, a 
simple connected graph of minimum degree 5 on n vertices, where 5 > n/k and 


: 2 3, has a k-tree. Win (1975) obtained a slightly sharper bound on the minimum 
egree. 


Theorem 2.18. Let k be an integer, k 22, and G a simple connected graph of 
minimum degree 5 on n vertices, where 6 > (n —1)/k. Then G contains a k-tree. 


Finally, we turn to digraphs and another theorem of Nash-Williams (1969). 
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Theorem 2,19. Let D be a strict digraph on n vertices with minimum indegree 5~ 
and minimum outdegree &*, where min{8~ ,5*} > n/2. Then D contains a directed 
Hamilton circuit. 


This theorem, too, implies Dirac’s Theorem (2.1), since if a graph G satisfies 
the hypothesis of Dirac’s Theorem (2.1), its associated digraph D(G) satisfies the 
hypothesis of Theorem 2.19, We defer a proof to section 3 (see Meyniel’s Theorem, 
3.11), observing only that the proof techniques we have described thus far fail for 
digraphs. 

The above examples serve to illustrate not only the central role of Dirac’s 
Theorem (2.1) in the study of circuits in graphs but also, more importantly, the 
process of generalization in mathematics. They can be thought of as basic ele- 
ments in an ever-growing structure of generalizations of Dirac’s Theorem (2.1). 
And, just as they generalize Dirac’s Theorem (2.1), so they too are susceptible to 
generalization; indeed, Corollary 2.3 (same proof), Corollary 2.10 (similar proof), 
Theorem 2.12 (same proof), Corollary 2.13 (see Ore 1963), Theorem 2.14 (see 
Egawa and Miyamoto 1989, or Bollobdés and Brightwell, 1993), Theorem 2.15 
(Theorem 4.11), Theorem 2.16 (Theorem 4.23, case k = 2), Theorem 2.17 (Theo- 
rem 4.26), Theorem 2.18 (Theorem 4.27) and Theorem 2.19 (Meyniel’s Theorem, 
3.11) all admit generalizations of Ore type. 

Many other generalizations of Dirac’s Theorem (2.1) can be found in the litera- 
ture; we shall encounter some in later sections. Surveys on hamiltonian graphs ap- 
pear periodically. Those by Nash- Williams (1975), Lesniak-Foster (1977), Bermond 
(1978b), Chvatal (1985) and Gould (1991) represent a variety of viewpoints on the 
topic. 


3. Hamilton paths and circuits in digraphs 


3.1. Rédei's theorem 


In this section, we consider Hamilton paths and circuits in digraphs. The central 
result here, playing the analogous role to Dirac’s Theorem (2.1), is a theorem on 
tournaments due to Rédei (1934). 


Theorem 3.1 (Rédei’s Theorem). Every tournament contains a directed Hamilton 
path, 


This theorem is an immediate consequence of the following lemma, the basic 
idea of which is due to D. Kénig and P. Veress (see Rédei 1934). 


Lemma 3.2. Let D be a strict digraph, P an (x,y)-path in D of length €, and 
v€ V(D)\ V(P). Then 
(i) dp(v) < £ +2 if there is no (x,y)-path P’ in D such that V(P') = V(P)U {v}; 
(ii) dp(v) < 2 if P is a longest directed path in D. 
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Proof. (i) Let P := (vp,v1,...,U¢), Where up := x and v; := y, and set 
X:={v, EV(P)\vo: (v,u;) € E}, 
Y :={v; E V(P)\vo: (v;-1,v) € E}. 

Then 
XUY C {vj,02,...,ue} and XNY =9, 


for if vu, EC X NY, the (x, y)-path P’ := (vo, U1,..-,Uj~1,U,U;,--.,0¢) Contradicts the 
hypothesis on P. Thus 


dp(v) <|X|+|Y|+2=|XUY|+|XNY|4+2< 242, 


(ii) If P is a longest directed path in D, neither (v, vp) nor (uz,v) can be an edge 
of D, and the above argument yields 


dp(v) <|X|+|Y/<4 0 


Proof of Theorem 3.1. Let T be a tournament and P a longest directed path in 
T, of length é. If P is not a Hamilton path of 7, let v € V(T)\ V(P). Since T 
is a tournament, dp(v) = +1. On the other hand, by Lemma 3.2, dp(v) < é, a 
contradiction. Thus P is, indeed, a Hamilton path of T. oO 


3.2. Generalizations of Rédei’s theorem 


Rédei’s Theorem (3.1), like Dirac’s Theorem (2.1), admits a number of interesting 
generalizations. Thomassen (1982), for instance, refined it by imposing a rather 
natural degree condition on the ends of the Hamilton path. 


Theorem 3.3. Every tournament contains a directed Hamilton path whose tail is a 
vertex of maximum outdegree and whose head is a vertex of maximum indegree. 


Another generalization is a theorem of Rédei (1934) on the parity of the number 
of directed Hamilton paths; it was from this result that Rédei’s Theorem (3.1) was 
originally deduced. 


Theorem 3.4. Every tournament contains an odd number of directed Hamilton 
paths. 


Theorem 3.4 is established by means of a proof technique known as inclusion—- 
exclusion, an inversion formula with applications throughout mathematics. The 
following version of the formula suffices for our purpose; more general versions 
ate described in chapter 21 by Gessel and Stanley. We omit the proof. 


Formula 3.5 (Inclusion-Exclusion). Let Z be a finite set and f:22 +R a real- 
valued function defined on the subsets of Z. Define the function g:2 — R by 


g(X)= D> FY). 


XCYCZ 
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Then 
f(xy= YD (Haley). 
XCYCZ 


If D is a strict digraph with vertex set V := {1,2,...,n} and edge set E, and if 
ar is a permutation of V, the subset E, of E defined by 


E,:=En{(a(i), 7(i+1)): 1=1,2,...,n-1} 
induces a digraph each of whose components is a directed path. Such a digraph 
we call a directed-path system. 


Proposition 3.6. Let D be a strict digraph with vertex set V := {1,2,...,n} and 
edge set E. Denote by II the set of permutations of V and by # the set of directed 
Hamilton paths of D. For each subset X of E, define 


f(X):=|{we Tl: X=E,}| and h(X):=|{He #: XC E(H)}|. 
Then 
f(X) = A(X) (mod 2). 
Proof. For each subset X of E, define g(X) :=|{7¢ 7: X C E,}|. Then 
g(X)= >> Ff), 
XCYCE 
and so, by the Inclusion—Exclusion Formula (3.5), 
FX) = SD C1 gcy), 
XCYCE 


Observe that g(Y) =r! if and only if the spanning subdigraph of D with edge set 
Y is a directed-path system with r components. Thus g(Y) is odd if and only if Y 
induces a directed Hamilton path H of D, and 


f(X)= SS (y= a(x) (mod 2). 


{Hex: XCE(H)} 


Proof of Theorem 3.4. The theorem is valid for transitive tournaments, each of 
which has precisely one directed Hamilton path. Since any tournament T on n 
vertices can be derived from the transitive tournament 77, by reorienting appro- 
priate edges, it suffices to prove that the parity of A(T), the number of directed 
Hamilton paths in T, is unaltered by the reorientation of any one edge e. Taking 
X := {e} in Proposition 3.6, we have 


f(e) = h(e) (mod 2). 
Thus, if T’ denotes the tournament derived from T by reorienting e, 
A(T’) = h(T) + fe) —h(e) = A(T) (mod 2). so 


WA eee 
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A third generalization of Rédei’s Theorem (3.1) by Thomason (1986), asserting 
the existence in all sufficiently large tournaments of oriented Hamilton paths of 
every type (not just directed Hamilton paths). The following definition makes this 
notion precise. 

‘Let D be a digraph, and let P := uge,v, ---e,v, be a (not necessarily directed) 
path in D. The type of P is the sequence (€1,0,.--,&), where 


Jc if e; has head u;_,, 
fi) 4 if e; has head y%. 


The type of a circuit vie) ---vgéev; is defined similarly. A directed path or circuit 
is thus one of type (+,+, ...,+). 


Theorem 3.7. Let T be a tournament on n vertices, where n > 2'°8. Then T con- 
tains a Hamilton circuit of each type except, possibly, the directed Hamilton circuit. 
In particular, T contains a Hamilton path of each type. 


An analogous generalization of Theorem 3.4 was obtained by Forcade (1973), 
who discovered the remarkable fact that the parity of the number of Hamilton 
paths of any type in a tournament is independent of the tournament, depending 
only on the path type. Perhaps more surprising still is the following beautiful 
corollary of Forcade’s theorem. 


Theorem 3.8. Let T be a tournament of order 2”. Then T contains an odd number 
of Hamilton paths of each type. 


Excluding the directed Hamilton circuit in Theorem 3.7 is essential; the tran- 
sitive tournament, for instance, contains no directed circuits whatsoever. Indeed, 
an obvious necessary condition for the existence of a directed Hamilton circuit in 
a digraph is that it be strong. Camion (1959) showed that for tournaments this 
condition is also sufficient. 


Theorem 3.9 (Camion’s Theorem). Every strong tournament contains a directed 
Hamilton circuit. 


Camion’s Theorem (3.9) is yet another generalization of Rédei’s Theorem (3.1). 
To see this, one can use a construction similar to that employed to deduce Corol- 
lary 2.3 from Dirac’s Theorem (2.1). Alternatively, one can apply Camion’s Theo- 
rem (3.9) to the strong components of the tournament, thereby obtaining a directed 
Hamilton path in each, and then piece these paths together to form a directed 
Hamilton path in the entire tournament. 

Camion’s Theorem (3.9) is proved by means of a longest-circuit technique. The 
following simple lemma is the basis of this and other theorems on directed circuits 
in strong digraphs. Let D be a digraph, C a subgraph of D, and X and Y sets of 
vertices of D. A C-bypass is a directed path of length at least two whose head and 
tail lie in C and whose internal vertices lie in D \ C. An (X, Y)-path is a directed 
path whose tail lies in X and whose head lies in Y. 
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Lemma 3.10 (The Bypass Lemma). Let D be a strong nonseparabie digraph, and 
let C be a nontrivial proper subgraph of D. Then D contains a C-bypass. 


Proof. Since D is nonseparable, there is a path in D of length at least two whose 
tail x and head y belong to C and whose internal vertices belong to D \ C. Among 
all such paths, we choose one in which the number of edges directed the wrong 
way (that is, towards x) is as small as possible. We shall show that this path P is a 
C-bypass. Assume the contrary, and let (u,v) be an edge of P directed towards x. 
Since D is strong, there exist in D a (C,u)-path P,, and a (v,C)-path P,. It cannot 
be the case that the tail of P, is y and the head of P, is x, since then the (y, x)-walk 
P,,(u,v)P, would contain a C-bypass, contradicting the choice of P. Without loss 
of generality, therefore, we may assume that the tail of P, is z, where z # y. But 
now the zy-walk P,,P[u,y] contains a zy-path that contradicts the choice of P. 
Thus P is, indeed, a C-bypass. O 


Proof of Theorem 3.9. Let T be a strong tournament and C a longest directed 
circuit in T. If C is not a Hamilton circuit, T contains a C-bypass by the Bypass 
Lemma (3.10). We choose such a path P that the distance along C from its tail x 
to its head y is as small as possible. By the choice of C, this distance, the length of 
C[x, y], is no less than the length of P. Let u and v be internal vertices of C[x, y] 
and P, respectively. Since T is a tournament, one of these vertices dominates the 
other. If u dominates uv, the (u,y)-path (u,v)P[v,y] contradicts the choice of P. 
If, on the other hand, v dominates u, the (x,u)-path P[x,v](v, u) contradicts the 
choice of P. We conclude that C is, indeed, a directed Hamilton circuit of T. O 


Moon (1968, p. 6) strengthened the conclusion of Camion’s Theorem (3.9) by 
showing that every strong tournament is vertex pancyclic: each vertex lies in a 
directed circuit of every length. This can be proved easily using the Bypass Lemma 
(3.10) (see Bondy 1977). The Bypass Lemma (3.10) is also effective in establishing 
a far-reaching simultaneous generalization of Ore’s Theorem (2.4), Theorem 2.19 
and Camion’s Theorem (3.9), due to Meyniel (1973). The proof we present is by 
Bondy and Thomassen (1977). : 


Theorem 3.11 (Meyniel’s Theorem). Let D be a strict strong digraph on n vertices, 
where n > 2, in which the degree sum of any two nonadjacent vertices is at least 
2n —1. Then D contains a directed Hamilton circuit. 


Proof. The degree condition implies that D is nonseparable. Let C := (v1,v2,..., 
U¢,U1) be a longest directed circuit in D. If C is not a Hamilton circuit, there is a 
C-bypass in D by the Bypass Lemma (3.10). As in the proof of Camion’s Theorem 
(3.9), we choose such a path P that the distance along C from its tail x to its head 
y is as small as possible. Let 


Vi:=V(P)\ {x,y}, Ve-=V(CG,y)), Vs:-= V(Cly, x), (3.1) 
V4:= V(D) \ (Vi UV2U V3). (3.2) 
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By the definition of a C-bypass, V; ¥ 0, and by the choice of C, V, # @. Letue V; 
and v € V2, and denote by d,(u) and d;(v) the numbers of neighbours of u and vu, 
respectively, in V;, i= 1,2,3,4. Then 


i d,(u) < 2|V,| — 2. 
By the choice of P, u and v are nonadjacent; moreover, 
d,(v)=0 and d,(u) =0. 


Also by the choice of P, there is no directed path (u, w,v) or (v,w,u) with w € V4, 
and so 


dg(u) + dav) < 2|Va|. 

Finally, by Lemma 3.2 and the choice of C, 
d3(u) < |V3| +1. 

Denote by P’ a longest (y,x)-path in D such that V3 C V(P’) C V2. UV3. Let 
Vj :=V,\V(P’) and V;:=V(P’), 


and denote by d;(u) and d;(v) the numbers of neighbours of u and uv, respectively, 
in V/, i= 2,3. By the choice of C, V; #9. For ve V5, 


d(v) <2|V3|-2 and d3(v) <|V3|/+1, 


the latter inequality following from Lemma 3.2 and the choice of P’. Summing the 
above inequalities yields 


d(u) + d(v) = d;(u) + d;(v) + d2(u) + d3(u) + d,(v) + dy(v) + dg(u) + da(v) 
< 2|V,| — 2 +|V3| + 2|V5] +|V3] + 2] Vol 
< 2|Vi| + 2|V2! +2|V3| + 2|[V,| —2=2n-2. 


This contradicts the hypothesis and establishes the theorem. 


4. Fundamental parameters 


In this section, we introduce three important measures of the complexity of a graph: 
the connectivity, the stability number, and the chromatic number. These parameters 
are discussed in detail in chapters 2 and 4. Here we highlight their relationship to 
paths and circuits. We also introduce the edge analogues of the above concepts: 
the edge connectivity, the matching number, and the edge chromatic number, as 
well as one parameter of particular relevance to paths and circuits, the toughness. 
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4.1. Connectivity and edge connectivity 


A connected graph is, by definition, one in which any two vertices are linked by 
a path. Alternatively, a connected graph can be regarded as one whose vertex set 
admits no partition into two proper subsets, where vertices in different subsets are 
nonadjacent. These two viewpoints lead to two different ways of measuring the 
degree of connectedness of a graph. 

We denote a graph G with two specified vertices x and y by G(x, y). The mem- 
bers of a collection of xy-paths in a graph G(x, y) are edge-disjoint if no two have 
an edge in common, and internally-disjoint if no two have an internal vertex in 
common. A graph G on at least two vertices is k-edge-connected if any two ver- 
tices are connected by at least A edge-disjoint paths, and k-connected if any two 
vertices are connected by at least & internally-disjoint paths; a graph on one vertex 
is defined to be both k-edge-connected and k-connected for k = 0,1, but not for 
k >2. Thus every graph is 0-connected, a graph is 1-connected if and only if it 
is connected, and a loopless graph with at least two edges is 2-connected if and 
only if it is nonseparable. Also, since internally-disjoint paths are edge-disjoint, 
k-connected graphs are k-edge-connected. 

Analogous definitions apply to digraphs. A digraph D is strongly k-edge- 
connected if any two vertices u and v are connected by at least k edge-disjoint 
(u,v)-paths, and strongly k-connected or k-strong if any two vertices u and v are 
connected by at least & internally-disjoint (u,v)-paths. 

The edge connectivity of a graph G is the greatest integer k for which G is k- 
edge-connected, the connectivity the greatest integer k for which G is k-connected. 
Likewise, in a digraph D, the strong connectivity is the greatest integer k for 
which D is k-strong, and the strong edge connectivity is the greatest integer k for 
which D is strongly k-edge-connected. The edge connectivity of G is denoted by 
x’(G), the connectivity of G by x(G), the strong edge connectivity of D by «'(D) 
and the strong connectivity of D by «(D). These parameters can be computed in 
polynomial time (see chapter 2). 

If X and Y are sets of vertices in a graph G, the set of edges of G with one 
end in X and the other end in Y is denoted by [X, Y]. When X UY is a partitio 
of V(G), the set [X, Y] is called an edge cut of G; if X,Y # 0, the edge cut [X,Y 
is nontrivial; if x € X and y € Y, it separates x and y. The edge cut [X, Y] is also 
called the coboundary of X (or of Y). Every nonempty edge cut is a disjoint union 
of minimal nonempty edge cuts, or bonds. In a connected graph G, a set S of edges 
is a bond if and only if G \ S has exactly two components. The edge cuts of a graph 
G form a vector space %(G) over GF(2) called the bond space of G. The bond 
space and cycle space are orthogonal complements in the edge space. Because 
of this relationship, edge cuts are sometimes called cocycles and bonds cocircuits. 
Edge cuts are also referred to as cuts or cutsets. 

Analogously, if X US UY is a partition of V, where X,Y #60 and no edge of 
G has one end in X and the other end in Y, the set S is called a vertex cut of 
G; thus a set S of vertices in a graph G is a vertex cut if and only if G\S§ is 
disconnected. If x € X and y € Y, we say that the vertex cut S separates x and y; 
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note that only nonadjacent vertices are separated by vertex cuts. Vertex cuts are 
also called articulation sets. 

An important and extremely useful theorem due to Menger (1927) links these 
concepts. 


Theorem 4.1 (Menger’s Theorem). 

— Edge version: Let G(x, y) be a graph. Then the maximum number of edge-disjoint 
xy-paths in G is equal to the minimum number of edges in an edge cut of G sepa- 
rating x and y. 

— Vertex version: Let G(x,y) be a graph, where x and y are nonadjacent vertices 
of G. Then the maximum number of internally-disjoint xy-paths in G is equal to 
the minimum number of vertices in a vertex cut of G separating x and y. 


There is a corresponding version of Menger’s Theorem (4.1) for directed paths in 
digraphs. Proofs of both versions are given in chapter 2: Connectivity and Network 
Flows. Menger’s Theorem (4.1) is the standard tool for dealing with questions 
involving connectivity. It is frequently used in the following form. 


Lemma 4.2 (The Fan Lemma). Let G be a k-connected graph, where k > 1, let x 
be a vertex of G, and let Y be a set of at least k vertices of G, where x ¢ Y. Then there 
exist distinct vertices y1,yo,...,¥x in Y and internally-disjoint paths P,,P2,...,P,, 
such that 

(i) P; is an xy;,-path, 1 <i <k, and 

(ii) VIP) OY = {yi}, 1 <i<k 


Remark 4.3. Menger’s Theorem (4.1) likewise guarantees that, given any two 
k-sets of vertices X := {x1,x2,...,x,} and Y = {y\,yo,...,¥;,} in a k-connected 
graph G, there are k disjoint paths P,, P2,..., P, connecting X and Y in G. It 
does not, however, guarantee that path P; can be so chosen as to connect vertices x; 
and y;, 1 <i < k. Such a system is called a k-linkage, and a graph is k-linked if it has 
a k-linkage for arbitrary k-sets X and Y. The 2-linked graphs were characterized 
by Seymour (1980) and Thomassen (1980c). The corresponding concept for edge- 
disjoint paths is called a weak k-linkage. Thomassen (1980c) conjectured that a 
k-edge-connected graph is weakly k-linked if k is odd. This would imply that a 
k-edge-connected graph is weakly (k — 1)-linked if k is even, a result established 
by Huck (1991). Further information on linkages can be found in chapter 2. 


The Fan Lemma (4.2) is a very versatile tool. We give here one illustration of 
Its use, by Dirac (1960). 


Theorem 4.4. Let G be a k-connected graph, where k > 2, and let S be a set of k 
vertices in G. Then G contains a circuit C that includes every vertex of S. 


Proof. The theorem is true for k =2, since the union of two internally-disjoint 
Xy-paths is a circuit that includes both x and y. We proceed by induction on k. 
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Suppose that the theorem holds for k <n, where n > 3. Let G be an n-connected 
graph, let S be a set of n vertices of G, and let x € S. Put S’ := S\ x. Then S$’ is 
a set of m — 1 vertices in the n-connected graph G. By the induction hypothesis, 
there is a circuit C’ in G that includes every vertex of S’. If x € V(C’), we may take 
C :=C’, and there is nothing more to prove. Suppose, then, that x ¢ V(C’). Note 
that the vertices of S’ partition C’ into n — 1 segments. There are two cases. If C’ 
is of length at least n, we apply the Fan Lemma (4.2) with k :=n and Y := V(C’‘). 
By the pigeonhole principle, two of the paths whose existence is guaranteed by 
the lemma, say P; and P;, must terminate at vertices y; and y; that lie in the same 
segment of C’. The desired circuit C can now be obtained from C’ by replacing 
the y;y;-segment of C’ by the y,y,-path P; P ;. If C’ is of length n — 1, we apply the 
Fan Lemma (4.2) with k:=n-—1 and Y := V(C’), and use the above argument. 

Q 


Remark 4.5. Watkins and Mesner (1967) characterized the extremal graphs for 
Theorem 4.4, that is, those k-connected graphs in which some set of k + 1 vertices 
is included in no circuit. For k > 3, they are the k-connected graphs having a set 
of k vertices whose deletion leaves a graph with more than k components; there 
are two additional families of extremal graphs when k = 2. 


Bermond and Lovasz (1975) conjectured a digraph analogue of Theorem 4.4 for 
the case k = 2, namely that in a digraph of sufficiently high strong connectivity, any 
two vertices lie on a common directed circuit. This was disproved by Thomassen 
(1991). Bondy and Lovdsz (1981) generalized Theorem 4.4 by showing that, if 
k 22, the circuits through a set S of k vertices in a k-connected graph G generate 
the cycle space of G (except in certain specified cases); any circuit of G can, 
moreover, be expressed as the sum of an odd number of circuits through S. This 
implies that there is an even circuit through § and, if G is not bipartite, also an 
odd circuit through S. 

Egawa et al. (1991) obtained a strong common generalization of Theorem 4.4 
and Theorem 2.15. 

J 
Theorem 4.6. Let G be a simple k-connected graph of minimum degree 5, where 
k >2, and let S be a set of k vertices of G. Then G contains either a circuit of length 
at least 26 that includes every vertex of S or a Hamilton circuit. 


Establishing the existence of circuits through specified edges is far more chal- 
lenging. Let e; := x,y; and e := x2y2 be any two edges in a 2-connected graph G. 
Form a new graph G’ by subdividing e; by v;, i =1,2. Then G’ is 2-connected 
and, by Theorem 4.4, there is a circuit in G’ that includes both v, and v2. Thus, in 
G, there is a circuit that includes both e, and e. This observation cannot be ex- 
tended to 3-connected graphs; three edges that are incident with the same vertex, 
or that form an edge cut, do not lie on a common circuit. However, these are the 
only exceptions, as Ad4m (1963) proved. Lovdsz (1974) was led to formulate the 
following general conjecture. 
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MESES INO ANAND 


Figure 6. A vine on a path. 


Conjecture 4.7. Let G be a k-connected graph, where k > 2, and let S be a set of 
k edges in G that induce a path system. Then there is a circuit of G that includes 
every edge of S, unless k is odd and S is an edge cut of G. 


A slightly weaker version of this conjecture, due to Woodall (1977), was verified 
by Haggkvist and Thomassen (1982). Their proof uses a variant of the Hopping 
Lemma (6.7), to be discussed in section 6. 


Theorem 4.8. Let G be a k-connected graph, where k > 2, and let S be a set of 
k — 1 edges in G that induce a path system. Then there is a circuit of G that includes 
every edge of S. 


A related conjecture, for 3-regular graphs, was formulated by Holton and 
Thomassen (1986). The connectivity hypothesis is replaced here by the less strin- 
gent one of cyclic edge connectivity. A connected graph G that contains two disjoint 
circuits is cyclically k-edge-connected if it has no bond S of fewer than k edges 
such that both components of G\ S contain circuits. 


Conjecture 4.9. Let G be a 3-regular cyclically k-edge-connected graph and let 
S be a set of k — 1 edges in G that induce a path system. Then there is a circuit 
of G that includes every edge of S. 


In Theorem 4.4, the Fan Lemma (4.2) was used to establish the existence of 
certain circuits in k-connected graphs. Another very useful tool in this context, 
developed by Dirac (1952), is that of a vine (see fig. 6). A vine on a path P := P[x, y] 
is a set P := {P,[x;,y;]: 1 <i < m} of internally-disjoint paths such that 

(a) P;NP = {xi, yi}, 1<icm, 

(b) x =x) < x2 < yy <3 ~ Yo <X4< ++) < Xm <Vm-1 < Ym =y on P. 

Dirac (1952) showed that there is a vine on any path in a 2-connected graph. A 
generalization of this result was given by Bondy and Locke (1981). Vines Y and 
2 on a path P(x, y] are disjoint if 

(a) the paths of Y, and Y, are internally-disjoint; 

(b) the only vertex that is the tail of a path of Y, and the tail of a path of P 
is x; 

(c) the only vertex that is the head of a path of Y, and the head of a path of 
P» is y. 

For sets of vertices X and Y ina graph, a path whose tail lies in X and whose 
head lies in Y is called an [X, Y]-path. 


Lemma 4.10 (The Vine Lemma). Let P be a path in a k-connected graph G. Then 
there are k —1 pairwise-disjoint vines P,, P2,..., Py_, on P. 
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Proof. Let P := uot; ---v,. Since G is k-connected, there are k internally-disjoint 
Upvs-paths Q;, Q2,.--, Q, in G. From each path Q,, we construct a family 2; of 
paths as follows. If V(P) M V(Q;) := {wi, W2,...,Wm}, Where w; < wj41 on Q;, 1 < 
j<m-1, we set 


9; = {Q;[w;, Wye]: wy < Wj on PH. 
We now define 
2 = UK, m. 
It follows from these definitions that if 
Xj = {v0,01,--.,0j-1} and Y;:= {Vj41,Vj42,---,U¢e}, where0O<j<é, 


then 2; contains an [X;, Y;]-path avoiding v,, for any j such that uv; ¢ V(Q;). Be- 
cause the paths Q; are internally-disjoint, at most one of them includes any par- 
ticular v;, and so 2 contains at least k —1 [X;, Y;]-paths avoiding v,, for each 
fa<j<d. 

We now construct the desired vines inductively. Initially, we select from 2 k —1 
[X,, Yi]-paths P,, P2,..., P,_, avoiding v; and set P;:— {P;}, 1<igk—1. At 
each subsequent stage, we modify the families ?; as follows. Let p; denote the 
vertex of Y, that is furthest from vp on P. If p; = pz = --- = px_| = ve, the families 
PY, are vines on P satisfying the conditions of the lemma. Otherwise, we define 
U; = p), where we assume (relabellingif necessary) that p, < p2 <--- < py_. Each 
P;, 2<i <k — 1, contains at most one [X;, Y;]-path avoiding v;. Therefore, there is 
at least one more such path Q in 2 \ US P;. Let u; be the tail of Q. We modify the 
family ?, by adding Q and deleting each path whose head lies in P(u;,v;] except 
the one whose tail is closest to ug; the families A, 2 <i < k — 1, are not modified. 
This procedure terminates with & — 1 pairwise-disjoint vines P,, P),..., P,_; on 
P. Do 


Dirac (1952) used the Vine Lemma (4.10) to prove that, if a 2-connected graph 
has a path of length @, it necessarily has a circuit of length at least 22/7. (The grapfh 
of fig. 6 is an extremal graph for this theorem, with @ = 25.) The lemma was also 
applied by Bondy and Locke (1981) and Locke (1982) to establish analogous results 
for graphs of higher connectivity; for example, if a 3-connected graph has a path of 
length @, it necessarily has a circuit of length at least 2¢/5. Of particular note here is 
the marked difference in the behaviour of 2-connected and 3-connected graphs; we 
shall see further instances of this phenomenon shortly. A similar change may well 
occur in the passage from 3-connected to 4-connected graphs; several intriguing 
conjectures support this view, although there is little hard evidence. 

Dirac (1952) also used the Vine Lemma (4.10) to prove Theorem 2.15, a gener- 
alization of his Theorem 2.1. The same proof technique (see Pésa 1963, proof of 
Theorem 3) yields the corresponding generalization of Ore’s Theorem (2.4) first 
stated explicitly by Bermond (1976) and Linial (1976). 
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Theorem 4.11. Let G be a simple 2-connected graph in which the degree sum of 
any two nonadjacent vertices is at least d. Then G contains either a circuit of length 
at least d or a Hamilton circuit. 


Proof. Let P(x, y] be a longest path in G and P := {P;{x,,y]: 1 <i < m} a vine 
on P; note that P, and P,, are of length one, by the choice of P. ‘Conadee the 
circuit 


where C; := = P{xi, yil P i, 1 <i <m, and assume that is chosen so that 

(i) |P| =m is as on as possible; 

(ii) subject to (i), |V(C) NV(P)| is as large as possible. Set P := uguiv2-++u,2, 
where Uo := x, 0; := y, and v; := y, and define 


X := {u;: xvi © E} and Y := {u;: uy € E}. 


If X NY #9, then (as in Proof 2 of Dirac’s Theorem, 2.1) C is a Hamilton circuit of 
G, since P is a longest path. If ¥ 7 Y = 0, then x and y are nonadjacent. Moreover, 
by the choice of ¥, C contains every vertex of (X UY U {y}) \ {v,_1} and thus is 
of length at least d(x) + d(y). By hypothesis, this is at leastd. O 


One consequence of Theorem 4.11 is a frequently used theorem of Erdés and 
Gallai (1959). 


Corollary 4.12, Let G(x, y) be a simple 2-connected graph such that d(v) > d, for 
all v £.x,y. Then there is an xy-path of length at least d in G. 


Proof. Denote by H the graph obtained from G by adding a set S of d—1 
vertices and joining them to one another and to both x and y. Then H is a simple 
nonseparable graph on n vertices, where n > 2d > 4, in which the degree sum of 
any two nonadjacent vertices is at least 2d. By Theorem 4.11, H contains a circuit 
C of length at least 2d. If C includes vertices of 5, C \ S is an xy-path in G of length 
at least d. Otherwise C C G and, because G is 2-connected, there are disjoint paths 
P, and P, in G connecting {x,y} and C. In this case, the subgraph C U P; UP; of 
G contains two xy-paths, at least one of which is of length at least d. O 


Theorem 4.11 is an Ore-type generalization of Theorem 2.15. Voss and Zuluaga 
(1977) generalized the latter theorem in another direction. 


Theorem 4.13. Let G be a simple nonseparable nonbipartite graph of minimum 
degree & on n vertices, where 8 <n/2. Then G contains both an odd circuit of length 
at least 26 — 1 and an even circuit of length at least 25. 


This clearly implies Theorem 2.15 in the case of nonbipartite graphs. For bipar- 
tite graphs, it suffices to add an edge between two vertices in the same part of 
the bipartition and then apply Theorem 4.13. We propose the following conjec- 
ture, a strengthening of one mentioned in Locke (1985). If true, it would imply 
Theorem 4.13 in the case of 3-connected graphs. 


40 J.A. Bondy 


Conjecture 4.14. Let G be a simple 3-connected graph of minimum degree 6 on 
n vertices, where 5 < n/2. Then the circuits of length at least 26 — 1 generate the 
cycle space of G; moreover, every circuit of G can be expressed as the sum of 3 an 
odd number of circuits of length at least 26 — 1. 


We close this discussion of connectivity with an attractive conjecture of S. Smith 
(see Grétschel 1984). 


Conjecture 4.15. In a k-connected graph, where k > 2, any two longest circuits 
have at least & vertices in common. 


4.2. Stability number 


A set S of vertices of a graph G is stable if no two elements of S are adjacent in 
G. The maximum cardinality of a stable set of G is called the stability number of 
G and denoted by a(G). Stable sets are also called independent sets and the stabil- 
ity number the independence number. Determining whether a graph has stability 
number at most k is an “Y-complete problem. 

A simple and surprising relationship linking the stability number, the connectiv- 
ity, and the existence of Hamilton circuits was discovered by Chvatal and Erdés 
(1972). 


Theorem 4.16 (The Chvdtal-Erdés Theorem). Let G be a graph on at least three 
vertices with stability number a and connectivity x, where a < x. Then G is hamil- 
tonian. 


The proof of the Chvatal-Erdés Theorem (4.16) is based on the fundamental 
concept of a bridge, illustrated in fig. 7. Let G be a graph and H a subgraph of 
G. Consider the equivalence relation ~ on E(G)\ E(H) defined by e ~ f if and 
only if there is a walk internally-disjoint from H whose first edge is e and whose 
last edge is f. The subgraphs of G induced by the equivalence classes under this 
relation are the bridges of H in G; isolated vertices of G in G \ H are also regarded ' 
as bridges of H. A bridge having at most one edge is degenerate; all other bridges 
are proper. A chord of H is a degenerate bridge linking two distinct vertices of H. 
If B is a bridge of H in G, the elements of V(B) \ V(H) are the internal vertices 
of B and the elements of V(B)  V(H) the vertices of attachment of B to H. The 
edges of B incident to its vertices of attachment are its edges of attachment to H. 

For the proof of the Chvatal-Erdés Theorem (4.16), we shall also need the 
undirected version of the concept of a bypass, defined for digraphs in section 3. 
Let G be a graph, and let C be a subgraph of G. A C-bypass in G is a path of 
length at least two whose ends lie in C and whose internal vertices lie in G \ C. 


Proof of Theorem 4.16. Let C be a longest circuit in G. Suppose that C is not a 
Hamilton circuit. Let B be a proper bridge of C in G and denote by S its set of 
vertices of attachment to C. Any two vertices of S are connected by a C-bypass 
contained in B. Because C is a longest circuit, we deduce from the Lollipop Lemma 
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Figure 7. A bridge of a circuit. 


(2.2) that S* is a stable set of G and that no internal vertex x of B is adjacent to 
any vertex of S*. Thus S* U {x} is a stable set and 


|S*| <a. 
Also, S is a vertex cut of G (B \ S being one component of G \ S$) and so 
|S] > x. 
But now 
kK <|S|=|S*| <a, 
contradicting the hypothesis. Therefore, C is indeed a Hamilton circuit of G. O 


For future reference, we note the following consequence of the Chvadtal-Erdés 
Theorem (4.16), derived by the same technique as was used to establish Corol- 
lary 2.3 from Dirac’s Theorem (2.1). 


Corollary 4.17. Let G be a graph with stability number a and connectivity x, where 
a<k +1. Then G has a Hamilton path. 


Bondy (1978a) observed that the Chvatal-Erdés Theorem (4.16) implies Ore’s 
Theorem (2.4) and also the following result of Nash-Williams (1971b). 


Corollary 4.18. Let G be a simple d-regular graph on 2d + 1 vertices, where d > 2. 
Then G is hamiltonian. 


The Chvatal-Erdés Theorem (4.16) applies only to graphs G which satisfy a < x. 
Kouider (1994) extended it to all 2-connected graphs. 


Theorem 4.19. Let G be a graph with connectivity x, where x >2, and let H 
be a subgraph of G. Then G has a circuit C such that either V(H) C V(C) or 
a(H\ C) < a(H) — «(G). 
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A conjecture of Amar, Fournier, Germa, Haggkvist and Thomassen (see 
Fournier 1982) on the number of circuits needed to cover the vertex set of a 
2-connected graph follows easily from the above theorem. It, too, is an extension. 
of the Chvdtal-Erdés Theorem (4.16). A vertex covering of a graph or digraph is 
a family of subgraphs whose union spans the graph or digraph. 


Corollary 4.20. Let G be a graph with stability number a and connectivity x, where 
k > 2. Then G admits a vertex covering by at most [a/x] circuits. 


The 5-circuit is one example of a graph which satisfies the inequality a < « of 
the Chvdtal-Erdés Theorem (4.16) but is not pancyclic. There are many others; 
indeed, if G has this property, so does the composition G[K,]. Jackson and Ordaz 
(1990) conjecture, however, that in all such examples a = x. 


Conjecture 4.21. Let G be a graph with stability number a and connectivity x, 
where a < « — 1. Then G is pancyclic. 


The idea of considering the minimum degree sum of two nonadjacent vertices 
was introduced by Ore (1960). More generally, the minimum degree sum of a 
stable set of k vertices, denoted o;, turns out to be a parameter of relevance to 
circuits. Bondy (1981) proposed a common generalization of Ore’s Theorem (2.4) 
and the Chvatal-Erdés Theorem (4.16) involving this parameter. 


Conjecture 4.22. Let G be a simple k-connected graph on a vertices such that 
Ox,1 2n+k(k —1), and let C be a longest circuit in G. Then G\ C contains no 
path of length k —1. 


Conjecture 4.22 has been verified for kK = 1 (Ore’s Theorem 2.4) and k =2 (a 
generalization of Theorem 2.16); these are special cases of Theorem 4.23 below, 
which establishes a weaker form of the conjecture. The case k =3 of Conjec- 
ture 4.22 is pertinent to Conjecture 5.20 and can be proved by slightly modifying 
the proof of Theorem 4.23 (Bondy 1981). 


Theorem 4.23. Let G be a simple k-connected graph on n vertices such that o;,, > 
n+k(k -1), and let C be a longest circuit in G. Then G\C contains no complete 
subgraph of order k. 


Bauer et al. (1989) applied the case k = 2 of Theorem 4.23 to derive a sufficient 
condition for a graph to be hamiltonian. This is yet another generalization (of a 
generalization) of Dirac’s Theorem (2.1). 


Theorem 4.24. Let G be a simple graph of connectivity x on n vertices such that 
Oo; >n+«, where x > 2. Then G contains a Hamilton circuit. 


Another variant of Conjecture 4.22 was established by Fraisse (1986). It concerns 
k-dominating circuits rather than longest circuits, and also generalizes Ore’s Theo- 
rem (2.4). A dominating circuit of a graph G is a circuit C such that V(G) \ V(C) 
is a Stable set of G. More generally, if k is a positive integer, a k-dominating circuit 
of a graph G is a circuit C such that each component of G\ C has fewer than & 
vertices. 
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Theorem 4.25. Let G be a simple k-connected graph on n vertices, where n > 3, 
such that 0,4; 2n+k(k —1). Then G has a k-dominating circuit. 


_The concepts of k-dominating circuit and k-walk are intimately related via a 
graph product called composition. The composition (or lexicographic product) of 
simple graphs G and H is the graph G[H] whose vertex set is V(G) x V(H), 
vertices (i;,0;) and (uz, v2) being adjacent if and only if either u,u, € E(G) or both 
uy = uy and v2 € E(H). Jackson and Wormald (1990a) observed that a graph G 
has a k-walk if and only if the composition G[K;,] has a k-dominating circuit. 
Applying Theorem 4.25, they deduced the following common generalization of 
Theorem 2.17 and Theorem 2.4. 


Corollary 4.26. Let G be a simple connected graph on n vertices such that o,,, > Nn. 
Then G has a k-walk. 


Closely related is a common generalization of Theorem 2.18 and the Ore-type 
analogue of Corollary 2.3, due to Win (1975). 


Theorem 4.27. Let G be a simple connected graph on n vertices such that o, > 
n—1, where k >2. Then G contains a k-tree. 


Fournier and Fraisse (1985) proved a common generalization of Theorem 4.11 
and the Chvatal-Erdés Theorem (4.16) involving degree sums of stable sets of 
vertices. 


Theorem 4.28. Let G be a simple k-connected graph, where k > 2, such that ox, 2 
d. Then G contains either a circuit of length at least 2d/(k +1) or a Hamilton circuit. 


The stability number also plays an important role in digraphs. We define a stable 
set of a digraph to be a stable set of its underlying graph. Gallai and Milgram (1960) 
found a beautiful link between the stability number of a digraph and partitions of 
its vertex set into directed paths. A partition of a graph G is a family of disjoint 
subgraphs whose vertices partition V(G). A path partition is a partition into paths, 
and the minimum number of paths in a path partition of a graph G is denoted by 
7(G). Analogous definitions apply to digraphs, a path partition of a digraph being 
a partition into directed paths. 

For a path partition 2 in a digraph D, we denote by a(2) the cardinality of 
a maximum stable set whose elements belong to different paths of 2, by h(2) 
the set of heads of the paths in 2 and by ¢(2) the set of tails of the paths in 2. 
The following lemma (Bondy 1995) is a slight refinement of one due to Gallai and 
Milgram (1960). 


Lemma 4.29. Let D be a digraph and 2 a path partition of D such that |2| > a(2). 
Then there is a path partition P of D such that |P| =|2\|—1, h(P) C h(Q) and 
UP) c t(D). 
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Proof (by induction on v(D), the case v(D) =1 holding vacuously). Let 2 be a 
path partition of D such that |2| > a(2). Then A(2) is not a stable set, so there 
exist y,z © A(2) such that (y,z) € E. If (z) is a path of 2, we define # to be the 
path partition of D obtained from 2 by deleting (z) and extending the path of 2 
with head y by the edge (y,z). Thus we may assume that (z) is not a path of 2. 
Let x be the predecessor of z on the path of 2 with head z, set D’:= D \ z, and 
let 2’ be the restriction of 2 to D’. Note that 


\2'| =|2| > a(2) 2 a(2'), 
A(Q') = (h(Q)\ {z})U {x} and #(2’) = #(Q). 


By the induction hypothesis, there is a path partition ¥’ of D’ such that |#’| = 
|2"|— 1, A(P’) C A(Q’) and t(F') c t(Q’). If x € h(P"’), we define F to be the path 
partition of D obtained from #’ by extending the path of 9’ with head x by the 
edge (x,z). If x ¢ h(P’), then y € h(P’) and we define # to be the path partition 
of D obtained from #’ by extending the path of #’ with head y by the edge (y, z). 
In both cases, |P| = |2|—1, A(P) C A(Q) and t(P) ce(2). O 


An immediate consequence of this lemma is the theorem of Gallai and Milgram 
(1960) alluded to above. 


Theorem 4,30 (The Gallai-Milgram Theorem). Let D be a digraph with stability 
number a. Then D admits a partition into at most a directed paths. That is, 


Waa. 


Remark 4.31. The Gallai-Milgram Theorem (4.30) can be viewed as a formula 
for the stability number of a graph in terms of the path partition numbers of its 
orientations: 


a(G) = max {n(G): G an orientation of G}. 


To see this, observe that a(G) > a(G) for every orientation G of G, by the 
Gallai-Milgram Theorem (4.30), and that the reverse inequality holds when G is 
an onentation of G in which every vertex of some maximum stable set is a source. 


The Gallai-Milgram Theorem (4.30) includes a well-known theorem of Dilworth 
(1950) as a special case. 


Theorem 4,32 (Dilworth’s Theorem). Let P := (X,~<) be a partially ordered set. 
Then the minimum number of chains of P into which X can be partitioned is equal 
to the maximum number of elements in an antichain of P. 


Proof. Denote by D(P) the digraph whose vertex set is X and whose edges 
are the ordered pairs (u,v) such that u <v in P. Chains and antichains in P then 
correspond to directed paths and stable sets, respectively, in D(P), and the Gallai- 
Milgram Theorem (4.30) implies the existence of a chain partition of cardinality 
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no greater than the maximum number of elements in an antichain. The reverse 
inequality follows from the observation that no two elements of an antichain can 
belong to a common chain. O 


_An analogue of the Gallai-Milgram Theorem (4.30) for strong digraphs was 
proposed by Galiai (1964). 


Conjecture 4.33. Let D be a strong digraph with stability number a. Then D 
admits a vertex covering by at most a directed circuits. 


Two special cases of Conjecture 4.33 are easily verified. When D is the associated 
digraph of a 2-edge-connected graph G, this follows from a result of Pésa (1963) 
which asserts that any graph admits a partition into at most a@ circuits, edges 
and vertices; alternatively, one may apply Corollary 4.20 to an endblock of G 
and use induction. And when a = 1, it is just Camion’s Theorem (3.9). Chen and 
Manalastas (1983) (see also Bondy 1995) verified the case a = 2. 


Theorem 4.34. Let D be a strong digraph with stability number two. Then D 
admits a vertex covering by two directed circuits {C,,C2}, where C, OC, is either 
empty or a directed path of D. 


One interesting consequence of Theorem 4.34 is the following. 


Corollary 4.35. Let D be a strong digraph with stability number two. Then D 
contains a directed Hamilton path. 


This corollary leads to two conjectures. One, a simultaneous generalization 
of Corollary 4.35 and the Gallai-Milgram Theorem (4.30), was proposed (in a 
stronger form) by M. Las Vergnas (see Berge 1983). 


Conjecture 4.36. Let D be a strong digraph with stability number a, where a > 2. 
Then D admits a partition into at most a — 1 directed paths. 


The second conjecture, put forward by Jackson and Ordaz (1990), involves a 
less stringent definition of stability in digraphs. A stable set in a digraph D may 
equally well be defined as a set S of vertices such that D[S] contains no directed 
circuit, or such that D[S] contains no directed 2-circuit. The corresponding stability 
numbers are called the acyclic stability number and the oriented stability number, 
respectively, and are denoted by a, and a). Clearly, a < a, < a. The following 
conjecture would thus generalize Corollary 4.17. 


Conjecture 4.37, Let D be a digraph with oriented stability number a, and strong 
connectivity x, where a2 < «+1. Then D contains a directed Hamilton path. 


Remark 4.38, It is not true in general that a k-strong digraph with stability num- 
ber k +1 contains a directed Hamilton path; while this is so for k =0, by Rédei’s 
Theorem (3.1), and also for k = 1, by Corollary 4.35, it is false for k = 2, a coun- 
terexample being the digraph derived from the composition of K,3 with K2 by 


orienting each copy of Ky» that corresponds to an edge of K,3 as a directed 
4-circuit. 
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Jackson (1987) also made use of the concept of oriented stability number to 
establish a directed analogue of the Chvdtal-Erdés Theorem (4.16). 


Theorem 4.39. Let D be a digraph with strong connectivity x, where k > 2(a2 + 
2)! Then D contains a directed Hamilton circuit. 


Heydemann (1985) likewise proved the directed analogue of Conjecture 4.33 for 
the oriented stability number. 


4.3. Matching number 


In this section, we consider the edge analogue of a stable set. A set M of edges of a 
graph G is a matching if no two elements of M are adjacent in G. The matching M 
is perfect if each vertex of G is incident to an edge of M. The maximum cardinality 
of a matching of G is called the matching number of G and denoted by v(G). 
The Ore-type generalization of Corollary 2.3 referred to at the end of section 
2 asserts that a simple graph on 7 vertices in which the degree sum of any two 
nonadjacent vertices is at least n — 1 has a Hamilton path. Haggkvist (1979) showed 
that when 7 is even, the very same hypothesis implies something much stronger. 


Theorem 4.40. Let G be a simple graph on 2m vertices in which the degree sum 
of any two nonadjacent vertices is at least 2m — 1. Then each perfect matching of G 
is contained in a Hamilton path of G. 


Proof. Let M := {x1y1,x2y2,---,XmYm} be a perfect matching of G, where the 
sets X := {x1,%2,...,Xm} and Y := {y1,y2,...,¥m} are chosen so as to maximize 
the number of edges in the bipartite subgraph of G \ M induced by the bipartition 
(X,Y). Denote this subgraph by B. By the choice of X and Y, 


dp (xi) + da(yi) > 3 (dam (xi) + dau (i) = 3(de (xi) + dg(vi)) - 1, 
1l<igm. 


Let D be the digraph obtained from B by first orienting each edge from X to 
Y and then identifying x; and y;, denoting the resulting vertex by v;, 1 <i <m. 
For any two nonadjacent vertices v; and vu; of D, 


dp(v;) + dp(v;) = dg(x;) + dg(y;) + da (x;) + da(y;) 
2 $(de(x:) + dg(y;)) + 4(de(x;) + dg(yi)) — 2 2 2m - 3. 


It now follows from Meyniel’s Theorem (3.11), by applying the digraph analogue 
of the trick used to establish Corollary 2.3 from Dirac’s Theorem, 2.1, that D has a 
directed Hamilton path. Without loss of generality, let this path be (v1, v2,...,Um). 
Then x, ¥1X2¥2-+:XmYm is a Hamilton path of G through each edge of M. 


An analogous theorem on Hamilton circuits containing perfect matchings also 
due to Haggkvist (1979), was extended by Berman (1983) as follows. 
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Theorem 4.41. Let G be a simple graph on n vertices, in which the degree sum of 
any two nonadjacent vertices is at least n+1, where n > 3. Then each matching of 
G is contained in a circuit of G. 


Theorem 4.41 implies Theorem 4.40. Let G satisfy the hypotheses of Theo- 
rem 4.40. Form G’ from G by adding two new vertices, u and v, and joining each 
to every vertex of G and also to one another. Define M' := M U {e}, where e := wu. 
Then M’ is a perfect matching of G’. Also, the degree sum of any two nonadjacent 
vertices of G' is at least 2+ 3 = (2m +2) +1. By Theorem 4.41, G’ has a Hamilton 
circuit through each edge of M’, so G has a Hamilton path through each edge of 
M. 

Jackson and Wormald (1990b) characterized the extremal graphs for Theo- 
rem 4.41 and derived a strengthening of Dirac’s Theorem (2.1) for graphs whose 
number of vertices is divisible by four. 


Theorem 4.42, Let G be a simple graph of minimum degree 5 on n vertices, where 
6 >n/2 and n=0 (mod 4). Then each perfect matching of G is contained in a 
Hamiiton circuit of G. 


Matchings enjoy a rich and extensive theory. This is discussed in detail in chapter 
3 and in the treatise by Lovadsz and Plummer (1986). The algorithmic aspects of 
matchings are also well developed. In particular, a maximum-weight matching in 
an edge-weighted graph can be found in polynomial time. This leads to an efficient 
method for finding a shortest path of specified parity between two vertices of a 
graph, if there is one. The idea, due to J. Edmonds (see Grétschel and Pulleyblank 
1981), is most easily explained with the aid of the following definition. The cartesian 
product of simple graphs G and H is the graph GOH whose vertex set is V(G) x 
V(A), vertices (u;,v,) and (u2,U2) being adjacent if and only if either wu, = u, and 
U1v2 € E(HM) or vy = v2 and uyu2 € E(G). The cartesian product GOK; is referred 
to as the prism over G; it can be viewed as consisting of two disjoint copies of G, 
G, and G), joined by a perfect matching M := {v,v.|v € V(G)}, where u; is the 
vertex of copy G; corresponding to vertex v of G, i = 1,2. 

There is a natural bijection between the odd xy-paths of G and the perfect 
matchings of the graph H derived from the prism over G by deleting the vertices 
X2 and y2 of G2 that correspond to x and y. Moreover, an odd xy-path of length 2 
in G corresponds to a perfect matching of H that includes exactly n — 1 — £ edges 
of M. Therefore, if each edge of G; in H, i =1,2, is assigned weight n — 1, and 
each edge of M in H is assigned weight n, where n := u(G), a matching of weight 
n(n — 1) — @ in H is necessarily a perfect matching of H and corresponds to an odd 
xy-path of length 2 in G. Thus a maximum-weight matching in H corresponds to 
a shortest odd xy-path in G. A shortest even xy-path in G can be found by simply 
adding a vertex y’ and edge yy’ to G and finding a shortest odd xy’-path in the 
resulting graph G’. 
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4.4. Chromatic number and edge chromatic number 


A k-colouring of a graph G is a function c: V(G) — S, where S is a k-set; com- 
monly, S is the set {1,2,...,k}. The element c(v) is called the colour of vertex vu, 
and the sets C; := {v: c(v) =i}, i € S, the colour classes of G with respect to c. 
A k-colouring of a graph is thus a partition of its vertex set into colour classes. A 
colouring c is proper if each colour class of G with respect to c is a stable set of G; 
only loopless graphs admit proper colourings, and we implicitly assume here that 
all graphs are loopless. A graph is k-colourable if it admits a proper k-colouring. 
A 2-colourable graph is one which is bipartite, and thus can be recognized as such 
in polynomial time (see section 1.6, Search Trees). On the other hand determining 
whether a graph is k-colourable when k > 3 is an NP-complete problem. 

The chromatic number of a graph G is the minimum value of k for which G 
is k-colourable, and is denoted by y(G). A graph G is k-chromatic if y(G) = k, 
and y-critical if x(G \ v) < y(G) for all vertices v of G. Two basic properties of 
x-critical k-chromatic graphs are easily established: they are nonseparable and 
have minimum degree at least k — 1. 

A graph with chromatic number at least three is nonbipartite and so contains an 
odd circuit. Erdés and Hajnal (1966) extended this observation by showing that a 
k-chromatic graph, where k > 3, contains an odd circuit of length at least K —1. A 
word of caution is in order here. Despite its appearance, the theorem of Erdés and 
Hajnal is linked only superficially to the chromatic number; indeed, the theorem 
is an immediate consequence of Theorem 4.13: a k-chromatic graph contains a 
X-critical k-chromatic subgraph and thus has minimum degree at least k — 1. 

A more essential use of the chromatic number was made by Gallai (1968a) and 
Roy (1967), who discovered a simple relationship between the chromatic number 
of a digraph and the length of a longest directed path in the digraph, where the 
chromatic number x(D) of a digraph D is defined to be the chromatic number of 
its underlying graph G(D). We denote the number of vertices in a longest directed 
path of a digraph D by A(D). 


Theorem 4.43 (The Gallai-Roy Theorem). Let D be a digraph with chromatic 
number y. Then D contains a directed path of length at least y — 1. That is, 


AzYX. 


Proof. Let D’ be a maximal acyclic subdigraph of D, A’ the number of vertices 
in a longest directed path of D’, and A‘(v) the number of vertices in a longest 
directed v-path of D’. Noting that V(D’) = V(D), define a A’-colouring c of D by 
setting c(v) := A’(v), v € V(D’), We claim that c is a proper colouring of D. 

Observe, first, that if there is an (x, y)-path P in D’, then c(x) > c(y). For if Q 
is a y-path in D’, PQ is an x-path in D’, because D’ is acyclic. In particular, if 
(x,y) € E(D’), c(x) # c(y). On the other hand, if (x,y) € E(D) \ E(D’), then there 
is an (y, x)-path in D’, by the choice of D’, and again c(x) £c(y). O 


Remark 4.44. The Gallai-Roy Theorem (4.43) can be regarded as a “dual” of 
the Gallai-Milgram Theorem (4.30), the roles of directed paths and stable sets 
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being interchanged; this duality is well understood in the case of acyclic digraphs 
(Frank 1980; see also chapter 2), but how far it extends to all digraphs is not clear. 
The Gallai-Roy Theorem (4.43) can also be viewed as a formula for the chromatic 
number of a graph in terms of the lengths of longest directed paths in orientations 
of the graph: 


x(G) = min {A(G): G an orientation of G}. 


To see this, observe that y(G) < A(G) for every orientation G of G, by the 
Gallai-Roy Theorem (4.43), and that the reverse inequality holds when G is the 
orientation of G in which u dominates v whenever c(u) < c(v), where c:V > 
{1,2,...,x} is any y-colouring of G (cf. Remark 4.31). 


Laborde et al. (1983) proposed the following rather strong refinement of the 
Gallai-Roy Theorem (4.43). 


Conjecture 4.45. Let D be a nontrivial digraph and A, + A2 a partition of A(D) 
into two positive integers. Then there is a partition (V;,V2) of V(D) such that 
A(D(Vi)) ie Qi, i= 1,2. 


The case Az = 1 of Conjecture 4.45 asserts that in any digraph there is a stable 
set meeting every longest directed path; applied recursively, this would yield the 
Gallai-Roy Theorem (4.43). Interestingly, the dual statement is conjectured by 
Hahn and Jackson (1990) to be false. 


Conjecture 4.46. For any positive integer k, there exists a digraph D with stability 
number k such that a(D \ U*_'P;) =a(D) for any family {P;,P2,...,Px-1} of 


k —1 disjoint directed paths of D. 


A generalization of the Gallai-Roy Theorem (4.43) to strong digraphs was ob- 
tained, with the aid of the Bypass Lemma (3.10), by Bondy (1976). 


Theorem 4.47. Let D be a strong digraph with chromatic number x. Then D 
contains a directed circuit of length at least x. 


Linial (1981) proposed a common generalization of the Gallai-Roy Theorem 
(4.43) and the Gallai-Milgram Theorem (4.30). In order to state it, we need two 


definitions. Let D be a digraph and k a positive integer. A path partition ? of D 
that minimizes the function 


SS min{v(P), 


Pe? 


is called k-optimal, and the minimum value of this function is denoted by 7,(D). 
In particular, 7, = 7, the minimum number of directed paths in a path partition 
of D, and m =v(D) if k 2 A(D). A partial k-colouring of D is a family € of 
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k pairwise-disjoint stable sets of D. A partial k-colouring € that maximizes the 
function 


lUcee C| 


is called optimal, and the maximum value of this function is denoted by a;,(D). 
In particular, a; = a, the stability number of D, and a, =v(D) if and only if 
k > x(P). 


Conjecture 4.48. For any digraph D and any positive integer k, 
ak 2 Tx. 


Conjecture 4.48 was first posed, as a question, by Linial (1981), and was proved 
by him for acyclic digraphs; it achieved the status of a conjecture with Berge 
(1982). When k = 1, it is the Gallai-Milgram Theorem (4.30); when k > A(D), it 
is the Gallai-Roy Theorem (4.43). It is unproved for all other values of k. A still 
stronger conjecture was proposed by Berge (1982). A path partition 9 and a partial 
k-colouring @ are orthogonal if every directed path P € meets min{u(P), k} 
different colour classes of €. 


Conjecture 4.49. Let D be a digraph, k a positive integer and a k-optimal path 
partition of D. Then there is a partial k-colouring of D orthogonal to . 


Like its weaker variant, this conjecture is valid for k =1 (Linial 1978) and for 
k > A(D) (Berge 1982). It has also been verified for acyclic digraphs by several 
authors, some using network flow techniques (see, for example, Aharoni et al. 
1985). 

The duality between directed paths and stable sets referred to in Remark 4.44 
suggests potential duals of Conjectures 4.48 and 4.49. The stronger of these dual 
conjectures turns out to be false (Aharoni et al. 1985), but the dual of Conjec- 
ture 4.48, posed as a question by Linial (1981), is still open. Let D be a digraph 
and k a positive integer. A partial k-path partition of D is a family ¥ of k pairwise- 
disjoint directed paths of D. A partial k-path partition P that maximizes the func- 
tion 


|Upeg V(P)| 


is called optimal, and the maximum value of this function is denoted by A,(D). In 
particular, A, = A, the number of vertices in a longest directed path of D, and A, = 
v(D) if and only if k > w(D). A proper vertex colouring @ of D that minimizes 
the function 


S> min{|C|, k} 
CE 


is called k-optimal, and the minimum value of this function is denoted by y,(D). 
In particular, x; = x, the chromatic number of D, and y, = v(D) if k > a(D). 
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Figure 8. The Petersen graph: three drawings. 


Conjecture 4.50. For any digraph D and any positive integer k, 
Ay 2 Xk 


The restriction of Conjecture 4.50 to acyclic digraphs was verified by Hoffman 
(1983). When k = 1, the conjecture is the Gallai-Roy Theorem (4.43); when k > 
a(D), it is the Gallai-Milgram Theorem (4.30). The conjecture is unproved for all 
other values of k. 

We conclude by defining the edge analogue of vertex colouring. A k-edge colour- 
ing of a graph G is a function c: E(G) — S, where S is a k-set. An edge colouring 
c is proper if each colour class of G with respect to c is a matching of G; only 
loopless graphs admit proper edge colourings. A graph is k-edge-colourable if it 
admits a proper k-edge colouring. The edge chromatic number of a graph G is 
the minimum value of & for which G is k-edge-colourable, and is denoted by 
x (G). The edge chromatic number of a bipartite graph is equal to its maximum 
degree, and thus can be determined in polynomial time. For graphs in general, 
however, it is an NP-complete problem to determine whether a graph is k-edge- 
colourable when k 2 3. The topic of circuits in edge-coloured graphs is touched 
upon in Remark 8.39. Edge and vertex colourings are discussed in detail in chapter 
4: Colouring, Stable Sets and Perfect Graphs. 


4.5. Toughness 


We have concentrated up to now on sufficient conditions for the existence of 
Hamilton circuits. Necessary conditions are unfortunately much harder to come 
by. A simple one can be derived as follows. Let G be a hamiltonian graph, 1 
a Hamilton circuit in G and S a vertex cut of G. Clearly H \S has at most || 
components. Because H\S is a spanning subgraph of G\S, the same is true 
of G\S. Thus c(G\S) < |S| for every vertex cut S of G. A graph G with this 
property is called tough. Hamiltonian graphs are tough, as we have just observed, 
but not every tough graph is hamiltonian, the Petersen graph of fig. 8 being a 
counterexample. 

The Petersen graph was, in fact, first constructed by Kempe (1886); it was redis- 
covered twelve years later by Petersen (1898). It is a graph with many remarkable 
properties. It is not 3-edge-colourable, and is the smallest simple 3-regular graph 
with this property. This implies that it is nonhamiltonian, since the edges of a 
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Figure 9. The Coxeter graph. 


Hamilton circuit could be properly 2-coloured and the remaining edges assigned 
a third colour, yielding a proper 3-edge colouring. It is, moreover, maximally non- 
hamiltonian: the addition of any one edge yields a hamiltonian graph. Another 
very interesting 3-regular graph is the Coxeter graph, discovered by H.S.M. Cox- 
eter (see Tutte 1960) and shown in fig. 9; the beautiful drawing is due to M. Randic 
(personal communication 1981). This graph, like the Petersen graph, is not only 
nonhamiltonian (Tutte 1960) but also maximally nonhamiltonian (Clark and En- 
tringer 1983). In addition, it is tough. 

Faced with such examples, Chvatal (1973) was prompted to consider a refine- 
ment of the concept of toughness. Let ¢ be a positive real number. A graph G is 
t-tough if 

(G\s) <4 

o 
for every vertex cut S of G. In particular, a 1-tough graph is one that is tough. The 
toughness of a graph G is the largest value of t for which G is t-tough. Chvatal 
(1973) conjectured the existence of a constant ¢ such that every t-tough graph is 
hamiltonian. Examples constructed by Enomoto et al. (1985) show that, if such 
a constant ¢ does indeed exist, ¢ > 2. A natura] conjecture, then, is the following 
one. 


Conjecture 4.51. Every 2-tough graph is hamiltonian. 


This conjecture, in a slightly weaker form (restricted to locally-connected 
graphs), was formulated by Chvatal (1973), who noted that it would imply a the- 
orem of Fleischner (1974) on squares of graphs. 

Let G be a simple graph and m a positive integer. The graph with vertex set 
V(G) in which two vertices are adjacent if and only if their distance in G is at 
most m is called the mth power of G and denoted by G”. The square of G is the 
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graph G?, the cube of G the graph G®. A fairly easy induction argument shows that 
the cube of a connected graph on at least three vertices is hamiltonian (Sekanina 
1960). Fleischner proved something much stronger, namely that the square of a 2- 
connected graph is hamiltonian. Since squares of k-connected graphs are k-tough, 
Fleischner’s theorem is implied by Conjecture 4.51. A simpler proof of Fleischner’s 
theorem was found by Riha (1991). Underground (1978) noted that the problem 
of determining whether the square of a graph is hamiltonian is W P-complete. 
Win (1989) established a link between toughness and k-trees. 


Theorem 4.52. Let k be a positive integer. Then every (1/k)-tough graph has a 
(k + 2)-tree. 


A refinement, in terms of k-walks, was conjectured by Jackson and Wormald 
(1990a). 


Conjecture 4.53. Let & be a positive integer. Then every (1/k)-tough graph has 
a (k + 1)-walk. 


Attractive though Conjectures 4.51 and 4.53 are, the insight they would provide 
is questionable in light of the fact proved by Bauer et al. (1990) that, for any 
positive constant r, it is YW P-hard to determine whether a graph is f-tough; each of 
these conjectures thus relates two properties whose verification is WP-hard. What 
can be hoped for, nevertheless, is a polynomial-time algorithm that either finds a 
Hamilton circuit (respectively, a (k + 1)-walk) or else exhibits a set S of vertices 
showing that the input graph is not 2-tough (respectively, not (1/k)-tough). 

Many sufficient conditions for hamiltonicity can be weakened marginally by 
imposing a toughness condition (see, for instance, the survey by Bauer et al. 1991). 
Improvement is possible, of course, only if the extremal graphs for the condition 
in question are themselves not tough; imposing toughness then serves to exclude 
them. Hagegkvist (1992) achieved more remarkable results by imposing a stronger 


necessary condition, which one might call path-toughness. A graph G is path-tough 
if 


m(G\S) <|S| 


for every nonempty proper subset S of V(G), or equivalently, if each vertex-deleted 
subgraph is traceable. Hamiltonian graphs are clearly path-tough. Haggkvist’s in- 
sight was that Dirac’s minimum-degree condition can be relaxed substantially for 
path-tough graphs. His lower bound of 8(n — 1)/17 on the minimum degree was 
further reduced to (2n — 3)/5 by Dankelmann et al. (preprint). Schiermeyer (1992) 
made the following conjecture as to the best possible bound. 


Conjecture 4.54, Let G be a simple path-tough graph of minimum degree 5 on n 
vertices, where 6 > (n — 2)/3 and n > 3. Then G is either hamiltonian or isomor- 
Phic to the Petersen graph. 
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Relatively little is known about nonhamiltonian path-tough graphs. Two special 
classes have been studied, however. A graph G is hypohamiltonian if every vertex- 
deleted subgraph G \v is hamiltonian but G is not, and hypotraceable if every 
vertex-deleted subgraph G \ v is traceable but G is not. R. Sousselier (see Berge 
1963) observed that the Petersen graph (fig. 8) is hypohamiltonian; Bondy (1972) 
showed likewise that the Coxeter graph (fig. 9) has this property. But there are 
many other examples. The definitive sources on this topic are Thomassen (1978, 
1981a) where a variety of constructions are described, including one for planar hy- 
pohamiltonian graphs and another for surprisingly dense hypohamiltonian graphs. 
Still, a number of basic questions remain. For instance, while it is easy to see that 
the minimum degree in a hypohamiltonian or hypotraceable graph must be at least 
three, no example is known in which each vertex is of degree at least four. 


Problem 4.55. Is there a hypohamiltonian graph of minimum degree at least four? 


This problem was posed by Thomassen (1978), who proved, with the aid of the 
Bridge Lemma (5.12), that no such graph can be planar. Also worth noting is a 
question of Grétschel (personal communication 1978) on hypotraceable graphs. 


Problem 4.56. Is there a bipartite hypotraceable graph? 


The Petersen graph (fig. 8) has girth five, and the Coxeter graph (fig. 9) girth 
seven. Hordk and Sirdfi (1986) showed how infinite families of maximally non- 
hamiltonian graphs of girths five and seven can be formed from these graphs, and 
asked the following question. 


Problem 4.57. Is there a maximally nonhamiltonian graph of girth g for all g > 3? 


5. Fundamental classes of graphs and digraphs 


We now discuss several classes of graphs and digraphs of particular interest: bi- 
partite graphs, planar graphs, regular graphs, vertex-transitive graphs, line graphs, 
claw-free graphs, oriented graphs and tournaments. As one would expect, stronger 
results than are valid for graphs in general often hold for members of these classes. 
In some instances, the improvements are significant. 


5.1. Bipartite graphs 


By Theorem 1.8, a graph is bipartite if and only if it contains no odd circuits. It 
is not surprising, therefore, that in problems relating the existence of circuits to 
vertex degrees, a good rule of thumb is that the conditions can be weakened by 
half when the graph in question is bipartite. We illustrate this general principle 
with two typical examples. The first is a bipartite analogue of Theorem 2.15 and a 
special case of a result of Jackson (1985). 
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Theorem 5.1. Let G be a simple nonseparable balanced bipartite graph of min- 
imum degree 8. Then G contains either a circuit of length at least 45-2 or a 
Hamilton circuit. 


‘Recall that the closure of a graph G on n vertices is the graph derived from 
G by recursively joining pairs of nonadjacent vertices having degree sum at least 
n. There is an analogous concept for bipartite graphs. The bipartite closure of a 
balanced bipartite graph G(X, Y) on 2 vertices is the graph derived from G by 
recursively joining pairs of vertices in different parts whose degree sum is at least 
n+1. In section 2, it was proved that a simple graph on at least three vertices 
is hamiltonian if and only if its closure is hamiltonian (Closure Lemma 2.6). The 
corresponding result for bipartite graphs can be established by a similar argument 
(Bondy and Chvdtal 1976). 


Theorem 5.2. A simple balanced bipartite graph G is hamiltonian if and only if 
its bipartite closure is hamiltonian. 


5.2. Planar graphs 


A planar embedding of a graph G is a function ¢ that assigns to each vertex a 
point in the plane and to each edge a simple curve in the plane, in such a way 
that the curves ¢(e,) and ¢(e,) meet at a point p if and only if p = ¢(v) for some 
vertex v which is incident in G to the edges e, and e2. A graph is planar if it admits 
a planar embedding; the Platonic graphs of fig. 1 are the prototypical examples. 
Note that a graph is planar if and only if it has an embedding in the sphere. A 
plane graph is a pair (G, @) where G is planar and ¢ is a planar embedding of 
G. The (topological) closures of the connected components of the complement of 
$(G) are the faces of G. An edge and face are incident if the edge is contained in 
the face. The edges incident to a face f form a closed walk in which cut edges are 
each traversed twice. The length of this walk is the degree d(f) of f. If the walk is 
a circuit, it is called a facial circuit. Two faces are adjacent if they are incident with 
a common edge; a face which is incident to a cut edge is adjacent to itself. The 
boundary of a face consists of those edges incident with the face and with some 
other face. The boundary of a face is always an even subgraph. 

If Gis a plane graph, the planar dual of G is the planar graph G* whose vertices 
are the faces of G, two vertices being joined by an edge e* if and only if the 
corresponding faces of G are incident to a common edge e. The bijection between 
E(G) and E(G") which maps e to e* has the striking property that it takes circuits 
to bonds and bonds to circuits. The cycle space of G and the bond space of G* 
are therefore isomorphic vector spaces. 

In a 2-connected plane graph, the boundary of each face is a circuit, and these 
facial circuits generate the cycle space; moreover, omitting any one of them results 
1n a cycle basis in which each edge occurs at most twice (because each edge is 


incident to two faces). MacLane (1937) showed that this property characterizes 
planar graphs. 
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Theorem 5.3. A graph is planar if and only if there is a basis for its cycle space in 
which each edge occurs at most twice. 


Kuratowski (1930) gave a quite different, and more significant, characterization 
of planar graphs. 


Theorem 5.4 (Kuratowski’s Theorem). A graph is planar if and only if it contains 
no subdivision of K33 or Ks. 


Despite these characterizations, and the existence of linear-time algorithms to 
test for planarity (see chapter 5), planar graphs remain something of an enigma, 
on account of the notorious Four-Colour Problem. A k-face colouring of a plane 
graph G with face set F(G) is a function c: F(G) > S, where S is a k-set. The 
face colouring c is proper if adjacent faces receive different colours; thus a graph 
has a proper face colouring if and only if it has no cut edge. A plane graph 
is k-face-colourable if it admits a proper k-face colouring. In 1852, EF Guthrie 
(see Biggs et al. 1976) asked whether every 2-edge-connected plane graph is 4- 
face-colourable. This innocent question was the prime motivation for much of the 
research undertaken in graph theory during the hundred years which followed. 
Known as the Four-Colour Problem, and later as the Four-Colour Conjecture, it 
was given an affirmative answer by Appel and Haken (1976). While the basic ideas 
of their proof are well understood, it is fair to say that the details are still a matter 
of some dispute, on account of their complexity. 


Theorem 5.5 (The Four-Colour Theorem). Every 2-edge-connected plane graph 
is 4-face-colourable. 


By duality, an equivalent statement is that every connected loopless planar graph 
is 4-vertex-colourable. Tait (1880) derived yet another equivalent formulation of 
the Four-Colour Conjecture: every 3-connected 3-regular planar graph is 3-edge- 
colourable. This settled the question, he firmly believed, it being widely supposed 
at the time that every 3-connected 3-regular planar graph was hamiltonian (andy 
hence 3-edge-colourable). However, Tutte (1946) showed the latter assertion to be 
false. Later, Grinberg (1968) gave a simple necessary condition for a plane graph 
to be hamiltonian and used it to produce a variety of nonhamiltonian 3-connected 
3-regular planar graphs, including the cyclically 5-edge-connected one shown in 
fig. 10. Ironically, Grinberg’s approach was familiar already to Kirkman (1881), 
but Kirkman, sharing Tait’s belief that every 3-connected 3-regular planar graph 
had at least one Hamilton circuit, employed it merely as an aid in searching for 
these elusive objects. 


Theorem 5.6 (Grinberg’s Condition). Let G be a plane graph on n vertices with 
a Hamilton circuit C. Denote by fj and f/’ the numbers of faces of G of degree i 
contained in the interior and exterior of C, respectively. Then 


YG -YGH-f') =0. 
i=l 
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Figure 10. The Grinberg graph. 


Proof. Denote by E’ the subset of E(G) \ E(C) contained in the interior of C. 
There are |E’|+1 faces of G in the interior of C, and so 


n 
SoA SIE 41. 


i=l 


Each edge in E’ lies on the boundary of two faces in the interior of C, and each 
edge of C lies on the boundary of one such face. Therefore 


n 


Sift =2|E | +n. 


i=l 


Eliminating |£’|, we have 


yi —2)f,=n-2. 
i=l 


Similarly, 
n 
SoG - Df! =n-2. 
i=] 


These two equations yield the stated identity. O 


To see that the graph of fig. 10 is nonhamiltonian, it suffices to consider the face 
degrees modulo three and observe that Grinberg’s condition cannot hold. It seems 
hard to construct bipartite counterexamples in this way, and Barnette (1969) has 
conjectured that there are none. (That the assumption of planarity is necessary 
here was shown by J.D. Horton; see Bondy and Murty, 1976, p. 240.) 


Conjecture 5.7. Every 3-connected 3-regular bipartite planar graph is hamiltonian. 
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Increasing the connectivity by one, we arrive at a major theorem of Tutte (1956). 
Theorem 5.8 (Tutte’s Theorem). Every 4-connected planar graph is hamiltonian. 


Theorem 5.8 is an easy consequence of a lemma about bridges of circuits in 
planar graphs (the Bridge Lemma, 5.12). Before stating this lemma, we briefly 
review the important role of bridges in planar graphs. (Bridges were introduced 
in section 4, before Theorem 4.16.) 

A characterization of planar graphs in terms of their bridge structure was given 
by Tutte (1959). This characterization is the basis of most planarity-testing algo- 
rithms. Let G be a graph and C a circuit of G. Two bridges B, and B, of C in 
G overlap if C cannot be divided into two segments C; and C, with the property 
that the vertices of attachment of B; to C lie on C;, i=1,2. The overlap graph 
of G with respect to C has as vertices the bridges of C in G, two bridges being 
adjacent if they overlap. 


Theorem 5.9. A graph G is planar if and only if, for every circuit C of G, the 
overlap graph of G with respect to C is bipartite. 


A circuit C in a graph G is nonseparating if it has at most one proper bridge, 
and peripheral if it is nonseparating and has no chords. Tutte (1963) showed that 
in a 3-connected plane graph a circuit is peripheral if and only if it bounds a face. 
It follows that a 3-connected planar graph has a unique plane embedding (in the 
sense that there is a unique planar dual) and that its peripheral circuits generate 
the cycle space. In fact, the latter assertion is true even for nonplanar graphs, as 
Tutte (1963) proved. 


Theorem 5.10. Jn a 3-connected graph the peripheral circuits generate the cycle 
space. 


An insightful exposition of the theory of bridges, leading to a refinement of his 
original proof of Theorem 5.8, is given by Tutte (1977). Bridges clearly play a very 
important role in the study of paths and circuits, and it can be argued that their" 
role is central. This is the point of view adopted by Voss (1991) in his treatise on 
circuits. Of relevance to the study of long circuits is the concept of the span of a 
bridge, introduced by Voss (1991). Let G be a graph, H a subgraph of G and B 
a bridge of H in G. The span of B (Voss uses the term length) is the maximum 
number of edges in a tree of B whose end vertices are the vertices of attachment 
of B to H. Voss (1991) made the following conjecture, and proved it for m < 3. 


Conjecture 5.11. Let G be a 2-connected graph, C a longest circuit of G and B;,, 
1 <# <™m, disjoint bridges of C which induce a tree in the overlap graph of G with 
respect to C. Then 


m 


>> s(Bi) < &(C)/2, 


i=] 


where s(B;) denotes the span of B; and £(C) the length of C. 
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We now return to the statement of the Bridge Lemma. 


Lemma 5.12 (The Bridge Lemma). Let G be a 2-connected plane graph, e an edge 
of G, C’ and C” the facial circuits of G which include e, and e' any edge of C'. 
Then there is a circuit C in G including both e and e! such that 

(i) each bridge of C in G has either two or three vertices of attachment; 

(ii) each bridge of C in G that includes an edge of C' or C" has exactly two 
vertices of attachment. 


Let G be a simple 4-connected plane graph, and define e, C’, C” and e’ as in 
the Bridge Lemma. Then G has a circuit C which includes e and e’ and satisfies 
conditions (i) and (ii). This circuit is necessarily a Hamilton circuit of G. Suppose, 
to the contrary, that C has a proper bridge. By (i), each bridge of C has at most 
three vertices of attachment. Because G is 4-connected, C must be a triangle and 
have exactly one proper bridge B, with three vertices of attachment. Moreover, 
because G is simple, C can have no degenerate bridge. It follows that C = C’ and 
that B includes an edge of C”. But this contradicts (ii), Thus every 4-connected 
planar graph is hamiltonian (Theorem 5.8). Thomassen (1983b) obtained an ana- 
logue (and strengthening) of the Bridge Lemma for bridges of paths in plane 
graphs, and deduced that every 4-connected planar graph is Hamilton-connected. 
A related class of 4-connected graphs are the graphs of simple 4-polytopes. D.W. 
Barnette (see Griinbaum 1970, p. 1145) has conjectured that these too are hamil- 
tonian. 

While they need not be hamiltonian, 3-connected planar graphs always have 
long circuits. Jackson and Wormald (1992a) employed the Bridge Lemma (5.12) 
to prove that every 3-connected planar graph on 7 vertices has a circuit whose 
length is of order at least n°, where c ~ 0.2. Examples due to Moon and Moser 
(1963), constructed from K, by recursively inserting a vertex of degree three in 
each face, show on the other hand that such a graph need not have any circuit 
whose length is of order more than n‘, where c := log 2/log 3 ~ 0.63. 

Another property enjoyed by 3-connected planar graphs was discovered by Bar- 
nette (1966), who proved that every 3-connected planar graph has a 3-tree. A 
stronger conjecture, proposed by Jackson and Wormald (1990a), was confirmed by 
Gao and Richter (1994). 


Theorem 5.13. Every 3-connected planar graph has a 2-walk. 


_ Properties of graphs embeddable in the plane and other surfaces are discussed 
in chapter 5: Embeddings and Minors. 


5.3. Regular graphs and vertex-transitive graphs 


Regular graphs are a particularly fascinating class. The classical examples are the 
graphs of the Platonic solids (fig. 1). Regular graphs of degree three are often 
teferred to as cubic graphs. They appear to capture much of the complexity of 
sraphs in general, and interesting examples abound, the Heawood graph (fig. 4), 
the Petersen graph (fig. 8), and the Coxeter graph (fig. 9) being three notable ones. 
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Nonhamiltonian d-regular d-connected graphs exist for all d > 3. A construction, 
based on the Petersen graph, was first given by Meredith (1973); moreover, an 
infinite family of such graphs in which the longest circuit is of length no more 
than n°, where n is the order and c (< 1) depends only on d, was described by 
Jackson and Parsons (1982). Nevertheless, as we have remarked, the imposition of 
regularity often has surprising consequences. A convincing illustration is provided 
by the following theorem of C.A.B. Smith (see Tutte 1946) on the parity of the 
number of Hamilton circuits in a 3-regular graph. 


Theorem 5.14 (Smith’s Theorem). Let G be a 3-regular graph and e an edge of 
G. Then the number of Hamilton circuits of G containing e is even. 


Smith’s Theorem (5.14) implies that a 3-regular graph with one Hamilton circuit 
has at least three such circuits. But, as we shall see, the proof is not constructive, 
and Chrobak and Poljak (1988) pose the intriguing problem of efficiently finding 
a second Hamilton circuit in such a graph. The proof we give makes use of the 
concept of an even 2-factor. A d-factor of a graph is a d-regular spanning subgraph. 
A 2-factor is even if each of its components is an even circuit. 


Proof. Consider the bipartite graph H(X, Y), where X is the set of even 2-factors 
of G containing e, Y is the set of 3-edge colourings of G, and an even 2-factor F 
containing e is joined to a 3-edge colouring (EF), Ey, E3) if and only if F = E; VE; 
for some {i,j} C {1,2,3}. A vertex F of X with c components has degree 2°~! in 
H, and each vertex of Y has degree two. Thus the vertices of H of odd degree are 
precisely the Hamilton circuits of G containing e. By Corollary 1.2, their number 
iseven. 


The hamiltonian problem remains “¥-complete even when restricted to 3- 
connected 3-regular graphs. Nevertheless, almost all such graphs are indeed hamil- 
tonian, by a theorem of Robinson and Wormald (1992), who also established the 
following more general result (Robinson and Wormald 1991). _ 


Theorem 5.15. For every integer d > 3, almost all d-regular graphs are hamilto- 
nian, 


Surprisingly, planarity changes the picture completely, as was shown by Rich- 
mond et al. (1985). 


Theorem 5.16. Almost all 3-connected 3-regular planar graphs are not hamilto- 
nian. 


Theorems 5,15 and 5.16 are probabilistic statements. For their precise meaning, 
we refer the reader to chapter 6: Random Graphs, or to the book of Bollobds 
(1985). 

Jackson (1980a) obtained an analogue of Dirac’s Theorem (2.1) for regular 
graphs. The effect of regularity here is quite striking. 
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Theorem 5.17 (Jackson’s Theorem). Let G be a 2-connected d-regular graph on 
n vertices, where d > n/3. Then G is hamiltonian. 


The bound on d, already close to best possible, was reduced to (n/3) — 1 for all 
d‘> 3 by Hilbig (1986); there are two exceptions, the Petersen graph (fig. 8) and 
the graph obtained from it by expanding one vertex to a triangle (that is, deleting 
the vertex, adding a triangle, and joining the three former neighbours of the vertex 
to the three vertices of the triangle by a matching). 

Jackson’s Theorem (5.17) indicates that regular graphs can be expected to con- 
tain considerably longer paths and circuits than graphs of the same minimum de- 
gree, and one may ask how long a path or circuit there must be in a 2-connected 
d-regular graph on 7 vertices. The following conjecture, due to Bondy (1978b), 
includes Jackson’s Theorem (5.17) as a special case. 


Conjecture 5.18. Let G be a 2-connected d-regular graph on n vertices, where 
d>n/r,r > 3, and n is sufficiently large. Then G contains a circuit of length at 
least 2n/(r — 1). 


This conjecture is of interest only when r is small compared to n; when r is 
of order n, regularity plays essentially no role and can be replaced by a condi- 
tion on the maximum degree. Voss (1991) showed that a 2-connected graph of 
maximum degree A on n vertices, where 3 < 4 <n —2, has a circuit of length at 
least 4 log, _,m—c, where c is an appropriate constant, while Lang and Walther 
(1968) constructed regular examples containing no circuits of length more than 
4 log, _,+4. For bipartite graphs, we have the following theorem of Jackson and 
Li (1994); Haiggkvist (1978a) conjectures that the bound on d can be reduced to 
n/6. 


Theorem 5.19. Let G be a 2-connected d-regular bipartite graph on n vertices, 
where d > (n/6)+8. Then G is hamiltonian. 


As noted earlier, the passage from 2-connected to 3-connected graphs often has 
a substantial effect on circuit structure. Regular graphs are no exception. Fraisse 
(1986) conjectures that the lower bound on the degree in Jackson’s Theorem (5.17) 
can be reduced significantly for 3-connected regular graphs. The following slightly 


sharper formulation of Fraisse’s conjecture is due to B. Jackson (personal commu- 
nication 1989). 


Conjecture 5.20. Let G be a 3-connected d-regular graph on n vertices, where 
d >n/4. Then G is hamiltonian unless G is either the Petersen graph (fig. 8) or 
the graph obtained from it by expanding one vertex to a triangle. 


Broersma et al. (preprint) showed that, if d > (n+3)/4, such a graph has a 
dominating circuit. Using this fact, they proved that every 3-connected d-regular 
graph on n vertices, where d > (2n/7) + 2, is hamiltonian. Surprisingly, raising the 
connectivity further has little effect. For d= 0 (mod 4), B. Jackson (see Jackson 
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et al. 1991) and H.A. Jung (see Min Aung 1989) independently found an example 
of a nonhamiltonian (d/2)-connected d-regular graph on 4d +1 vertices. 

Of relevance to Conjecture 5.20 is the following theorem of Fan (1985) and Jung 
(1984). 


Theorem 5.21. Let G be a 3-connected d-regular graph. Then G contains either a 
circuit of length at least 3d or a Hamilton circuit. 


When d is small, this theorem is weak. Jackson (1986) proved that a 3-connected 
3-regular graph on n vertices contains a circuit whose length is of order at least n°, 
where c := log,{1 + V5) — 1 ~ 0.69. Examples based on the Petersen graph (fig. 8) 
show, on the other hand, that such a graph need not have any circuit whose length is 
of order more than n‘, where c := log 8/log 9 = 0.96. However, these constructions 
work because the resulting graphs are not cyclically 4-edge-connected; they have 
many 3-edge cuts, which serve to restrict the possible routes taken by a circuit. 
The following conjecture is due to the author (see Jackson 1986), and is implied 
by successively stronger conjectures of Thomassen (1986), Matthews and Sumner 
(1984) and Jackson (1992) (see section 5.4, below). 


Conjecture 5.22. Let G be a 3-regular cyclically 4-edge-connected graph on n 
vertices. Then G contains a circuit of length at least cn, where c is a positive 
absolute constant. 


The question of circuits through specified vertices in regular graphs was con- 
sidered by Holton et al. (1982) and by Kelmans and Lomonosov (1982a). Both 
groups of authors proved the following theorem on 3-regular graphs. 


Theorem 5.23. Let G be a 3-connected 3-regular graph, and let S be a set of nine 
vertices of G. Then there is a circuit of G that includes every vertex of S. 


The Petersen graph (fig. 8) is 3-connected and 3-regular but has no circuit includ- 
ing all ten vertices. Ellingham et al. (1984) showed that any 3-connected 3-regulai’ 
graph having a set S of ten vertices not lying on a common circuit necessarily 
contains a subdivision of the Petersen graph in which S is the set of vertices of de- 
gree three (see, also, Kelmans and Lomonosov 1982b). One may ask the following 
general question. 


Problem 5.24. Does there exist a constant c > 1 such that, for every d-regular 
d-connected graph G and for every set S of |cd| vertices of G, where d > 2, there 
is a circuit of G that includes every vertex of 5? 


Regular digraphs, also, have been studied with regard to circuit structure. For 
instance, Thomassen (1981b) generalized Corollary 4.18 to digraphs. 


Theorem 5.25. Let D be a strict d-diregular digraph on 2d +1 vertices, where d > 
1. Then either D has a directed Hamilton circuit or D is isomorphic to one of two 
exceptional digraphs of orders five and seven. 
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Figure 11. A digraph analogue of the Petersen graph. 


The larger of the two exceptional digraphs in Theorem 5.25 is shown in fig. 11. 
Found by Bondy (1978b), it has several properties reminiscent of the Petersen 
graph (fig. 8), being 3-diregular, maximally nonhamiltonian, and also hypohamil- 
tonian. Moreover, as R. Haggkvist (personal communication 1978b) observed, the 
sets of outneighbours of its seven vertices constitute the edges of the Fano hyper- 
graph (fig. 3). 

In this chapter, we have chosen to concentrate on longest circuits in graphs. 
It is appropriate here, however, to say a word about the girth of regular graphs. 
Regular graphs of diameter k and girth 2k +1 are called Moore graphs, regular 
graphs of given girth with the minimum number of vertices cages. The Petersen 
graph (fig. 8) is a Moore graph and also a cage; the Heawood graph (fig. 4) is a 
cage, too. In general, members of both these classes tend to have much symmetry, 
and are mainly studied by algebraic methods. We refer the reader to chapter 31 
or to the survey by Wong (1982) for further details. 

The directed analogue of the cage appears to have a far simpler structure. For 
d21 and k 22, consider the digraph whose vertices are the integers modulo 
d(k —1)+1, (i,j) being an edge if and only if 1<j-—is<d. This digraph is d- 
diregular and has girth k. Behzad et al. (1970) conjectured that no digraph with 
these properties has fewer vertices. 


Conjecture 5.26. Let D be a strict d-diregular digraph on 7 vertices. Then D has 
girth at most [n/d]. 


We conclude this section by noting that several of the results mentioned here 
do not depend on strict regularity for their conclusions but rather on bounds on 
the minimum and/or maximum degrees. For instance, Jackson (1980a) gave an 
extension of Theorem 5.17 to almost-regular graphs, and Jackson and Wormald 
(1993) generalized the theorem of Jackson (1986) on longest circuits in 3-connected 
3-regular graphs to graphs of arbitrary maximum degree as follows. 


Theorem 5.27. Let G be a 3-connected graph of maximum degree A on n vertices. 
Then G has a circuit of length at least n° /2, where c:= (1 + log, 3 +2 log, A)“. 
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Caccetta and Haggkvist (1978) proposed a generalization of Conjecture 5.26. 


Conjecture 5.28. Let D be a strict digraph of minimum outdegree 5* on n ver- 
tices. Then D has girth at most [2/5*]. : 


This conjecture has been verified for 5+ < 5 (see Hoang and Reed 1987) and 
a near-optimal upper bound, of (#/5*) +2500, was established by Chvatal and 
Szemerédi (1983); the constant has since been reduced, but not eliminated. Sur- 
prisingly, the case when &* 2 n/3 is still unresolved. 

An important subclass of regular graphs, with correspondingly stronger proper- 
ties, are those that are vertex-transitive. An automorphism of a graph is an iso- 
morphism from the graph to itself. A graph is vertex-transitive if its automorphism 
group is transitive on its vertices. Vertex-transitive graphs can be constructed in 
a variety of ways. Two particularly interesting families are the Kneser graphs and 
the Levi graphs. 

Let m be a positive integer and S an n-set, where n > 2m. The simple graph 
whose vertices are the m-subsets of 5, two being adjacent if they are disjoint, 
is called a Kneser graph and denoted by KG(m,n). Such graphs, first defined by 
Kneser (1955), are clearly vertex-transitive. The Petersen graph (fig. 8) is KG(2, 5); 
the Coxeter graph (fig. 9) can be derived from KG(3, 7) by deleting the seven edges 
of a Fano hypergraph (fig. 3). 

Let PG(2,n) be a finite projective plane of order n, with point set P and line set 
L. The incidence graph of the hypergraph with vertex set P and edge set L is called 
the Levi graph of PG(2,n) and denoted by LG(2,7); the Heawood graph (fig. 4) 
is LG(2, 2). If n=, a prime power, the Levi graph LG(2,q) is vertex-transitive 
(see, for example, Biggs and White 1979), 

Lovasz (1970) considered the hamiltonicity of vertex-transitive graphs, and asked 
the following question. 


Problem 5.29. Does every connected vertex-transitive graph have a Hamilton 
path? 


This is so for graphs on p, 2p, 3p, 4p, 5p, p?, 2p” and p* vertices, where p is 
prime (see Alspach 1981). In fact, besides K, and K, only four simple connected 
vertex-transitive graphs without Hamilton circuits are known: the Petersen graph 
(fig. 8), the Coxeter graph (fig. 9), and the 3-regular graphs derived from these by 
expanding each vertex to a triangle. C. Thomassen (see Bermond 1978b) conjec- 
tures that there are only finitely many such graphs. Babai (1979) has shown that 
every connected vertex-transitive graph on 7 vertices has a circuit whose length is 
of order at least n!/2, 

Groups give rise to vertex-transitive graphs in a natural way. Let IT" be a group 
and X a set of elements of I’. The digraph D(I’, X) with vertex set I” and edge set 
{(g,gx): g€ I, x € X} is the Cayley digraph of I with respect to X. If X-! =X, 
D(I,X) is the associated digraph of a simple graph G(I,X), called the Cayley 
graph of F with respect to X. The graph G(I’, X) and digraph D(I, X) are loopless 
if and only if 1¢X and connected if and only if X generates I. When F is a 
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cyclic group, the resulting Cayley graph or digraph is called a circulant or directed 
circulant, respectively. The Koh digraph (fig. 2) is a directed circulant on the group 
I := 2, with generating set X := {1,5}. T.D. Parsons (see Witte and Gallian 1984) 
proposed the following conjecture. 


Conjecture 5.30. Every Cayley graph has a Hamilton circuit. 


A review of progress on this conjecture is given in the survey by Witte and 
Gallian (1984). Vertex-transitive graphs and other highly symmetric graphs are 
discussed in chapter 27. 


5.4. Line graphs and claw-free graphs 


The line graph of a graph G is the graph with vertex set E(G), two vertices being 
adjacent if and only if they are adjacent (as edges) in G. (Line graphs should more 
properly be called edge graphs, but the name is by now too well entrenched to be 
changed.) The line graph of G is denoted by L(G). 

The question of hamiltonicity of line graphs is intimately related to the notion 
of an Euler tour, a closed trail that includes every edge of the graph. A graph is 
eulerian if it admits an Euler tour. Such graphs were characterized, not surprisingly, 
by Euler (1736). 


Theorem 5.31. A graph admits an Euler tour if and only if it is connected and 
even. 


A proof of Theorem 5.31 was given only much later, by Hierholzer (1873). 
Hierholzer’s proof yields a linear-time algorithm for finding an Euler tour in an 
eulerian graph. Harary and Nash-Williams (1965) made the key observation linking 
Hamilton circuits in line graphs to Euler tours. 


Proposition 5.32. Let G be a graph on at least three edges. Then L(G) is hamil- 
tonian if and only if G has a dominating eulerian subgraph. 


Jaeger (1979) established a sufficient condition for a graph to contain a spanning 
(and hence dominating) eulerian subgraph. 


Theorem 5.33. Every 4-edge-connected graph has a spanning eulerian subgraph. 


As Thomassen (1986) observed, it follows from Theorems 5.32 and 5.33 that 
line graphs of 4-edge-connected graphs are hamiltonian. By similar methods, Zhan 
(1991) showed that 7-connected line graphs are Hamilton-connected. These results 
lend support to the following conjecture of Thomassen (1986). 


Conjecture 5,34. Every 4-connected line graph is hamiltonian. 


A basic property of line graphs is that they contain no induced subgraph isomor- 
Phic to K, 3. Such a subgraph is referred to as a claw, and a graph which contains 
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no claw is called claw-free. The original motivation for studying claw-free graphs 
was that they include line graphs as a proper subclass; Minty (1980) and Sbihi 
(1980) described polynomial-time algorithms for finding a maximum stable set in 
a claw-free graph, thereby extending Edmonds’s maximum matching algorithm 
(1965), and a third such algorithm can be found in Lovdsz and Plummer (1986, 
pp. 471-480). The evidence now, however, is that claw-free graphs are of interest 
in their own right. 

In a claw-free graph, the subgraph induced by the neighbours of a vertex has 
stability number one or two. If this subgraph is disconnected, it necessarily consists 
of two disjoint cliques; if it is connected, it contains a Hamilton path by Corol- 
lary 4.17. Claw-free graphs therefore include two quite distinct subclasses: those 
in which each neighbourhood induces a disconnected subgraph (for instance, line 
graphs of triangle-free simple graphs of minimum degree at least two), and those 
in which each neighbourhood induces a connected subgraph (the octahedron and 
icosahedron of fig. 1 being two examples). Graphs with this latter property are 
called locally-connected. 

Graphs in the former class do not necessarily have Hamilton circuits; indeed, 
if G is an arbitrary simple 3-regular graph, the graph H obtained from G by 
expanding each vertex to a triangle is hamiltonian if and only if G is hamiltonian; 
moreover, G and H have the same connectivity. Since it is NP-hard to determine 
whether an arbitrary 3-connected 3-regular graph is hamiltonian, we deduce that 
it is NP-hard to determine whether an arbitrary 3-connected 3-regular claw-free 
graph is hamiltonian. On the other hand, Oberly and Sumner (1979) proved that 
every connected, locally-connected claw-free graph is hamiltonian. 

One may ask how long a circuit a k-connected claw-free graph on 7 vertices 
must contain. For k = 2, an upper bound can be derived from the work of Lang 
and Walther (1968), who constructed 2-connected 3-regular graphs on ” vertices 
with no circuit of length more than 4 log,n + 4 (see the discussion following Con- 
jecture 5.18). By expanding each vertex to a triangle, one obtains 2-connected 
claw-free graphs on 3n vertices with no circuit of length more than 12 log, n + 12. 
Jackson and Wormald (personal communication 1992b) proved that this lower ° 
bound is of the right order of magnitude. They also obtained an analogous result 
for 3-connected claw-free graphs. 


Theorem 5.35. Let G be a 3-connected claw-free graph on n vertices. Then G 
contains a circuit of length at least n°, where c is a positive constant. 


A similar construction shows that this theorem is sharp, too, apart from the value 
of the constant c. The case of 4-connected graphs was addressed by Matthews 
and Sumner (1984), who proposed the following conjecture, a strengthening of 
Conjecture 5.34. 


Conjecture 5.36. Every 4-connected claw-free graph is hamiltonian. 


A still stronger conjecture was put forward by Jackson (1992). A Tutte circuit is 
a circuit of length at least four each of whose bridges has at most three vertices 
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of attachment. The Bridge Lemma (5.12) guarantees the existence of such circuits 
in 2-connected planar graphs on at least four vertices. Jackson conjectures their 
existence in 2-connected claw-free graphs as well. 


Conjecture 5.37. Every 2-connected claw-free graph on at least four vertices con- 
tains a Tutte circuit. 


A similar strengthening of Theorem 5.33 was also proposed by Jackson (1992). 


Conjecture 5.38. Every simple 2-edge-connected graph on at least four edges 
contains a connected even subgraph on at least four edges each of whose bridges 
has at most three edges of attachment. 


Conjecture 5.38 would imply that every 2-connected 3-regular graph on at least 
four vertices has a Tutte circuit, and this would imply the truth of Conjecture 5.22 
(with c = 3/4). 

Li (1988) proved that the bound on the degree of regularity in Jackson’s Theo- 
rem (5.17) can be reduced even further when the graphs are assumed also to be 
claw-free. 


Theorem 5.39. Let G be a 2-connected d-regular claw-free graph on n vertices, 
where d > n/5 and d > 10. Then G is hamiltonian. 


5.5, Oriented graphs and tournaments 


Whereas two edges (one in each direction) may link a pair of vertices in a strict 
digraph, at most one edge is permitted in an oriented graph. For this reason, and 
by analogy with the situation that obtains in bipartite graphs, one might expect 
conditions for the existence of directed paths and circuits in oriented graphs to 
be roughly half as demanding as their digraph counterparts. This is sometimes the 
case, but there are exceptions. Moreover, best possible results are much harder 
to come by. Jackson (1980b) showed that an oriented graph with minimum in- 
degree and outdegree k has a directed path of length at least 2k. However, the 
corresponding question for directed circuits, also considered by Jackson (1980b), 
remains unresolved. 


Conjecture 5.40. Let D be a strong oriented graph with minimum indegree and 
outdegree at least k. Then D has a directed circuit of length at least 2k +1. 


This conjecture is sharp when k is small compared to the order of D, but not 
when k is large. Haggkvist (1993) proved that an oriented graph on a vertices with 
minimum indegree and outdegree at least cn, where c := (1/2) — 2718 = 0.499996, 


a a directed Hamilton circuit, and conjectured that a much smaller constant will 
0. 


Conjecture 5.41. Let D be an oriented graph of minimum outdegree 5* on n 
vertices, where 5+ > (31 — 2)/8. Then D contains a directed Hamilton circuit. 
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An analogous conjecture for diregular oriented graphs was proposed by Jackson 
(1981b). 


Conjecture 5.42. Every k-diregular oriented graph on at most 4k +1 vertices, 
where k # 2, contains a directed Hamilton circuit. 


Evidently, questions about oriented graphs are much easier to come by than 
are answers, and we mention here just one more. Due to Thomassen (1987a), it 
concerns a property which resembles maximal nonhamiltonicity but is meaningful 
only for oriented graphs. 


Problem 5.43. Is there a nonhamiltonian oriented graph in which the reversal of 
any edge results in a hamiltonian graph? 


Remark 5.44. There do exist oriented graphs with the slightly weaker property 
that reversing any edge increases the length of a longest directed circuit; one such 
family, found by Thomassen (1987b), are the oriented graphs C50Cio,,7, k > 0. 


While oriented graphs in general are poorly understood, the theory of tour- 
naments is rich and highly developed, as the beautiful theorems of Rédei (3.4), 
Thomason (3.7) and Forcade (3.8) on Hamilton paths and circuits in tournaments 
demonstrate. Two directed analogues of Hamilton-connectedness were considered 
by Thomassen (1980a). 

A digraph is weakly Hamilton-connected if any two vertices are connected by a 
directed Hamilton path (in at least one direction) and strongly Hamilton-connected 
if any two vertices are connected by directed Hamilton paths in both directions. 
Thomassen (1980a) characterized the weakly Hamilton-connected tournaments 
and proved that 4-connected tournaments are strongly Hamilton-connected. He 
also showed that each edge of a 3-connected tournament lies in a directed Hamil- 
ton circuit. Thomassen’s methods were adapted by Bang-Jensen et al. (1992), re- 
sulting in a polynomial-time algorithm for finding, in a tournament, a directed 
Hamilton path with prescribed head and tail, if there is one. In particular, the al- 
gorithm can be applied to determine whether a tournament is weakly or strongly 
Hamilton-connected and whether there is a directed Hamilton circuit through a 
prescribed edge in a tournament. Bang-Jensen et al. (1992) also made the following 
conjecture. 


Conjecture 5.45. For each fixed k, there exists a polynomial-time algorithm for 
deciding if there is a directed Hamilton circuit through k prescribed edges in a 
tournament. 


Bang-Jensen and Thomassen (1992) proved that the above problem is WP- 
complete if k is not fixed, and they described a polynomial-time algorithm for 
finding a directed circuit in a tournament through two prescribed edges, if there 
is one. This latter problem is WP-complete for digraphs in general, by a result of 
Fortune et al. (1980). 
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We close this brief discussion of oriented graphs by defining the bipartite ana- 
logue of a tournament and noting a bipartite analogue of Camion’s Theorem (3.9) 
due to Gutin (1984) and, independently, Haggkvist and Manoussakis (1989). A bi- 
partite tournament is an orientation of a simple complete bipartite graph. A directed 
d-factor of a digraph is a d-diregular spanning subgraph. 


Theorem 5.46. A strong bipartite tournament has a directed Hamilton circuit if 
and only if it has a directed 1-factor. 


The proof by Haggkvist and Manoussakis (1989) of Theorem 5.46 makes use 
of a neat trick. The bipartite tournament T(V,, V2) is transformed into a 2-edge- 
coloured complete graph G on V, UV, by assigning colour i to each edge of 
G[V;] and to each edge of T whose tail lies in V;, i = 1,2. Noting now that T 
has a directed Hamilton circuit if and only if G has a Hamilton circuit whose 
edges alternate in colour, Haggkvist and Manoussakis (1989) apply a necessary and 
sufficient condition due to Bankfalvi and Bankfalvi (1968) for a 2-edge-coloured 
complete graph to possess such a circuit. 


6. Special proof techniques for paths and circuits 


Thus far, a variety of proof techniques have been introduced. Some, such as count- 
ing in two ways (Proposition 1.1), induction (Proposition 1.5), contradiction (The- 
orem 1.7), search trees (Theorem 1.8), maximal counterexample (Theorem 2.12), 
and inclusion—exclusion (Theorem 3.4) are of general applicability, while others are 
expressly designed to handle questions on paths and circuits. Among the latter are 
the use of maximal paths (Theorem 1.3), longest circuits (Theorem 2.1, Proof 1), 
longest paths (Theorem 2.1, Proof 2), lollipops (Theorem 2.1), the closure (The- 
orem 2.7), bypasses (Theorem 3.9) and vines (Theorem 4.11). In this section, we 
discuss three additional proof techniques specific to paths and circuits. 


6.1. Thomason’s lemma 


Let G be a graph and x a vertex of G. Recall that an x-path is a path in G with 
tail x. If P is an x-path in G whose head y is adjacent to a vertex z of P other 
than its immediate predecessor on P, then P + yz is an (x, z)-lollipop and thus, by 
the Lollipop Lemma (2.2), contains a second spanning x-path Q. The x-paths P 
and Q are said to be related by an elementary exchange. The graph whose vertices 
are the longest x-paths of G, two such paths being adjacent if and only if they 
are related by an elementary exchange, is called the x-path graph of G. Thomason 
(1978) proved a basic lemma on x-paths with the aid of this concept. 


Lemma 6.1 (Thomason’s Lemma). Let G be a connected graph on at least two 
vertices, and let x be a vertex of G. Then the number of longest x-paths of G that 
terminate in a vertex of even degree is even. 
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Proof. Let H be the x-path graph of G and P a longest x-path of G. If P termi- 
nates in y, 


dy(P) =dg(y) -1. 


Thus y is of even degree in G if and only if P is of odd degree in H. Since the 
number of vertices of odd degree in H is even, by Corollary 1.2, the number of 
longest x-paths in G that terminate in a vertex of even degree is also even. Q 


Corollary 6.2. Let G be a graph on at least three vertices, and let x and y be two 
vertices of G. If each vertex of G other than x and y is of odd degree, the number 
of Hamilton xy-paths in G is even. 


Proof. We may assume that G has at least one Hamilton xy-path; otherwise, the 
conclusion is trivial. The longest x-paths of G \ y are then Hamilton paths of G \ y, 
and each Hamilton xy-path of G is an extension of such a path. Let P be a longest 
x-path of G\y terminating in z. If z is of odd degree in G\y, the number of 
edges between y and z is even, since z is of odd degree in G. Thus P gives rise to 
an even number of Hamilton xy-paths in this case. On the other hand, if z is of 
even degree in G\ y, then P gives rise to an odd number of Hamilton xy-paths; 
by Thomason’s Lemma (6.1), however, the number of such paths P is even. Hence 
the total number of Hamilton xy-paths in G is alsoeven. O 


Smith’s Theorem (5.14) is an immediate consequence of Corollary 6.2;.it suffices 
to choose u and uv as the ends of the given edge e. Thomason (1978) also applied 
his Lemma (6.1) to derive a generalization of the theorem of J. Bosdk and A. 
Kotzig (see Bosdk 1967) asserting that every 3-regular bipartite graph of order at 
least four has an even number of Hamilton circuits. 

The x-path graph has other applications. Hendry (1984, 1986) considered the 
class of graphs in which there is a unique Hamilton path from one vertex x to 
every other vertex. He showed that the x-path graph of such a graph is a foresta 
with d(x)/2 components, and deduced that every such graph on n vertices has 
precisely 3(n — 1)/2 edges. 


6.2. Pésa’s lemma 


The naive approach to finding a long path in a graph is to grow a maximal path 
P and consider the paths obtainable from P by means of elementary exchanges 
(applied at either end); if one of these is not maximal, it can be extended to 
a maximal path longer than P. The procedure may then be repeated. While this 
approach fails badly on certain graphs, Pésa (1976) proved that it works remarkably 
well on most graphs of sufficiently high edge density. His argument hinges on the 
following lemma. 


Lemma 6.3 (Pésa’s Lemma). Let G be a graph, P a longest path in G and 9 the 
set of all paths of G obtainable from P by elementary exchanges. Denote by S the 
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set of ends of paths in P and by S~ and S* the sets of vertices immediately preceding 
and following the vertices of S on P, respectively. Then 


N(S) CS-USUS*. 


Proof. Let x € S and y € V(G)\ (S~ USUS*), and consider an x-path Q in 9. 
If y¢ V(G)\ V(P), x and y cannot be adjacent because Q is a longest path in 
G. Suppose, then, that y ¢ V(P) \(S” US US*). Then y has the same neighbours 
on each path in ?, because an elementary exchange that removed one of these 
neighbours would, at the same time, establish either it or y as an element of S. If 
x and y were adjacent, an elementary exchange applied to Q would thus yield a 
path whose head was a neighbour of y on P, a contradiction. Thus, in both cases, 
x and y are nonadjacent. O 


Pésa (1976) used a simple consequence of his Lemma 6.3, most easily stated 
using the concept of the degree of a set of vertices. Let S be a set of vertices in a 
graph G. The degree of S in G, denoted by d(S), is the number of neighbours of 
S in G\S, that is, 


d(S) := |N(S)\ S|. 


(A priori, it might seem more natural to define the degree of a set S of vertices as 
the total number of neighbours of S, |N(S)|, but the definition given here appears 
to be the right one. Note that the two definitions coincide when S is a stable set.) 


Corollary 6.4. Let G be a graph and k and @ integers, where k > —1, £ > 1 and 
k <@. If d(S) >2|S|+k for every nonempty set S of vertices such that |S| < [(é- 
k +1)/3], then G contains a path of length 0. 


Proof. Let P be a longest path in G and define S as in Pésa’s Lemma (6.3). Then 
N(S)\S CS~ US*, and so d(S) <2|S|—2. By the hypothesis of Corollary 6.4, 
|S| > [(@ —k +1)/3]. Let S’ be a subset of S such that |S’| = {(@ — k + 1)/3]. Then 
d(S’) > 2|S'| +k. Since V(P) D S'UN(S’), 


v(P) > |S’ UN(S')| =|S']+ d(S’) 2 3|S'|+k > 241. 
Thus P is of length at least 2. Of 


A graph on n vertices in which d(S) > c|S| for all sets S of at most [dn] vertices 
is called a (c,d)-expander. The case k = 0 of Corollary 6.4 asserts that (2,1/3)- 
expanders possess Hamilton paths. Expanders have many useful properties (see, 
for example, the survey article by Bien 1989). Ways of constructing them are 
discussed in chapter 32. Note that the case k = ¢ — 2 of Corollary 6.4 is simply 
Theorem 1.3(i). 

Using Corollary 6.4, Pésa proved that a random graph with 7 vertices and 
cn log n edges, where c is a sufficiently large constant, is almost surely hamil- 
tonian, Apart from the value of the constant c, this result is best possible. Pésa’s 
bound was sharpened fully by Komlés and Szemerédi (1983). Corollary 6.4 has 
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been invoked to help resolve a number of other problems on paths and circuits 
in random graphs. We limit ourselves here to one remarkable application by Beck 
(1983); further applications can be found in the stimulating survey article by Frieze 
(1989). ; 


Theorem 6.5. There exists a family of graphs {G;: £ > 3} such that e(G,) < 9002 
and every subgraph of G, with at least e(G,)/2 edges contains a path of length €. 


Remark 6.6. Theorem 6.5 implies that in any 2-edge-colouring of such a graph 
G, there is a monochromatic circuit of length m. Statements of this type, assert- 
ing the existence of a monochromatic configuration in an edge-coloured graph or 
hypergraph, are studied under the collective name of Ramsey Theory. Chapter 25 
by NeSettil is devoted to this topic, as is the book by Graham et al. (1980). 


6.3. Woodall’s hopping lemma 


P6sa’s Lemma (6.3) demonstrates the effectiveness of iterative procedures in estab- 
lishing sufficient conditions for Hamilton circuits; so too does the Closure Lemma 
(2.6). Here, we discuss a powerful technique of this kind, developed by Woodall 
(1973). 


Lemma 6.7 (The Hopping Lemma). Let G be a graph and C a longest circuit of G 
such that G\ C has as few components as possible. Suppose that some component 
of G\C is an isolated vertex, u. Put Xo := N(u) and define, recursively, sets Xj, 
i21, by 

X; = N((X,_, NX7_1) U {u}), 
where S~ and S*, respectively, denote the sets of vertices immediately preceding and 
following the vertices of S on C with respect to a fixed orientation. Set 

X := lim X;. 

t—00 

Then (i) X C V(C), and (ii) XN X* =. 
Corollary 6.8. Let G, C, u and X be defined as in the Hopping Lemma (6.7) and 
Set 

Y:= XxX’. 
Then (i) N(Y U {u}) = X, (ii) Y U {u} is @ stable set, and (iii) v(C) > 3|X| —|¥|. 
Proof. Assertion (i) follows directly from the definitions of X and Y. Assertion 
(ii) is implied by assertion (i), part (ii) of the Hopping Lemma (6.7) and the 
definition of Y. To prove (iii), consider the partition of V(C) induced by the |X| 
disjoint segments of C whose tails belong to X and whose heads belong to X~. By 
part (ii) of the Hopping Lemma (6.7) each such segment has at least two vertices. 


Moreover, by the definition of Y, a segment has exactly two vertices if and only if 
it includes a vertex of Y. Therefore 


v(C) > 3(|X|-|¥1)+2|¥|=3/X|-|¥|. oO 
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We give one illustration, due to Woodall (1978a), of the use of the Hopping 
Lemma (6.7). [The following statement is slightly stronger than the one given by 
Woodall, who considered only the case 2 =n and imposed the condition d(S) > 
(n + |S| — 1)/3 on all nonempty sets S of vertices. The proof, however, is the same.] 


Theorem 6.9. Let G be a simple 2-connected graph of minimum degree 5 on n 
vertices, where & > (n+ 2)/3, and let € be a positive integer. If d(S) > (£ + |S|—1)/3 
for every stable set S of vertices of G such that 2 < |S| < (€ + 1)/2, then G contains 
a circuit of length at least €. 


Proof. Let C be a longest circuit of G. Suppose that C is of length less than é. 
By Theorem 2.16, V(G) \ V(C) is a stable set. Let uc V(G)\ V(C), and define 
the sets X and Y as in the Hopping Lemma (6.7) and Corollary 6.8. By part (ii) 
of Corollary 6.8, Y U {u} is a stable set. Since Y cC V(C), we deduce that 


IY U{u}| < fu(C) +1 < 3(2 +1). 
By parts (i) and (iii) of Corollary 6.8, 

d(¥ U{u}) = |X] < L0(C) +|¥) < HE +|¥|—0), 
contradicting the hypothesis, with S:= YU{u}. a 


Theorem 6.9 implies that a simple 2-connected graph on 7 vertices in which 
the minimum degree is at least ( +2)/3 and the degree of any stable set of two 
vertices is at least 2/2 is hamiltonian. It thus generalizes Dirac’s Theorem (2.1) 
and, for graphs of minimum degree at least (nm +2)/3, Ore’s Theorem (2.4). The 
Petersen graph (fig. 8) is an example of a simple nonhamiltonian 2-connected graph 
on 7 vertices in which the degree of any stable set of two vertices is at least n/2. 
Broersma et al. (1993) characterized all such graphs, thereby generalizing Ore’s 
Theorem (2.4) fully. Of related interest is a generalization of Dirac’s Theorem (2.1) 
by Hagekvist (1985b), restated here in terms of degrees of sets of vertices. 


Theorem 6.10. Let G be a simple graph of minimum degree 5 on n vertices. If 
there is a positive integer k < 6 such that d(S) > n — 6 for every set S of k vertices, 
then G has a Hamilton circuit. 


The Hopping Lemma (6.7) (and variants of it) has diverse applications, among 
them the theorems of Haggkvist and Thomassen (4.8) and Jackson (5.17) referred 
to earlier. Suitably generalized, it may well be the appropriate tool to handle open 
questions such as Conjectures 4.7, 4.9, 4.22, 4.51, 5.11, 5.18, 5.20, 5.22 and 5.36 
and Problems 4.55 and 5.24, as well as others to be discussed in later sections, 
such as Problem 7.14. The following conjectures, due to Thomassen (personal 
communication 1976; see also Alspach and Godsil 1985, p. 466) and to Fleischner 
and Jackson (1989), respectively, should also be mentioned in this context. 


Conjecture 6.11. Let C be a longest circuit in a 3-connected graph. Then C has 
at least one chord. 
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Conjecture 6.12. Let G be a 3-regular cyclically 4-edge-connected graph. Then 
G has a dominating circuit. 


Conjecture 6.12 is implied by Conjecture 5.36 and implies Conjecture 5.22. Fleis- 
chner and Jackson (1989) have shown that it is equivalent to the conjecture of 
Thomassen (1986) that every 4-connected line graph is hamiltonian, Fouquet and 
Thuillier (1990) that it is equivalent to the assertion that every 3-regular cyclically 
4-edge-connected graph has a dominating circuit containing two specified non- 
adjacent edges. (Fouquet and Thuillier 1990, establish a similar strengthening of 
Conjecture 5.7.) Conjecture 6.12 is true for planar graphs by the Bridge Lemma 

5.12). 

ie toughness is a measure of how well-knit a graph is. Another such measure 
is the binding nurnber, introduced by Woodall (1973). This is the largest value of 
t such that |N(S)| > ¢|S| for all sets S of vertices such that N(S) # V. Unlike the 
toughness, the binding number can be computed in polynomial time, using network 
flow techniques (Cunningham 1990). Woodall (1973) applied the Hopping Lemma 
(6.7) to prove that a loopless graph is hamiltonian if its binding number is at least 
3/2. His conjecture that such a graph is pancyclic under the same condition was 
verified by Shi (1987). 


Theorem 6.13. A loopless graph with binding number at least 3/2 is pancyclic. 


7. Lengths of circuits 


Most of the discussion so far has centred on long circuits, and we have been 
content to establish lower bounds on the lengths of circuits in graphs with given 
properties, rather than their exact values (except, of course, in the case of Hamilton 
circuits). Here, we consider questions of the latter type. We discuss conditions 
on the minimum degree and the number of edges guaranteeing the existence of 
various types of circuits. 

Two basic lemmas, both due to Erdés (1965a,b), prove to be very useful in 
attacking extremal questions such as these. Roughly speaking, the first asserts that 
a graph with many edges has a subgraph of large minimum degree, the second 
that a graph of large minimum degree has a bipartite spanning subgraph of large 
minimum degree. 


Lemma 7.1. Let G be a graph on n vertices and more than (c — 1)n edges, where 
C is @ positive integer. Then G has a subgraph H of minimum degree at least c. 


Proof. Any minimal subgraph H with more than (c — 1)v(H) edges has the re- 
quired property: if H had a vertex uv of degree at most c — 1, the subgraph H \v 
would contradict the choice of H. 0 


Lemma 7.2. Every graph G has a bipartite spanning subgraph B such that dy(v) > 
dcg(v)/2 for all vertices v. 


Basic graph theory: Paths and circuits 75 


Proof. Any bipartite spanning subgraph B(X, Y) with the maximum number of 
edges has the required property: if B had a vertex v of degree less than dg(v)/2, 
where without loss of generality v € X, the bipartite spanning subgraph with bi- 
partition (X \v, ¥ U {v}) would contradict the choice of B. DO 


These lemmas are often invoked in proofs of extremal theorems. If a best possi- 
ble result is not sought, but only one in which the number of edges is of the right 
order of magnitude, their use serves to restrict the class of graphs under consid- 
eration to bipartite graphs of large degree, and such graphs are easier to treat. 
One reason for this is that breadth-first search trees, which are very useful in the 
analysis of circuit structure, take on a particular form in bipartite graphs of large 
degree: the sets of vertices equidistant from the root are stable and initially tend 
to grow rapidly in size with increasing distance from the root. 


7.1. Circuits of given length modulo k 


The existence of odd and even circuits has already been covered in some detail 
(see, for example, Theorems 1.8 and 4.13 and the discussions thereon). Barefoot 
et al. (1991) considered the case of circuits of length 0 modulo 3. They proved 
that simple 3-regular graphs necessarily have such circuits, and made a stronger 
conjecture, proved by Chen and Saito (1994). 


Theorem 7.3. Every simple graph of minimum degree at least three contains a 
circuit of length 0 modulo 3. 


A still more general conjecture was formulated by N. Dean (see Chen and Saito 
1994). 


Conjecture 7.4. Every simple graph of minimum degree at least k, where k 2 3, 
contains a circuit of length 0 modulo k. 


The case k = 4 of this conjecture was verified by Dean et al. (1993). Other circuit 
lengths modulo 3 were considered by Dean et al. (1991), who showed that a simple 
2-connected graph of minimum degree at least three contains a circuit of length 1 
modulo 3 unless it is the Petersen graph (fig. 8), and a circuit of length 2 modulo 
3 unless it is Ky or K33 (see also Saito 1992), 

Thomassen (1983c) proved that every simple graph of minimum degree at least 
2£(k +1) contains a circuit of even length £ modulo k. Note that a bipartite graph 
has no circuit of odd length modulo k if k is even, no matter how large its minimum 
degree. On the other hand, for odd k, Bollobds (1977) established the existence 
of a function f such that every simple graph with minimum degree at least f(k) 
contains circuits of all lengths modulo k; Thomassen (1983c) likewise established 
the existence of a function g such that every simple 2-connected nonbipartite graph 
with minimum degree at least g(k) contains circuits of all lengths modulo k, and 
speculated as to the best possible such function g. 


Conjecture 7.5. Every simple 2-connected nonbipartite graph of minimum degree 
at least k +2 contains circuits of all lengths 2 modulo k. 


76 J.A. Bondy 


A degree condition for the existence of a directed circuit of length 0 modulo 
k in a strict digraph was given by Alon and Linial (1989). The same condition 
guarantees, more generally, a directed circuit of weight 0 modulo k in a graph with 
integral edge weights. Alon (1991) has asked whether there exists a polynomial- 
time algorithm for finding such a circuit; the proof of Alon and Linial, which uses 
probabilistic methods, does not supply one. 


7.2. Circuits of given length 


The complete bipartite graph Kj/2n/2 has n?/4 edges and no odd circuits. Theo- 
rem 1.7 tells us that any simple graph on 7 vertices and more than this number of 
edges contains a triangle. Bondy (1971b), with the aid of Theorem 2.9, showed that 
any such graph contains circuits of all lengths 2, 3 < £ < [(m +3)/2]. This number 
of edges is extremal for odd @ but not for even @. In the range |(n +3)/2] <é <n, 
Woodall (1972) determined the exact bound, namely (‘5') = (""$**); the connected 
graph with blocks K,_, and K,_¢,2 has this number of edges but no circuit of length 
£. 

The situation as regards short even circuits is less clear. Reiman (1958) observed 
that only n°/?/2+n/4 edges are needed to guarantee the existence of a circuit of 
length four; a simple graph G with this number of edges satisfies the inequality 


os) " (3) 
x (7) >G 
and thus has two vertices with a pair of neighbours in common, that is, a 4-circuit. 
By exploiting the structure of breadth-first search trees, Bondy and Simonovits 
(1974) showed, more generally, that a simple graph on n vertices and more than 
cn'*"/! edges, where c := (2) is a function only of £, contains a 2¢-circuit. Apart 
from the value of c, this number of edges is probably extremal; it is known to be 
so for = 2, 3 and S. For instance, the Levi graph LG(2,q) of a finite projective 
plane of order g has 2(q? + q + 1) vertices, (q + 1)(q? + q + 1) edges and no 4-circuit* 
Brown (1966) and Erdés et al. (1966b) constructed denser examples in the case 
£ =2, also based on finite projective planes, and Fiiredi (1983, preprint) proved 
that these examples are precisely the extremal graphs when n = q7+q+1andq ¢ 
{1,7,9,11, 13}. Benson (1966) settled the cases £ = 3 and @ = 5, also by constructing 
graphs from finite geometries. Conceptually simpler examples, with fewer edges 
but still of the right order of magnitude, were constructed by Wenger (1991) for 
the same three values of @. 


Problem 7.6. For each integer @ > 2 and for infinitely many values of n, construct 
a simple graph on n vertices and c n'*+!/é edges, where ¢ is a constant depending 
only on @, containing no 2é-circuit. 


By Lemma 7.2, the extremal problem is no easier for bipartite graphs. For 
digraphs, on the other hand, the extremal question was completely settled by 
Haggkvist and Thomassen (1976). They determined the number of edges needed 
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in a digraph on n vertices to guarantee the existence of a directed circuit of length 
é, and also derived bounds that are close to best possible for the restriction of this 
problem to strong digraphs. These and many other extremal questions on circuits 
in digraphs are discussed in the survey by Bermond and Thomassen (1981) and in 
the book by Bollobds (1978). 


7.3. Circuits of many lengths 


The proof of Theorem 1.3 shows that a simple graph of minimum degree k has at 
least k —1 different circuit lengths; this is best possible, in view of the complete 
graph K,,,. A more sensitive measure of the richness of circuit lengths in a graph 
— the sum of their reciprocals - was proposed by Erdés and Hajnal (1966) (see 
also Erdés 1975). Defining 


2£(G) = )*{1/£: G has a circuit of length ¢}, 
and noting that 
L(Kiu) log k+y—3 and = L(Ky ) © 3 (log k + y — 3), 


where y is the Mascheroni constant (y ~ 0.5772), they asked if there exists a 
constant c such that the sum of the reciprocals of the circuit lengths in a simple 
graph of minimum degree k is at least c log k, regardless of the order of the graph. 
An affirmative answer was provided by Gyarfas et al. (1984). Note that, by virtue 
of Lemma 7.1, the same remarkable property holds (with a different constant) in 
a graph of average degree k. Sparse graphs, with average degree 1 + (1/k), where 
k is sufficiently large, exhibit a similar behaviour. Gyarfas et al. (1985) proved 
that the sum of the reciprocals of the circuit lengths in such a graph is at least 
1/(300k log k). 


7.4. Circuits of all lengths 


Pancyclic graphs, those that have circuits of all possible lengths, were introduced 
in section 2. Sufficient conditions for hamiltonicity frequently imply pancyclicity. 
Corollary 2.10 is a typical instance of this phenomenon, which provides convinc- 
ing evidence that it is hard to capture the essence of hamiltonicity. This is not 
surprising, of course, hamiltonicity being an W P-complete property. 

A digraph analogue of Theorem 2.9 was established by Haggkvist and 
Thomassen (1976), and Corollary 2.10 was generalized to digraphs by Thomassen 
(1977). A bipartite analogue of Theorem 2.9 was considered by Entringer and 
Schmeichel (1988). Using similar methods, they proved that a hamiltonian bipar- 
tite graph on 2n vertices and more than n”/2 edges is bipancyclic, that is, has 
Circuits of every even length 22, 2 < <n. This bound on the number of edges 
is not thought to be best possible, however, and could conceivably be reduced 


by a-factor of two. Specifically, one has the following conjecture of Mitchem and 
Schmeichel (1985). 


Conjecture 7.7. Let G be a simple hamiltonian bipartite graph on 27 vertices and 
at least n*/4 +n +1 edges. Then G is bipancyclic. 
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Mitchem and Schmeichel (1985) also proposed a related conjecture, which can 
be regarded as the bipartite analogue of Corollary 2.10. 


Conjecture 7.8. Let G be a simple hamiltonian bipartite graph of minimum de- 
gree 5 on 2n vertices, where 5* — 5 > n. Then G is bipancyclic. 


The Levi graphs of finite projective planes (see sections 5.3 and 7.2) show that 
the lower bound on 6 here is necessary. 

We turn now to pancyclicity in tournaments, recalling that every strong tourna- 
ment is vertex pancyclic. A digraph on 7 vertices is edge-é-cyclic if each edge lies 
in a directed circuit of length 2, and edge pancyclic if each edge lies in a directed 
circuit of each length £, 3< @ <n. Wu et al. (1981) proved that every edge-3- 
cyclic tournament on na vertices is edge-€-cyclic for all £, 3 < 2 < |(n +5)/2]. This 
somewhat surprising result was extended by Tian et al. (1982). 


Theorem 7.9. An edge-3-cyclic tournament is edge pancyclic unless it belongs to 
one of two specified families. 


One corollary of Theorem 7.9 is the theorem of Alspach (1967) that regular 
tournaments are edge pancyclic. A number of other properties stronger than pan- 
cyclicity have been studied. Faudree and Schelp (1975) define a graph on n ver- 
tices to be panconnected if any two vertices are connected by paths of every length 
é, d<é@ <n-—1, where d is their distance. Hendry (1990) calls a circuit C extend- 
able if there is a circuit C’ such that v(C’) = v(C)+1 and V(C’) > V(C), and a 
graph G circuit extendable if G contains at least one circuit and every nonspanning 
circuit of G is extendable. He also introduces the concept of a pancyclic ordering, 
a linear ordering v,,v2,...,v, of the vertices of a graph such that the subgraph 
induced by {v;,v2,...,0;} is hamiltonian for all 7, 3 <i<n. 


7.5. The number of circuits 


Results on the existence of circuits lead naturally to questions about their number, 
and to some intriguing conjectures. It is lower bounds on the number of circuits 
that are of principal interest. We consider, first, two problems on the total number 
of circuits. 

The number of circuits in 3-connected 3-regular graphs was investigated by Bare- 
foot et al. (1986), who observed that the total number of circuits in a 3-regular 
graph on 7 vertices which has no circuit of length more than n‘, where c < 1, is 
necessarily subexponential in n. Such graphs exist (see the discussion following 
Theorem 5.21). Barefoot et al. (1986) conjectured, however, that the number of 
circuits is always moderately large. 


Conjecture 7.10. The number of circuits in a 3-connected 3-regular graph on n 
vertices is superpolynomial in n. 


A long-standing conjecture on the number of directed circuits in a digraph is 
the following one, due to Adém (1964). 
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Conjecture 7.11. Let D be a strict digraph with at least one directed circuit. Then 
D has an edge whose reversal reduces the total number of directed circuits. 


Although not explicit in Ad4m’s formulation, the requirement that D be strict 
js essential. Examples of digraphs with multiple edges in which the reversal of any 
edge increases the number of circuits have been described by Thomassen (1987b) 
and by Grinberg (1987). Thomassen’s examples are based on the existence of 
digraphs such as those described in Remark 5.44, in which the reversal of any edge 
increases the length of a longest directed circuit. Multiplying each edge suitably 
many times yields a digraph with the desired property. 

Theorems asserting the existence of Hamilton circuits in graphs are numerous, 
prominent among them being those of Dirac (2.1), Ore (2.4), Chvdtal and Erdés 
(4.16), Jackson (5.17), and Tutte (5.8). In each case, one may ask how many Hamil- 
ton circuits there must be. Surprisingly, of these theorems, only Tutte’s has been 
investigated from this aspect (although the theorems of Dirac and Ore have been 
studied with regard to the number of edge-disjoint Hamilton circuits; see the dis- 
cussion following Theorem 8.8). Hakimi et al. (1979) showed that a 4-connected 
planar graph on n vertices has at least n/ log, n Hamilton circuits and conjectured 
that this bound can be improved to 2(m — 2)(n — 4). 

A tournament has at least one directed Hamilton path, by Rédei’s Theorem (3.1), 
but need not have more. Moon (1972) showed, however, that a strong tournament 
on n vertices has at least c’-' directed Hamilton paths, where c := 6'/4 ~ 1.565. 
Thomassen (1985b) examined the same question for directed Hamilton circuits. By 
Camion’s Theorem (3.9), a strong tournament has at least one directed Hamilton 
circuit but, again, need have no more. Thomassen nevertheless found an exponen- 
tial lower bound in terms of the minimum outdegree: a strong tournament with 
minimum outdegree at least 3k +3 has at least 4*k! directed Hamilton circuits. 
Thomassen (1980b) also obtained an analogue of Moon's theorem, proving that a 
2-strong tournament on 7 vertices has at least c*/32-1 = 1.014" directed Hamilton 
circuits. Thomassen (personal communication 1990) remarks that no construction 
is known of a tournament with more than (m — 1)!/2" directed Hamilton circuits, 
although an elementary counting argument establishes the existence of such a tour- 
nament. A related question posed by Thomassen (personal communication 1990) 
is the following one. 


Problem 7.12. Is there a diregular tournament on n vertices with at least (n — 
1)!/2" directed Hamilton circuits? 


As already noted, Smith’s Theorem (5.14) implies that a 3-regular hamiltonian 
graph has at least three Hamilton circuits. One may ask which have precisely three. 
The smallest example is the graph on two vertices joined by three edges, K;’. 
Bosdk (1967) observed that any graph obtainable from this graph by recursively 
expanding vertices to triangles has the same property and conjectured, incorrectly, 
that there are no others; Nint4k (1974) found a nonplanar counterexample, a 
generalization of the Petersen graph, and asked whether Bosdk’s conjecture is 
true when restricted to planar graphs. 


80 J.A. Bondy 


Problem 7.13. Let G be a 3-regular planar hamiltonian graph with precisely three 
Hamilton circuits. Can G be obtained from K,” by recursively expanding vertices 
to triangles? Equivalently, if G K®), must G contain a triangle? 


If G is a 3-regular graph with precisely three Hamilton circuits H,, 2, H3, the 
sets E; := E(G) \ E(H,), i = 1,2,3, constitute a proper edge colouring in which the 
edges of any two colours induce a Hamilton circuit. The graphs possessing such an 
edge colouring were introduced by Kotzig (1962) under the name of Hamiltonian 
graphs; they are now referred to as strongly hamiltonian, perfectly 1-factorable or 
Hamilton 1-factorable. A selection of open problems about them is given in Kotzig 
and Labelle (1979). 

It is not known whether a simple 4-regular hamiltonian graph must have at least 
two Hamilton circuits. Sheehan (1975) conjectures that this is so. Fleischner (1994) 
has constructed a 4-regular uniquely hamiltonian multigraph; thus the condition 
that the graph be simple cannot be dropped. Also, Entringer and Swart (1980) 
have constructed uniquely hamiltonian simple graphs of minimum degree three. 
No example with minimum degree four is known. 


Problem 7.14. Is there a uniquely hamiltonian simple graph of minimum degree 
four? 


8. Packings and coverings by paths and circuits 


This is an important and extensive topic, rich in challenging open questions. The 
concepts of packing and covering are best explained in the context of hypergraphs. 


8.1. Packings and coverings of hypergraphs 


Recall that a hypergraph H consists of a set V(H) of elements, called vertices, 
and a family E(H) of subsets of V(H), called edges. A packing of H is a set of 
disjoint edges, a covering of H a set of edges whose union is V(H). A partition Sf 
H is a set of edges that constitutes at the same time a packing and a covering. 

Given a hypergraph H, several questions naturally arise. How many edges can 
there be in a packing of H? How few edges can there be in a covering of H? Does 
H admit a partition? If so, how many or how few edges can there be in a partition 
of H? Consideration of the first question leads to the concept of a transversal of a 
hypergraph, a set of vertices that meets every edge in at least one vertex. Packings 
and transversals are linked in a simple way. 


Proposition 8.1. Let H be a hypergraph, ? a packing of H, and T a transversal 
of H. Then 


|P| <7. 


Proof. Because 7 is a transversal, each edge of # contains at least one vertex 
of T. On the other hand, because # is a packing, no two edges in # contain the 
same vertex of 7. The inequality follows. 0 


Basic graph theory: Paths and circuits 81 


Proposition 8.1 implies the following inequality for any hypergraph: 
max{|#|: # a packing of H} < min{|T|: T a transversal of H}. 


Hypergraphs H for which equality holds in this inequality are said to have the 
minmax property. Such hypergraphs are of particular import in the field of com- 
binatorial optimization. In order to find a packing of maximum cardinality or a 
transversal of minimum cardinality in a hypergraph with the minmax property, it 
suffices to find a packing # and a transversal T of the same cardinality: Proposi- 
tion 8.1 guarantees that is then maximum and T minimum. Finding such optimal 
packings and transversals efficiently is, of course, another matter. However, it is a 
striking fact that almost all packing problems for hypergraphs satisfying the min- 
max property do indeed admit polynomial-time algorithms. This phenomenon has, 
as yet, no theoretical foundation. 

We now define what we mean by a packing or covering of a graph. Associated 
with a graph G and a family ¥ of subgraphs of G are two hypergraphs, one 
having vertex set V(G) and edge set {V(F): F € #}, the second having vertex set 
E(G) and edge set {E(F): F ¢ #}. Packings and coverings of these hypergraphs 
correspond in a natural way to packings and coverings of G by members of ¥, 
and lead to the following definitions. 

A family ¥ of subgraphs of a graph G is a vertex packing of G if the members of 
¥ are vertex-disjoint, an edge packing if they are edge-disjoint, a vertex covering 
if UregV(F) = V(G) and an edge covering if UrcgE(F) = E(G). A partition (or 
vertex partition) is a vertex packing which is also a vertex covering, a decomposition 
(or edge partition) an edge packing which is also an edge covering. We shall be 
interested in the case where the family ¥ consists of paths and/or circuits. 


8.2. Packings by paths and circuits 


For paths, the vertex-packing problem is trivial unless restrictions are placed on 
the paths. If they are required to be of length one, we have the maximum matching 
problem referred to in section 4 (at the end of the discussion of stability number). 
In this case, the minmax property is valid for bipartite graphs, but not in general 
~ a subtler minmax property holds for all graphs (see chapter 3). If the paths 
are required to be of any other fixed length, the problem is W#-complete, as is 
the vertex-packing problem for circuits of any fixed length (Kirkpatrick and Hell 
1983). 

Vertex packings by circuits do not satisfy the minmax property: only one cir- 
cuit can be packed into Ks, for instance, whereas a transversal requires at least 
three vertices. Nevertheless, a relationship does exist between the cardinalities of 
a maximum packing and a minimum transversal. 


Theorem 8.2, Let G be a graph and k a positive integer. Either G has k disjoint 
circuits or there is a set of at most cklogk vertices in G meeting all circuits, where 
c ts an absolute constant. 


This theorem, due to Erdés and Pésa (1962), leads to a straightforward 
polynomial-time algorithm for determining if a graph has k disjoint circuits, for 
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fixed k. Let s := [(cklogk]. One first checks each subgraph G\S, where § CV 
and |S| =s; there are no more than n° of these, where 7 is the order of G. If no 
such subgraph is a forest, Theorem 8.2 guarantees that G has k disjoint circuits. If, 
on the other hand, some G \ S is a forest, every circuit of G meets S. The circuits 
of G can then be enumerated directly. If C is a circuit meeting S in a subset R, 
there are at most n? choices for the two edges of C incident to any vertex v of 
R, and so at most n”" choices for all the edges of C incident to R, where r := |R|. 
Moreover, each such choice gives rise to at most one circuit of G; if there were 
two, their symmetric difference would be a nonempty even subgraph of the forest 
G\S. Thus the number of circuits of G is polynomial in n. Each set of k circuits 
can now be checked individually. 

Whether there is also a function f such that every digraph has either k disjoint 
directed circuits or a set of f(k) vertices meeting all directed circuits is not known. 
This question was first asked by Gallai (1968b). Thomassen (1983d) noted that 
the existence of such a function would imply a polynomial-time algorithm for 
determining whether a digraph has k disjoint directed circuits. Younger (1973) 
conjectured that {(2) = 3. This was confirmed by McCuaig (1993). 


Theorem 8.3. Let D be a digraph. Either D has two disjoint directed circuits or 
there is a set of at most three vertices in D meeting all directed circuits. 


Corradi and Hajnal (1963) gave a sharp bound on the minimum degree for a 
graph to have k disjoint circuits. 


Theorem 8.4. Let k be a positive integer and G a simple graph of minimum degree 
6 on n vertices, where 6 > 2k and n > 3k. Then G has k disjoint circuits. 


Thomassen (1983a) conjectured that if n is a sufficiently large function of k, 
the hypothesis of Theorem 8.4 guarantees the existence of k disjoint circuits of 
the same length. This was proved by Egawa (preprint). Haggkvist (1985c) showed 
that regular graphs have many disjoint circuits of the same length; there are &t 
least n/64(log, m)?, for instance, in a 3-regular graph on 7 vertices. For digraphs, 
Thomassen (1983d) proved the existence of a function f such that any strict digraph 
of minimum outdegree at least f(k) has k disjoint directed circuits, and further 
conjectured the existence of a function g such that any strict digraph of minimum 
outdegree at least g has k disjoint directed circuits of the same length. He also 
ventured a conjecture as to the smallest value of the function f (see, also, Bermond 
and Thomassen 1981). 


Conjecture 8.5. Let k be a positive integer and D a strict digraph of minimum 
outdegree at least 2k — 1. Then D has & disjoint directed circuits. 


Edge packings by xy-paths in graphs, and by (x,y)-paths in digraphs, satisfy 
the minmax property, by Menger’s Theorem (4.1). Lucchesi and Younger (1978) 
established that edge packings by circuits also have the minmax property, in planar 
digraphs. 
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Theorem 8.6. Let D be a planar digraph. Then the maximum number of edge- 
disjoint directed circuits in D is equal to the minimum number of edges of D whose 
deletion destroys all directed circuits. 


A related minmax equality for planar digraphs was conjectured by Woodall 
(1978b). 


Conjecture 8.7. Let D be a planar digraph. Then the minimum number of edges 
in a directed circuit of D (the girth of D) is equal to the maximum number of 
disjoint sets of edges of D the deletion of each of which destroys all directed 
circuits. 


Theorem 8.6 and Conjecture 8.7 are special cases of the Lucchesi- Younger The- 
orem and Woodall’s conjecture, respectively. Stated in dual form, in terms of di- 
rected bonds rather than directed circuits, the latter apply to all digraphs, not just 
planar ones. A related result, on edge transversals of maximal directed edge cuts, 
was obtained by Maamoun (1985). 


Theorem 8.8. in any digraph, there is a directed path or directed circuit whose 
edges meet every maximal directed edge cut. 


McCuaig (1993) obtained an edge analogue of Theorem 8.3. 


Theorem 8.9. Let D be a digraph. Either D has two edge-disjoint directed circuits 
or there is a set of at most three edges in D meeting all directed circuits. 


Theorem 2.11 refines Dirac’s Theorem (2.1) by giving a lower bound on the 
number of edge-disjoint Hamilton circuits in a graph satisfying the hypotheses 
of Dirac’s Theorem (2.1). A similar refinement of Ore’s Theorem (2.4), but valid 
only for graphs of relatively small minimum degree, was given by Li (1989). Corol- 
lary 4.18 shows that a simple 2d-regular graph on at most 4d +1 vertices has at 
least one Hamilton circuit. Jackson (1979) proved that every simple d-regular graph 
on n vertices, where d > (n —1)/2 and n > 14, contains at least |(3d — n+ 1)/6] 
edge-disjoint Hamilton circuits, deducing this from his Theorem 5.17. Thomassen 
(1982) conjectured the existence of a function f such that every f(k)-connected 
tournament has k edge-disjoint directed Hamilton circuits. The case of triangles 
was examined by Tuza (1984), who made the following conjecture. 


Conjecture 8.10. Let G be a simple graph and k a positive integer. Either G has 


k +1 edge-disjoint triangles or there is a set of at most 2k edges in G meeting all 
triangles. 


Tuza (1990) made a similar conjecture for directed triangles in tournaments 
and established a minmax equality for directed triangles in planar oriented graphs 
(Tuza’ 1995), 

We have omitted mention here of conditions on the number of edges guaran- 
teeing the existence of k disjoint or edge-disjoint circuits. For these, the reader 
may consult either chapter 23 or the book of Bollobds (1978). 
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8.3. Coverings by circuits 


We have already seen a variety of ways in which theorems on the existence of 
long circuits can be generalized. Yet another way is to determine how few circuits 
it takes to cover the vertices or edges of the graph in question. For instance, 
Theorem 4.47 implies that a strong digraph on n vertices with stability number 
a has a directed circuit of length at least n/a, while Conjecture 4.33 makes the 
stronger assertion that a directed circuits suffice to cover the vertex set of the 
digraph. In the same vein, Kouider and Lonc (preprint) established the following 
generalization of Theorem 2.14. 


Theorem 8.11. Let k be a positive integer and G a simple graph on n vertices such 
that 0x,; 2n. Then V can be covered by k circuits, edges or vertices. 


When a(G) < k, this is a consequence of Corollary 4.20. The following conjec- 
ture on edge coverings by circuits was proposed by Bondy (1990). If true, it would 
imply the theorem of Pyber (1985) that the edges of a simple graph on n vertices 
can be covered by n — 1 circuits and edges. 


Conjecture 8.12. Let G be a simple 2-connected graph on n vertices. Then E(G) 
can be covered by at most (2n — 1)/3 circuits. 


Thomassen (1985b) proved that the edges of a diregular tournament on n ver- 
tices can be covered by 12 directed Hamilton circuits. This result is of interest with 
regard to a question of PJ. Kelly on decompositions of diregular tournaments into 
directed Hamilton circuits (see Conjecture 8.25 and the subsequent discussion). 

The cardinality |@| of a circuit covering € is but one measure of the efficiency 
of the covering; another is the length |||] of the covering, defined by 


Nell = Sel). 
Cee t 
We denote the minimum length of a circuit covering of a 2-edge-connected graph 
G by ce(G). 

Thomassen (preprint) conjectured that the problem of determining whether 
cc(G) < k is NP-complete. Nevertheless, a good lower bound for cc(G) can be 
found in polynomial time by determining a shortest postman tour of G. A postman 
tour of a connected graph G is a closed walk that includes every edge of G. We 
denote the length of a shortest postman tour of G by pt(G). A polynomial-time 
algorithm for finding a minimum-weight postman tour in an edge-weighted graph 
was developed by Edmonds and Johnson (1973). It is based on the shortest-path 
algorithm of Dijkstra (1959) and the weighted matching algorithm of Edmonds 
(1965). The idea is to find shortest paths between all pairs of vertices of odd de- 
gree, and then determine a pairing of these vertices for which the set of connecting 
shortest paths is of minimum total weight. A postman tour that traverses the edges 
of the selected paths twice and each other edge once is one of minimum weight. 
If @ is a circuit covering of a 2-edge-connected graph G, the edge-disjoint union 
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of the circuits in @ is an eulerian spanning supergraph of G and thus admits an 
Euler tour of length ||@||. The corresponding tour in G is a postman tour of the 
same length. It follows that 


cc(G) > pt(G). 


Clearly, cc(G) = pt(G) = e(G) if G is eulerian, and cc(G) > pt(G) > e(G) other- 
wise. An upper bound for cc(G) in terms of e(G) was conjectured by J.C. Bermond, 
B. Jackson and F. Jaeger (F. Jaeger, personal communication, 1989). 


Conjecture 8.13. Every 2-edge-connected graph on e edges admits a circuit cov- 
ering of length at most 7e/5. 


The bound of 7e/5 is attained by the Petersen graph (fig. 8). Bermond et al. 
(1983) proved that a 2-edge-connected graph on e edges admits a circuit covering 
of length at most Se/3; this was generalized to edge-weighted graphs by Fan (1990). 


8.4. Partitions into paths and circuits 


The question of partitioning a graph or digraph into as few paths or circuits as 
possible is strongly related to — indeed includes — the hamiltonian problem. We 
have already studied many aspects of this question. Moreover, path partitions of 
digraphs and their connections to stable sets and colourings were discussed in 
section 4. 

Path partitions of 3-regular graphs were examined by Reed (1989), who proved 
that any simple 2-connected 3-regular graph on n vertices can be partitioned into 
at most 7/9 paths and conjectured a slightly stronger conclusion. 


Conjecture 8.14, Let G be a simple 2-connected 3-regular graph on n vertices. 
Then G admits a partition into at most 7/10 paths. 


One may consider, more specifically, partitions of a graph or digraph into paths or 
circuits of prescribed length. El-Zahar (1984) proposed an appealing generalization 
of Dirac’s Theorem (2.1) in this vein. 


Conjecture 8.15. Let be a positive integer, > 3, and let m +22 + ---+n,, where 
n; > 3, be a partition of n into k parts, s of which are odd. If G is a simple graph 
of minimum degree 6 on n vertices, where 5 > (n + 5)/2, then G admits a partition 
into k circuits, of lengths ny, m2, ...,M,. 


The case k =2 was verified by El-Zahar. The case n = 3k is true by virtue of 


Theorem 8.4, but would also be a consequence of the following conjecture of L. 
Pésa (see Erdés 1964). 


Conjecture 8.16. Let G be a simple graph of minimum degree 6 on n vertices, 
where 6 > 2n/3. Then G contains the square of a Hamilton circuit. 
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Fan and Kierstead (1995) established an asymptotic version of this conjecture 
by showing that G contains the square of a Hamilton circuit if, for some « > 0, 
& > (2+ )n/3 and n is sufficiently large (as a function of ¢). They also (Fan and 
Kierstead, preprint) proved the analogue (and corollary) of Conjecture 8.16 for 
Hamilton paths. 


Theorem 8.17. Let G be a simple graph of minimum degree & on n vertices, where 
8 > (2n — 1)/3. Then G contains the square of a Hamilton path. 


Aigner and Brandt (1993) verified a weaker form of Conjecture 8.16 in which 
the bound (1 +s)/2 is replaced by (2n — 1)/3. This result is implied by Theorem 
8.17. 


8.5. Decompositions into paths and circuits 


A path decomposition or circuit decomposition is a decomposition into paths or 
circuits, respectively, a Hamilton decomposition a decomposition into Hamilton 
circuits. Similar definitions apply to digraphs, a path or circuit decomposition con- 
noting a decomposition into directed paths or circuits as the case may be. Circuit 
decompositions were touched on briefly in section 1. Just as Theorem 3.4 refines 
Rédei’s Theorem (3.1), so Theorem 1.4 was refined by Toida (1973). 


Theorem 8.18. Amn even graph admits an odd number of circuit decompositions. 


Theorem 8.18 can be proved using Thomason’s Lemma (6.1) (Bondy and Hal- 
berstam 1986); for a different approach, see Woodall (1990). Another refinement 
of Theorem 1.4, on decompositions of planar graphs into even circuits, was found 
by Seymour (1981). 


Theorem 8.19. Let G be a 2-connected planar even graph on an even number of 
edges. Then G admits a decomposition into even circuits. a 


The number of circuits in a decomposition is the subject of a longstanding con- 
jecture of G. Hajés (see Lovdsz 1968, and Dean 1986). 


Conjecture 8.20. Every simple even graph on n vertices admits a decomposition 
into at most (n — 1)/2 circuits. 


Hajos’s conjecture was verified for planar graphs by Tao (1984). T. Gallai (see 
Lovasz 1968) proposed an analogous conjecture on path decompositions. 


Conjecture 8.21. Every simple connected graph on 7 vertices admits a decompo- 
sition into at most (n+ 1)/2 paths. 


Lovasz (1968) proved that every simple connected graph on 7 vertices admits 
a decomposition into at most n/2 paths and circuits. If the graph is odd, such a 
decomposition must necessarily consist of n/2 paths; thus Lovdsz’s theorem implies 
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Conjecture 8.21 in this case. Donald (1980) refined Lovdsz’s argument, establishing 
that every simple connected graph on n vertices admits a decomposition into at 
most (39/4) + (7/2) paths, where mp and m, denote the numbers of vertices of 
even and odd degree, respectively. 

: A related question, on decompositions into circuits and edges, was considered 
by P. Erdés and T. Gallai (P. Erdés, personal communication, 1990; see also Erd6s 
et al. 1966a, where it is discussed without attribution). 


Conjecture 8.22. Every simple graph on n vertices admits a decomposition into 
at most cv circuits and edges, where c is an absolute constant. 


Dean (1986) reviews analogues of Conjecture 8.20 for digraphs and poses several 
conjectures, of which the following is typical. 


Conjecture 8.23. Let D be an eulerian oriented graph on n vertices. Then D 
admits a decomposition into at most 2n/3 directed circuits. 


Theorem 8.8, applied inductively, shows that a strict digraph on n vertices admits 
a decomposition into at most k directed paths or circuits, where k is the number of 
edges in a largest directed edge cut, and is thus at most n”/4. This can be viewed 
as a digraph analogue of the theorem of Lovdsz noted above. O’Brien (1977) 
obtained a sharper result, a digraph analogue of Conjecture 8.21. 


Theorem 8.24, Let D be a strict digraph on n vertices. Then D admits a decom- 
position into at most n* /4 directed paths. 


Alspach and Pullman (1974) defined the excess of a digraph D by the formula 


ex(D) := 5 max {d*(v) — d-(v), 0}, 


veV 


noting that any decomposition of D into directed paths must have at least this car- 
dinality because at least max {d*(v) — d~ (v),0} of the paths in the decomposition 
have v as their tail. Alspach et al. (1976) conjectured that this bound is tight if D is 
a tournament of even order. N.J. Pullman (see Reid and Wayland 1987) extended 
the conjecture to all regular oriented graphs of odd degree. 


Conjecture 8.25. Let D be a regular oriented graph of odd degree. Then D admits 
a decomposition into ex(D) directed paths. 


Hagegkvist (1985a) found very general results on circuit decompositions of 
almost-complete graphs and complete bipartite graphs. The basis of his approach 
1s the following observation. 


Proposition 8.26. If G is a path or circuit of length n, the composition G[2Kj] is 
decomposable into two circuits of length 2n. 
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Define a list {F,, F),..., Fm} of graphs to be even if each entry occurs an even 
number of times, and to be proper, for a graph H, if 

(1) e(H) = 3772, e(F), and 

(2) H has a subgraph isomorphic to F;, 1 <i < m. 

An immediate consequence of Proposition 8.26 is that, if G has a Hamilton 
decomposition, G[2K,] has a decomposition into any proper even list of even 2- 
factors; in particular, K4,,. — M, where M is a perfect matching, and K,,4, both 
have this latter property, the first being isomorphic to K,41(2K,], the second to 
Kon2n(2K,]. Another (less immediate) consequence is that K4,,.—M admits a 
decomposition into any proper even list of even circuits not including circuits of 
length 4n, and that K,,4, admits a decomposition into any proper even list of even 
circuits not including circuits of lengths 81 — 2, 8n — 4 or 8n — 6. 

To conclude, we note a conjecture on triangle decompositions due to Nash- 
Williams (1970). 


Conjecture 8.27. Let G be a simple even graph of minimum degree 6 on n ver- 
tices and m edges, where 5 > 3n/4. Then G admits a decomposition into triangles 
provided that m =0 (mod3) and 7 is sufficiently large. 


This appears to be a very hard problem. The result of Gustavsson (1991) that 
the assertion holds if 5 > (1 — 10~*4)n is therefore significant. 


8.6. Decompositions into Hamilton circuits 


The decomposition of graphs and digraphs is a topic with a wealth of problems and 
results; indeed, a chapter could easily be devoted to decompositions of the com- 
plete graph alone. In a sense, such questions belong to the realm of combinatorial 
design (see chapter 14) where algebraic techniques are employed as a rule. How- 
ever, graph-theoretical methods are often highly effective, as Proposition 8.26 well 
illustrates. The theory of edge colourings was applied by Hilton (1984) to establish 
the following elegant result on Hamilton decompositions of complete graphs (sub- 
sequently generalized by Nash-Williams (1987) using network flow techniques). 


Theorem 8.28. Let r and n be positive integers, r < 2n, let G and H be complete 
graphs of orders r and 2n +1, respectively, and let {E\, E2,...,E,} be an n-edge 
colouring of G. Then there is a Hamilton decomposition {H,,H,...,H,} of H 
such that E; C H;, 1 <i <n, if and only if the spanning subgraph of G with edge 
set E; is a path system with at most 2n+r—1 components, 1 <i <n. 


That complete graphs of odd order admit Hamilton decompositions is a classical 
result; one such construction, due to a Monsieur Walecki, is described in the book 
by Lucas (1891, pp. 161-164). Hilton (1984) explains how Theorem 8.28 can be 
used to find all such decompositions. An extension of Walecki’s theorem to regular 
graphs of high degree (and at the same time a strengthening of Corollary 4.18 for 
even d) was proposed by Nash-Williams (1971b). 


Conjecture 8.29. Let G be a simple 2d-regular graph on at most 4d + 1 vertices, 
where d > 1. Then G admits a Hamilton decomposition. 
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Hamilton decompositions of cartesian products of graphs are the subject of a 
conjecture of Bermond (1978a), several special cases of which have been settled 
by Stong (1991), and a problem of Alspach and Rosenfeld (1986). 


Conjecture 8.30. If G and H admit Hamilton decompositions, so does GOH. 


Problem 8.31. If G is a simple hamiltonian 3-regular graph, does the prism over 
G, GUK;, admit a Hamilton decomposition? 


In fact, no simple 2-connected 3-regular graph is known whose prism fails to 
admit a Hamilton decomposition. Hamilton decompositions of 4-regular graphs 
were considered also by Thomason (1978), who discovered an analogue of Smith’s 
Theorem (5.14) for such graphs. 


Theorem 8.32. Let G be a 4-regular graph, and let e and f be two edges of G. 
Then the number of Hamilton decompositions of G in which e and f belong to 
different Hamilton circutts is even. 


Berman (1986a) applied methods similar to those of Thomason (1978) to ob- 
tain yet another parity theorem: the number of decompositions of a graph into a 
Hamilton circuit and a spanning tree containing a specified edge is even (see also 
Chrobak and Poljak 1988). A corollary of Theorem 8.32 is that a 2d-regular graph 
with one Hamilton decomposition has at least 34-!(d — 1)! such decompositions 
(Thomason 1978). The question as to which graphs admit Hamilton decomposi- 
tions has received much study. A graph that can be decomposed into d Hamilton 
circuits must be regular of degree 2d and have edge connectivity 2d. Petersen 
(1891) proved that a regular graph of even degree necessarily admits a decom- 
position into 2-factors. The examples of Meredith (1973) show, however, that a 
decomposition into Hamilton circuits is not guaranteed. Kotzig (1957) found an 
elegant necessary and sufficient condition for certain 4-regular graphs to admit 
Hamilton decompositions. 


Theorem 8.33. The line graph of a simple 3-regular graph admits a Hamilton 
decomposition if and only if the graph is 3-edge-colourable. 


Many 4-regular graphs without Hamilton decompositions can be produced us- 
ing this theorem, the smallest being the line graph of the Petersen graph (fig. 8). 
But no 2-connected planar example can be so obtained because, by virtue of the 
Four-Colour Theorem (5.5), every simple 2-connected 3-regular planar graph is 
3-edge-colourable. Such examples can, however, be found with the aid of a neces- 
Sary condition akin to Grinberg’s condition (Theorem 5.6), for a 4-regular planar 
graph to admit a Hamilton decomposition (Bondy and Haggkvist 1981). A partial 
analogue of Theorem 8.33 for 4-regular graphs was found by Jaeger (1983). 


Theorem 8.34. The line graph of a simple 4-regular graph admits a Hamilton 
decomposition if the graph itself admits a Hamilton decomposition. 
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The above theorem is not a true analogue of Theorem 8.33 because the con- 
dition is not both necessary and sufficient. Indeed, the line graph of any simple 
3-connected 4-regular graph has a Hamilton decomposition, by a result of Jack- 
son (1991), whereas some 3-connected 4-regular graphs are not even hamiltonian. 
Jackson (1991) put forward the following general conjecture. 


Conjecture 8.35, The line graph of a simple 2d-regular graph admits a Hamilton 
decomposition if and only if it has edge connectivity 4d — 2. 


Conjecture 8.25 includes a longstanding question of PJ. Kelly (see Moon 1968, 
p. 7, Exercise 9) as to whether every diregular tournament admits a decomposition 
into directed Hamilton circuits: deleting a vertex from any such tournament T on 
2m +1 vertices, one is left with a tournament 7’ on 2m vertices and excess m; 
a decomposition of T’ into m directed paths is necessarily a decomposition into 
Hamilton paths, and these can be extended to yield a decomposition of T into 
directed Hamilton circuits. The corresponding question for bipartite tournaments 
was put forward by Jackson (1981a). 


Conjecture 8.36. Every diregular bipartite tournament admits a decomposition 
into directed Hamilton circuits. 


Hamilton decompositions of graphs and digraphs are surveyed by Alspach et al. 
(1990). 


8.7. Compatible circuit decompositions 


We discuss here a question on circuit decompositions of eulerian graphs which has 
important links to coverings. A transition in a graph G is a pair of adjacent edges. 
If v is a vertex of even degree in G, the edges incident to uv can be partitioned 
into transitions; if v is a vertex of odd degree, all but one of these edges can be so 
partitioned. Specifying such a partition for each vertex v determines a transition 
system for G. Two sets of transitions are compatible if they are disjoint. Transition 
systems were first studied by Kotzig (1968). 

An Euler tour induces a transition system whose elements are the pairs of con- 
secutive edges of the tour. Likewise, a circuit decomposition induces a transition 
system whose elements are the pairs of consecutive edges of circuits of the decom- 
position. The tour and decomposition are compatible if the systems of transitions 
they induce are compatible (except at vertices of degree two). Kotzig (1968) proved 
that to each circuit decomposition of an eulerian graph there corresponds a com- 
patible Euler tour. The converse was conjectured by G. Sabidussi (see Fleischner 
1980). 


Conjecture 8.37. Let G be an eulerian graph and T an Euler tour of G. Then G 
admits a circuit decomposition compatible with T. 


Fleischner (1980) observed that the conjecture is true if d(v) =0 (mod4) for 
each vertex v. Colouring the edges of 7 alternately red and blue induces a de- 
composition of G into two even subgraphs, one red and one blue. Each of these 
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subgraphs can be decomposed into circuits, resulting in a circuit decomposition of 
G compatible with 7. Fleischner (1980) also proved a stronger theorem in the case 
of planar graphs. A transition system J is nonseparating if G \ {e;, €2} is connected 
for each transition {e,,e.} € J. 


Theorem 8.38. Let G be a planar eulerian graph, and let FJ be a nonseparating 
transition system of G. Then G admits a circuit decomposition compatible with J. 


This readily implies Theorem 8.19 in the case of 4-regular graphs, as Fleischner 
and Frank (1990) remark. As before, one colours the edges of an Euler tour 
alternately red and blue. Since the number of edges is even, each vertex is then 
incident with two edges of each colour. The pairs of edges of the same colour define 
a nonseparating transition system 7 of G. Any circuit decomposition compatible 
with J consists of circuits whose edges alternate in colour, and which are therefore 
even. 


Remark 8.39. More generally, any edge colouring of a graph induces a set of 
transitions, namely the pairs of adjacent edges of the same colour; a circuit is then 
compatible with the edge colouring if no two consecutive edges of the circuit have 
the same colour. Such compatibility questions have been considered by a number of 
authors (see, for example, Grossman and Haggkvist 1983). In particular, Fleischner 
and Frank (1990) generalized Theorem 8.38 to sets of transitions induced by edge 
colourings in this way. An alternative set of transitions determined by an edge 
colouring is the set of all pairs of adjacent edges of different colour; in this case, 
a compatible circuit is one that is monochromatic. Compatibility questions of this 
latter type are touched upon in Remark 6.6. 


A generalization of Dirac’s Theorem (2.1) involving the notion of compatibility 
was proposed by R. Haggkvist (see Hellgren 1988, p. 15). 


Conjecture 8.40. Let G be a simple graph of minimum degree 6 on n vertices, 
where 6 > n/2 and n is sufficiently large, and let J be a transition system for G. 
Then G contains a Hamilton circuit compatible with 7. 


The theory of eulerian graphs is extensive, and we refer the reader to the treatise 
by Fleischner (1990) for further details. 


8.8. Double covers 


By Theorem 1.4, every even graph admits a circuit decomposition, but no graph 
with a vertex of odd degree admits such a decomposition, however. In order to 
formulate an analogue of Theorem 1.4 valid for all graphs, one is led to the concept 
of a circuit double cover, that is, a collection € of circuits such that each edge of 
the graph belongs to exactly two members of €. 

A necessary condition for a connected graph to admit a circuit double cover is 
that it be 2-edge-connected, because each edge must lie in some circuit. Szekeres 
(1973) and Seymour (1979), motivated by quite different considerations, conjec- 
tured that this condition is also sufficient. 
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Conjecture 8.41 (The Circuit Double Cover Conjecture). Every 2-edge-connected 
graph admits a circuit double cover. 


According to Tutte (personal communication 1992), the Circuit Double Cover 
Conjecture was already known in the 1950s. It has been studied extensively. A 
recommended reference is the survey article by Jaeger (1985); additional material 
can be found in Goddyn (1988), including a proof that the girth of a minimal 
counterexample is at least ten. The conjecture has close links to several other 
basic questions. It is implied by the combination of Conjectures 8.37 and 6.12 
(Fleischner 1984). Perhaps more surprisingly, it is also implied by Conjecture 8.13 
(Jamshy and Tarsi 1992). Moreover, to each 2-connected loopless graph can be 
associated an eulerian graph that has an even circuit decomposition if and only if 
the original graph has a circuit double cover (Jaeger 1985). 

Several refinements of the Circuit Double Cover Conjecture have been formu- 
lated, of which we mention four. First, a conjecture on embeddings, due to Haggard 
(1977). An embedding of a graph G in a surface & is a function ¢ that assigns to 
each vertex a point of 2 and to each edge a simple curve in 2, in such a way that 
curves (e,) and ¢(e) meet at point ¢(v) of & if and only if edges e, and e2 are 
incident with vertex v in G. The connected components of 2 \ 6(G) are the faces 
of G. 


Conjecture 8.42. Every 2-edge-connected graph can be embedded in some sur- 
face in such a way that each face is bounded by a circuit. 


Conjecture 8.42 implies the Circuit Double Cover Conjecture because the cir- 
cuits bounding the faces in such an embedding constitute a circuit double cover. 
A still stronger conjecture is that the surface can be chosen to be orientable and 
the embedded graph 5-face-colourable (see Jaeger 1985). 

Next, a conjecture of Seymour (1979) on edge-weighted graphs. A collection @ 
of circuits in a graph G induces a weighting w of G, the weight w(e) being the 
number of circuits of @ that include edge e. Seymour asked which edge-weighted 
graphs arise in this way, that is, which edge-weighted graphs are sums of circuits. 
One obvious necessary condition is that: 

(i) for each edge e, w(e) is a nonnegative integer. 

There are also two simple necessary conditions involving edge cuts, both conse- 
quences of the fact that a circuit meets an edge cut in an even number of edges: 

(il) for each edge cut S, }°,<5 w(e) is even; 

(iii) for each edge cut S and each edge f € S, w(f) < Dees W(e)- 

Seymour (1979) proved that conditions (i)-(iii) are sufficient for an edge- 
weighted planar graph to be a sum of circuits. They are not sufficient in general: a 
Petersen graph (fig. 8) in which the edges of a perfect matching have weight two 
and the remaining edges have weight one is not a sum of circuits. Alspach et al. 
(1994) extended Seymour’s result by proving that this configuration is present in 
any edge-weighted graph that is not a sum of circuits. 


Theorem 8.43. Let G be an edge-weighted graph whose weight function w satisfies 
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conditions (i)-(iii) above. Then either G is a weighted sum of circuits or there is a 
weighting w' of G such that 

(a) w'(e) < w(e) for each edge e of G, 

(b) w’(e) € {0,1,2} for each edge e of G, and 
i (c) the support of w' can be obtained from the edge-weighted Petersen graph 
described above by recursively applying the following two operations: subdividing 
an edge of weight one or two, and replacing an edge of weight two by two multiple 
edges of weight one. 


Seymour (1979) conjectured that conditions (i)—(iii) are sufficient, however, if 
all weights are even (in which case, condition (ii) is automatically satisfied). 


Conjecture 8.44. An edge-weighted graph with nonnegative even weights is a 
sum of circuits if and only if, for each edge cut S and each edge f € S, w(f) < 


Dees W(e). 


Thirdly, an analogue of Conjecture 8.20 for double covers, proposed by Bondy 
(1990). 


Conjecture 8.45. Every simple 2-edge-connected graph on n vertices admits a 
circuit double cover @ such that |€| <7 —1. 


Two implications of this conjecture have been established. One, concerning path 
double covers, was obtained by Li (1990). A path double cover of a graph G is a 
collection P of paths such that each edge of G belongs to exactly two members 
of #. 


Theorem 8.46. £very simple graph G admits a path double cover P such that each 
vertex of G occurs exactly twice as an end of a path of #. 


The other is an implication for edge-weighted graphs, proved by Bondy and Fan 
(1991). 


Theorem 8.47. Let G be a simple 2-edge-connected edge-weighted graph on n 
vertices. Then G contains a circuit of weight at least 2w(G)/(n — 1). 


Fourthly, a conjecture due to Celmins (1984) and Preissmann (1981) on the 
minimum number of even subgraphs, rather than circuits, in a double cover. 


Conjecture 8.48. Every 2-edge-connected graph admits a double cover by five 
even subgraphs. 


Jaeger (1976) and Kilpatrick (1975) obtained a theorem on coverings by even 


subgraphs that is pertinent to Conjecture 8.48 and has other interesting implica- 
tions. 


a 8.49. (i) Every 4-edge-connected graph is the union of two even sub- 
graphs. 


(ii) Every 2-edge-connected graph is the union of three even subgraphs. 
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Theorem 8.49(i) implies that every 4-edge-connected graph admits a double 
cover by three even subgraphs: such a graph is the union of two even subgraphs, 
and these subgraphs together with their symmetric difference constitute a double 
cover of the graph. Bermond et al. (1983) noted that Theorem 8.49(ii) likewise 
implies that every 2-edge-connected graph admits a quadruple cover by seven 
even subgraphs. This is not thought to be best possible, however. Fulkerson (1971) 
conjectured that every 2-edge-connected 3-regular graph admits a double cover by 
six perfect matchings, and Jaeger (1985) observed that Fulkerson’s conjecture can 
be reformulated in terms of even subgraphs as follows. 


Conjecture 8.50. Every 2-edge-connected graph admits a quadruple cover by six 
even subgraphs. 


Fan (1992) proved that every 2-edge-connected graph has a sextuple cover by 
ten even subgraphs; for 3-regular non-3-edge-colourable graphs, this number is 
least possible. Many results on circuit covers, such as this one, are proved by the 
methods of integer flows. This topic is discussed in the appendix to chapter 4 by 
Seymour; another excellent source is the survey by Jaeger (1988). 

We close by noting an attractive result of Chen (1971) and T. Gallai (see Lovasz 
1979, p. 254) akin to the two assertions of Theorem 8.49. An inductive proof due to 
L. Pésa is described in Lovdsz (1979); for a very short proof, see Woodall (1990). 


Theorem 851. Every graph can be decomposed into an even subgraph and an 
edge cut. 
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1. Introduction, preliminaries 


Intuitively, a graph is felt to be connected if there is no way to separate it into two 
parts with no connection between the two parts. Or, equivalently, for any two 
nodes of the graph there is a path connecting them. 

If one is interested in various properties of graphs, it is often useful to 
dismantle the graph into connected components and then investigate those 
components separately. For example, to decide whether a graph is k-colorable it 
suffices to deal with the connected components. A similar idea works for higher 
connectivity, as well. Namely, we separate first the graph into highly connected 
parts, then establish properties of these parts, and finally, using these properties 
we try to obtain information about the whole graph. For example, to prove 
Kuratowski’s theorem on plane representation of graphs one can assume first that 
the graph is 3-connected since otherwise the graph can be decomposed along a 
2-separation and the planar representations of the smaller parts can be pieced 
together. Second, by exploiting stronger properties of 3-connected graphs one can 
more easily deduce Kuratowski’s theorem for 3-connected graphs. (See Thomas- 
sen 1980b.) (We remark that the same program proved extremely useful for 
matroids, as well.) 

But what does “higher connectivity” mean? Intuitively, one may have at least 
two possible definitions for k-connectivity. First, the graph is not only connected 
but remains so after deleting any set of at most k — 1 nodes. Second, a graph is 
k-connected if there are & (openly) disjoint paths between any two of its nodes. 
Fortunately, by a theorem of Whitney, these seemingly different concepts 
coincide. Whitney’s (1932) theorem is an easy consequence of what can be 
considered the fundamental result of this whole chapter: Menger’s theorem. It 
states that either there are k openly disjoint paths between two specified nodes or 
there is a set of less than & elements separating these two nodes. 

Another source of the theory to be surveyed here is network flows. Its basic 
result, the max-flow min-cut theorem, can be considered as a capacitated (and 
directed) counterpart of Menger’s theorem. Network flow theory is a systematic 
treatment of combinatorial optimization problems. It found a great number of 
applications in practice as well as in other branches of mathematics. 

In this chapter we try to provide a rather comprehensive overview of results 
belonging to this area. Certain (mostly easier) parts are discussed in greater detail 
in order to give some hints on the general techniques used. Other parts are more 
difficult so we confine ourselves to give a general framework filled with various 
results but no proofs. In some cases, however, when it did not need too much 
Space, proofs of some deep theorems (e.g., Tutte’s wheel theorem or Nash- 
Williams’ theorem on covering trees) have been included. 

Throughout the chapter we use the following notation. Let V be a finite set and 
5, ¢elements of V. A subset X of V is called an sf-set if s© X CV — t. We often do 
not distinguish between a one-element set and its single element. Let D = (V, A) 
be a digraph. The elements of A are called directed edges or arcs. For SCV let 
A*(S ) denote the set of arcs with tail in § and head in V—S. We use the notation 
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A’ (S):=A*(V—S), 5° (S) =:|47(S)|, 6°(S):=6*(V—S). For a graph or di- 
graph G and vector x:A—>R, let d,(X, Y) denote the sum } (x(a): a € A, one 
end of a is in X — Y, the other end is in Y ~~ X). When x = 1 we use d(X, Y) for 
d,(X, Y). 

If it is not ambiguous we will use the notation e = uv for an edge with endpoints 
u and v. Similarly e = (u, v) stands for an arc with tail u and head v. An edge uv 
or an arc (u, v) is said to leave (enter) S if S is a uv-set (vui-set). 

To complete this introductory section let us draw attention to some other 
survey papers and books. Concerning connectivity two books of Tutte (1966, 
1984) deserve special mentioning. Mader’s survey paper (1979) includes a long list 
of connectivity results and their relationship. Bollobas’ (1978) book is also an 
excellent reference that includes the proof of many difficult theorems. 

As far as network flows are concerned the classical book of Ford and Fulkerson 
(1962) is even today a refreshing reading. Recently, Ahuja et al. (1993) provided 
a comprehensive book on network flow techniques. Another useful survey on 
network flow theory, given by Goldberg et al. (1990), appeared in a book entitled 
“Paths, Flows, and VLSI-Layout” (B. Korte et al., eds., Springer, 1990). This 
book includes other important surveys concerning connectivity results. One of 
them, due to A. Schrijver, is concerned with the homotopic paths packing 
problem. The survey of N. Robertson and P.D. Seymour outlines the authors’ 
very complex disjoint paths method. A third survey paper from the same book, 
written by the present author, provides an overview on packing paths, circuits, 
and cuts. 


2. Reachability 


2.1. Paths and walks 


In a graph by a walk W we mean an_ alternating sequence 
(Up, 1, U1, €2,-+--+,€,,U,) Consisting of nodes and edges where e; is an edge 
between v,_, to u;. The nodes uv, and v, are called the endpoints of W. Sometimes 
we say that W is a walk between v, and v, or that W is a walk from v, to v, (or 
from vu, to vy). In a digraph by a (directed) walk W we mean an alternating 
sequence (Uy, €,,U,,€2,.--,&,,U,) where e, is an arc from u,_, to v;. We say that 
W is a walk from v, to v, or that v, is reachable from vy by W. 

The following definitions concern both graphs and digraphs. The number & of 
edges of a walk is called the length of the walk. The distance of t from s is the 
minimum length of a path from s to fr. Obviously, if W, is a walk (directed or 
undirected) from u to v and W, is a walk from v to w, then the concatenation 
W=W,W, is a walk from u to w. 

We say ‘that a walk is simple if all its defining terms are distinct. A simple walk 
is called a path. A walk is called closed if its endpoints coincide. If otherwise the 
terms of a closed walk are distinct, it is called a circuit. 

Let W = (up, €;,..- , €,,U,) be a walk. Suppose that v, = u, for some i, j,O<i< 
j <k, and the subsequence C = (u,, e;,,,..- ,€;-1, ¥,) is a Circuit. Reducing W by 
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circuit C means that we define a new walk W’ := (Up, €,, ~~) Uj ja + + > gs Ug): 
Simplifying W means that one reduces W as long as possible. The final walk is a 
path from uy to v, (that may depend on the order of reductions). Thus we have 
the following. 


Proposition 2.1. (a) /n @ graph if there is a walk between two nodes u and v, there 
is a path between u and v. 

(b) in a digraph if there is a directed walk from u to v, there is a directed path 
from u to v. 


Let us call two nodes of a graph G=(V,£) equivalent if there is a path 
connecting them. This is an equivalence relation: from the definition of path it is 
symmetric and reflexive, by Proposition 2.1 it is transitive. An equivalence class is 
called a component of G. If G has exactly one equivalence class, G is called 
connected. Equivalently, G is connected if there is a path between any two of its 
nodes. 

We call two nodes u and v of a digraph D =(V, A) equivalent if there is a 
directed path from u to v and one from v to u. This is again an equivalence 
relation. An equivalence class is called a strong component. If D has exactly one 
equivalence class, D is called strongly connected. Equivalently, a digraph D is 
strongly connected if there is a directed path from every node to any other. 


Proposition 2.2. Let s and t be two specified nodes of a digraph D = (V, A). There 
is a directed path from s to t if and only if for all st-sets S there is an edge leaving S. 


Proof. The necessity is straightforward. The sufficiency follows by observing that, 
if there is no path from s to ¢, the set S of nodes reachable from s has no leaving 
edges. O 


Let us introduce some further notions. An undirected graph is called a tree if it 
is connected but deleting any of its edges disconnects the graph. 


Proposition 2.3. For a graph G = (V, E) the following are equivalent: 

(a) G is a tree. 

(b) G is a connected graph containing no circuit. 

(c) In G there is a unique path between any pair of nodes. 

(d) G is connected and \E\=|V|—1. 

(e) G can be built up from any of its nodes by consecutively adjoining edges so 
that one end of the currently added edge belongs to the graph having already been 
constructed while the other endpoint does not. 


A graph is called a forest if each of its components is a tree. A digraph 
D = (V, A) is called an arborescence if D arises from a tree by orienting the edges 
in such a way that every node but one has one entering arc. The exceptional 
node, called the root, has no entering arc. The union of node-disjoint arboresc- 
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ences is called a branching. (Equivalently, a branching is a directed forest such 
that the in-degree of each node is at most one.) 


Proposition 2.4. For a digraph D the following are equivalent: 

(a) D is an arborescence. 

(b) D contains a node r such that every node can be reached from r by a unique 
path, 

(c) D contains a node r such that every node can be reached from r and deleting 
any edge yields a node that is not reachable from r. 

(d) D can be built from a node r by adjoining sequentially arcs so that the tail of 
the currently added new arc belongs to the digraph having already been constructed 
while the head is a new node. 


Let D =(V, A) be a digraph with a specified node s. We describe now a simple 
device, the labeling technique, to determine the set S of nodes reachable from s 
along with a sub-arborescence of D rooted at s that spans S. 

We use a label called R-label for every node v showing if uv has already been 
reached or not. If not, the label has entry “NON-REACHED”. If vu is reached its 
R-label says REACHED and contains the arc (u, v) € A along which v has been 
reached. The only exception is the source node s: the entry of its label is always 
REACHED. At the beginning every node but s has NON-REACHED in its 
R-label. 

We also use another label called S-label for every node to indicate whether v is 
SCANNED or UNSCANNED. 

At the beginning of the algorithm for every node the entry of every S-label is 
UNSCANNED. In a general step we pick up an unscanned node uw that has 
already been reached (at the beginning only the source is such) and decide if there 
is a non-reached node v such that (u, v) is an arc of D. If there is none, declare u 
SCANNED and repeat. Otherwise declare vy REACHED and put (u, v) inta, its 
R-label and repeat. 

The algorithm terminates if there is no more unscanned node which is reached. 


Proposition 2.5. The set S of nodes that have REACHED in their R-iabel has no 
leaving arcs and consists precisely of nodes reachable from s. The set of arcs 
occurring in their R-labels forms an arborescence rooted at s with node set S. 


Note that the procedure can be applied to undirected graphs as well. In the 
algorithm there is much freedom in choosing a reached and unscanned node. One 
possible strategy is to choose each time an unscanned node u which has been 
reached earliest. In this case the procedure is called breadth first search (BFS). 

An application of BFS is to compute the distance of the nodes in S from s. The 
only modification in the above algorithm is that we need a third variable dist(v) at 
every node v to store the distance of v from s. At the beginning this is 0 at s and 
© at all other nodes. When a node v is reached from u we define dist(v) to be 
dist(u) + 1. 
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Another natural strategy is to choose each time an unscanned node that has 
been reached latest. In this case the procedure is called depth first search (DFS). 
Depth first search has a great number of important applications and we will 
mention three of them in the next section. 

' Finally, we mention a third kind of search, the so-called maximum cardinality 
search. Here each time an unscanned node is chosen that has a maximum number 
of already reached neighbours. This search was introduced by Tarjan and 
Yannakakis (1984) in order to find a simplicial ordering of the nodes of a chordal 
graph. Nagamochi and Ibaraki (1992) showed how. maximum cardinality search 
can be used to find a sparse k-connected subgraph of a k-connected graph. 

We remark that it is not difficult to implement these search procedures so as to 
run in linear-time. (For details, see Tarjan 1983.) 


2.2. 2-Connectivity and strong connectivity 


Given a graph or digraph G = (V, £), a node v EV is called a cut node if E can be 
partitioned into two non-empty subsets £, and E, such that V(E,) and V(E,) have 
just the node v in common. (For F C E, V(F) denotes the set of nodes incident to 
at least one element of F.) In particular, a node incident to a loop and to another 
edge is a cut node. If G is loopless, then v is a cut node if and only if its deletion 
increases the number of components: c(G — v) > c(G). 

A connected graph is called a block if it has no cut node. A graph is called 
2-connected if it is a block and has at least three nodes. 


Proposition 2.6. For a loopless graph G=(V,E) with |V|23 the following are 
equivalent: 

(a) G is 2-connected. 

(b) For any two nodes there is a circuit containing them. 

(c) For any two sets A, BCV with |A|, |B|=2 there are two disjoint paths 
connecting nodes of A and B. 

(d) Any pair of edges is contained in a circuit. 

(e) G can be built up from a circuit by sequentially adjoining edges (loops are 
not allowed) and subdividing edges (in any order). 


The following is a useful reduction property of 2-connected graphs. 


Proposition 2.7. For every edge e of a 2-connected graph with at least four nodes 
either the deletion or the contraction of e results in a 2-connected graph. 


(Contracting an edge uv means that we identify u and v into a new node z and 
for each edge uw or vw of G we introduce an edge zw. Deleting an edge e = uv 
means that we leave out e from E.) 

_ Astrongly connected digraph with at least three nodes is called a strong block if 
it has no cut node. 
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Proposition 2.8. For a digraph D =(V, A) with |V| 23 the following are equiva- 
lent: 

(a) D is a strong block. 

(b) D can be built up from a directed circuit by sequentially adding arcs io 
loops aliowed) and subdividing arcs. 


Subdividing an arc (u,v) means that we replace (u,v) by a path P from u tov 
where the inner nodes of P are new nodes of the graph. 

(Note that it is not true that every pair of nodes of a strong block lies on a 
directed circuit.) 

Let G =(V, E) be a connected but not necessarily 2-connected graph. A block 
of graph G is a subgraph that is a block and is maximal with respect to this 
property. The blocks of G form a tree-like structure in the following sense. Let 
B,, B,,..., B, be the blocks of G. Form a bipartite graph T= (V, B; F) where 
the elements of B= {b,,b,,...,6,} correspond to the blocks of G. In T let 
nodes uv; and b, be connected by an edge if v, € B,. 


Proposition 2.9. The blocks of a graph G=(V,E) partition the set E of edges. 
Two edges belong to the same block if and only if there is a circuit containing both. 
Any two blocks have at most one node in common and the nodes belonging to 
more than one block are cut nodes. The graph T = (V, B; F) is a tree. 


We call an edge e of a graph G = (V, E) a cut edge or an isthmus if G — e has 
more components than G. A connected graph is called 2-edge connected if it 
contains no cut edges. 


Proposition 2.10. For a connected graph G = (V, E) the following are equivalent: 
(a) G is 2-edge-connected. 
(b) For any pair of nodes there are two edge-disjoint paths connecting them. 
(c) Any edge is contained in a circuit. 
(d) G can be built up from a node by sequentially adjoining edges (loops are 
allowed) and subdividing edges. 


Property (d) is sometimes formulated in another way. By an ear-decomposition 
of G we mean a sequence G,, G,,...,G,=G of subgraphs of G where Gy 
consists of one node and no edge, and each G, arises from G,_, by adding a path 
P, for which the two (not necessarily distinct) end-nodes belong to G,_, while the 
inner nodes of P, do not. The paths P, are called ears. (P; may consist of a single 
edge). Now (d) is equivalent to saying that a graph is 2-edge-connected if and 
only if it has an ear-decomposition. There are several other ear-decomposition 
theorems. One is mentioned in the next proposition. Another asserts that a graph 
is 2-connected if and only if there is an ear-decomposition using only open ears. 
Yet another (due to L. Lovasz) says that a graph is factor-critical if and only if 
there is an ear-decomposition using only ears of odd length. (See the chapter on 
matchings by W. Pulleyblank.) 
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Proposition 2.11. Let D=(V, A) be a digraph whose underlying graph is con- 
nected. The following are equivalent: 
(a) D is strongly connected. 
_(b) There is at least one arc leaving each set X CV(D), X #9, that is, there is no 
directed cut. 
(c) Every arc is in a directed circuit. 
(d) D can be built up from a node by sequentially adding arcs (loops are 
allowed) and subdividing arcs. 


Let D=(V,A) be a digraph whose underlying graph is connected. Let 
C,,C,,...,C, be maximal strongly connected subgraphs of D. These subgraphs 
are called the strong components of G. The name is justified by the following 
proposition. 


Proposition 2.12. The node sets V(C,) (i=1,...,k) form a partition of V. By 
contracting each C, into a node one obtains an acyclic digraph. 


Using depth first search, both the blocks of an undirected graph and the strong 
components of a directed graph can be found in linear time (Tarjan 1972). 

Propositions 2.10 and 2.11 indicate that the analogous concepts for graphs and 
digraphs are 2-edge-connectivity and strong connectivity. Actually, parts (d) of 
these propositions immediately imply a theorem of Robbins (1939). 


Corollary 2.13. A graph G has a strongly connected orientation if and only G is 
2-edge-connected. 


Here we provide another proof of this result that gives rise to a linear time 
algorithm (Tarjan 1972). Let s be an arbitrary node of G. Let T be a spanning 
tree determined by depth first search. Define an arborescence F by orienting the 
edges of T away from s. 


Claim. The unique path in T connecting the endpoints of any edge e = uv CE - T 
is a directed path P, in F. 


_The orientation of G obtained by orienting each edge e € E — T so as to form a 
directed circuit with P, is strongly connected. 


, The following slight extension of Corollary 2.13 also holds (Boesch and Tindell 
980). 


Theorem 2.13a. The undirected edges of a mixed graph G (i.e., a graph having 
directed and undirected edges) can be oriented in such a way that the resulting 


digraph is strongly connected if and only if there is no cut edge in G and there is no 
directed cut. 


Corollary 2.13 naturally gives rise to the following question. Given a digraph 


120 A. Frank 


D=(V, A), what is the minimum number of arcs the reversal of which makes D 
strongly connected? We must require that the underlying graph of D is 2-edge- 
connected. Obviously, the required subset of arcs meets all the directed cuts. 
Conversely, from Theorem 2.13a one can derive the following. 


Proposition 2.14. If F is a minimal set of edges covering all the directed cuts, the 
reversal of the elements of F leaves a strongly connected digraph. 


Therefore the following deep theorem of Lucchesi and Younger (1978) answers 
the question above. (For a relatively simple proof, see Lovasz 1976b, for a 
constructive proof yielding a polynomial-time algorithm, see Frank 1981.) 


Theorem 2.15 (Lucchesi and Younger 1978). Let D=(V,A) be an arbitrary 
digraph. The minimum number of arcs covering all the directed cuts is equal to the 
maximum number of pairwise disjoint directed cuts. 


3. Directed walks and paths of minimum cost 


3.1. Walks and paths 


Throughout this section we use the terms walk, path, circuit to mean diwalk, 
dipath, dicircuit, respectively. The /ength of a path is the number of edges of the 
path. Let D =(V, A) be a loop-free digraph on ” nodes and s and ¢ two specified 
nodes called source and sink, respectively. Given a cost function c: A— R, find a 
path from s to t of minimum cost. Here the cost c(P) of a path P is the sum of the 
cost of its arcs. A path is called a min-cost path if it has minimum cost among the 
paths from its origin to its terminus. 

In this section we survey this problem and its variants. (For more detailed 
analysis, see Lawler 1976 and Tarjan 1983.) It turns out that computing ore 
min-cost path from s to ¢ is not simpler than computing a min-cost path from s to 
every other node reachable from s. So we focus mainly on that problem. We are 
going to describe several algorithms but the emphasis will be put on ideas and we 
do not seek for finding the most efficient procedures. The complexity of an 
algorithm heavily depends on the representation of the problem and the data 
structure used. 

Let w,(v) denote the minimum cost of a walk of length at most & from s to v. If 
there is no such a walk, let w,(v):=%. We will assume that each node of D is 
reachable from s. The following recursion is straightforward. 


Proposition 3.1. w,,,(v) = min(w,(v), min(w,(u) + c(u, v): (u,v) E A)) for vEV. 


(The minimum taken over the empty set is defined to be ~.) 

Relying on this proposition one can easily design an O(k|A|) algorithm to 
compute w,(v) for vu EV as well as a walk W,(v) from s to v of length at most & 
with cost w,(v). 
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To outline this let us assume for convenience that for every node vEV—s 
there is an arc (s,v). If this is not the case, add a new arc e=(s,v) with 
c(s,v)=%. At the beginning let w,(v):=c(s,v) for vEV—s and w,(s)=0. 
Fusthermore, let W,(v) = {s, (s,v),v)} for vEV—s and W,(s):={s}. In the 
(k + 1)th phase of the algorithm w,,,(v) and W,,,(v) is computed with the td 
of the formula in the proposition. 

What about minimum cost paths? If the cost function c is arbitrary, there is no 
hope for a good algorithm since the problem of finding a longest path from s to ¢ 
is NP-complete and that problem can be formulated as a minimum cost path 
problem by choosing c(e) = —1 for every arc e. 

Therefore it would be natural to suppose that c > 0. However, not everything is 
lost if we do not require that much. The minimum cost path problem is tractable 
under the weaker assumption that there are no circuits of negative total cost (or, 
shortly, negative circuit). In particular, in acyclic digraphs the min-cost paths can 
be computed in polynomial time for arbitrary cost functions. 

We call a cost-function c conservative if there is no directed circuit with negative 
total weight. A function 7: V—R is called a feasible potential (subject to c) if 
a(v) — m(u) <c(u, v) for every arc (u,v) EA. 


Theorem 3.2. Given a digraph D=(V, A) and a cost-function c:A—>R, c is 
conservative if and only if there is a feasible potential. The potential can be chosen 
integer-valued if c is integer-valued. 


Proof. Suppose oS that am is a_ feasible potential and C= 
(vy, E,,0,,€2,...,€,,U, =Uy) iS an arbitrary circuit. Then we have c(C)= 
Bee) 23 [we,)—a,))=0 

Conversely, suppose that there is no negative circuit. We can assume that there 
is a node s from which every other node is reachable. For otherwise adjoin a new 
node s to D and an arc (s, v) for each v EV. Define the cost of the new arcs to be 
0. Since there is no circuit containing the new node, the extended digraph 
contains no negative circuit. 

We claim that 7(v):= w,(v) (where n =|V{) is a feasible potential. Indeed, if 
there is no negative circuit, then w,(v) can be realized by a path P, which has at 
most n— 1 arcs. Then (u) + c(u, v) =c(P,)+ cu, v)=a(v). O 


Let p,(v) denote the minimum cost of a path of length at most k from s to v. 
Obviously, p,(v) is the minimum cost of a path from s to v. The preceding proof 
also shows the following. 


Proposition 3.3. [f none of the walks W,(v) (v EV) induces a negative circuit, then 
there is no negative circuit in D, i.e., c is conservative. 


Since for a node v EV simplifying W,(v) can easily be carried out (in linear 
time) we have obtained an algorithm, due to Bellman (1958) and Ford (1956), 
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that either finds a negative circuit or computes a min-cost path from s to every 
other node v. The complexity of the algorithm is O(|A||V|). 

In section 5 we will make use of the following problem, also interesting for its 
own sake. Suppose that there are negative circuits in a digraph D = (V, A) and we 
want to eliminate all of the negative circuits by increasing the cost of every edge 
by the same value e so that ¢ is as small as possible. It is an easy exercise to see 
that « has the following interpretation: —« is the minimum circuit mean. (The 
mean of a circuit C is c(C)/|C].) 

To compute ¢ revise the algorithm mentioned after Proposition 3.1 as follows. 
Whenever a negative circuit C is detected by the algorithm, compute the current 
mean cost e’ of C and update the cost function by increasing the cost of each arc 
by |e’|. Obviously C becomes a circuit of zero cost. The algorithm halts at the nth 
stage when there is no more circuit of negative (current) cost. One can see that 
the minimal « is the sum of increments of costs and that a circuit C that became of 
zero cost last has the minimum mean cost. (See Karp 1978.) 

Next we list some basic properties of minimum cost paths. For convenience, let 
us assume that every node of D can be reached from a specified node s. 


Proposition 3.4. Suppose that c is conservative. If P=(S=v,,€,,...,8;, 
Uj,+++yj,U;,---+5¢) is @ min-cost path from s to t, then a subpath R= 
(U,;,..-,€;,0;) is @ min-cost path from v, to v,. (Note that the corresponding 
statement for undirected graphs does not hold.) 


Proof. Let R’ be a path from vu; to uv, for which c(R’) <c(R). Construct a walk W 
by replacing the segment R in P by R' and let P’ be a path obtained by 
simplifying W. Since there is no negative circuit we have c(P')<c(W)<c(P). DO 


Theorem 3.5. if c is conservative, the minimum cost of a path from s to t is equal 
to max(z(t) — a(s): a @ feasible potential). 


Proof. Let P=(s=v,,...,u,=¢) be a path from s to t and w a feasible 
potential. Then (*) c(P)=% (c(v,_,,¥,:2=1,...,4) 2 (av,) - au,-,):i = 
1,...,k)= a(t) — w(s) and so max < min. 


To see the other direction let p,(v) denote the minimum cost of a path from s 
to v. We have seen that p, is a feasible potential. Since p,(t) — p,(s) = p, (t) = c(P) 
we have equality in (*). O 


For a feasible potential 7 we say an arc (u,v) to be tight if mv) — m(u) = 
c(u,v). By Theorem 3.5 a path P is a min-cost path if and only if there is a 
feasible potential zr such that P consists of arcs which are tight with respect to 7. 

A min-cost-path s-arborescence F is an arborescence of D rooted at s for which 
the unique path in F from s to any other node v is a minimum cost path in D. 


Proposition 3.6. if there is no negative circuit, there is a spanning min-cost-path 
5-arborescence. 
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Proof. Let L be the union of arcs belonging to any min-cost path starting from s. 
The arcs in L are tight with respect to p, and L contains a spanning s- 
arborescence. O 

Proposition 3.7. A spanning s-arborescence F is a min-cost-path s-arborescence if 
and only if c,(v) — c,(u) <c(u, v) for every (u, v) € A (c,(v) denotes the cost of the 
unique path in F from s to v). 


Above we outlined a method of complexity O(|A||V|) to decide if a cost 
function c is conservative and, if so, to compute a min-cost-path s-arborescence. 
It is not known whether there is a method of complexity O(|V|”). There are, 
however, two special cases when such an algorithm exists. 


3.2. Non-negative costs and acyclic digraphs 


Assume the cost function c is non-negative. We briefly summarize Dijkstra’s 
(1959) method. The basic observation is the following. 


Lemma 3.8. If T is a min-cost-path s-arborescence (not necessarily spanning) and 
m, := min(p,(u) + c(u, v): (u,v) € A“ (V(T)) is attained at an arc a = (u,,U,), then 
T +a is a min-cost-path s-arborescence. 


Proof. Let P, denote the path obtained from the path in T from s to u, by adding 
a. Let P be any path from s to v, and e = u,v, the first arc on P that leaves V(7T). 
Since c 2 0, c(P’) <c(P) where P’ is the subpath of P from s to v,. By the choice 
of a, c(P,)<c(P’) so P, is a min-cost path. O 


Dijkstra’s method 3.9 consists of n — 1 phases. Starting at s we build up, arc by 
arc, a min-cost-path s-arborescence T. In order to compute m, we maintain a 
label [(v) = min(p,(u) + c(u, v): (u, v) € A* (V,)) for v €EV—V(T). This label tells 
us which arc a=(u,,v,) has to be added to the current JT. When an arc 
a = (u,,v,) has been added to T, label /(v) is updated by /(v) := min(((v), lv,) + 
c(v,, v)). Thus updating all /(v) needs O(n) time and the overall complexity of the 
algorithm is O(n’). 


As an application, let us return for a moment to the general case when c may 
not be non-negative but there is no negative circuit. Dijkstra’s algorithm can be 
used to show that there is an O(n’) algorithm for computing a min-cost-path 
s-arborescence provided that a feasible potential a is available. Indeed, define 
c’(u,v):= c(u, v) — mv) + au). 


Claim 3.10, A path P from s to t is a min-cost path with respect to c if and only if 
P is a min-cost path with respect to c’. 


Proof. For any path R from s to ¢ the difference c(R) — c'(R) = 7(t) — 7(s) does 
not depend on R whence the claim follows. O 
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Therefore one can apply Dijkstra’s algorithm to c’. 

Another important special case when an O(n’) algorithm is available is the one 
of acyclic digraphs. We can suppose again that every node is reachable from s. 
The following slight modification of Dijkstra’s algorithm works. 


Lemma 3.11. Let T be a min-cost-path s-arborescence and v GV—V(T) such that 
A (v) CA*(V(T)). If mv) := min(p,(u) + e(u, v): (u,v) EA*(V(T)) is attained 
at an arc e, then T + e is a min-cost-path s-arborescence. 


The proof is straightforward. To implement the algorithm one has to find an 
ordering v, =5,v2,...,U, of the nodes such that (v,, v,) can be an arc only if i<j 
and then build the s-arborescence along this ordering. (Using depth first search 
such an ordering can easily be found in O(|E|) time.) 

This algorithm enables us to find a maximum path in an acyclic digraph, in 
particular, a maximum weight chain in a weighted poset. 


3.3. Shortest paths in undirected graphs 


Finally, we are interested in finding a min-cost path between two nodes s and ¢ of 
an undirected graph G=(V,E). If the cost function ¢ is non-negative, the 
min-cost path problem can easily be reduced to the directed case by replacing 
each edge by a pair of oppositely directed arcs. This reduction, however, does not 
work in the general case, even if there is no negative circuit, since then we would 
introduce a negative (2-element) circuit. To overcome this difficulty one has to 
invoke matching theory, in particular, the theory of 7-joins. 

Let E := {eE E: w(e) <0}. Let 7, := {vu EV: an odd number of edges from 
E~ is incident to v} and T:=({s,t}®T, where @ denotes the symmetric 
difference. Define w’(e) :=|w(e)| for each e € E. If there is no circuit of negative 
w-cost, then, for any T-join F of minimum w’-cost, F®E consists of an st-path 
of minimum w-cost and some disjoint circuit of zero w-cost. Therefore in order to 
compute a min-cost st-path it suffices to compute F. This, in turn, can be done 
with the help of a weighted matching algorithm. (While solving the min-cost path 
problem for directed graphs was not too difficult, one may be wondering if it is 
indeed necessary to invoke such a sophisticated tool, the matching algorithm, for 
solving the minimum cost path problem in undirected graphs. However this is not 
surprising anymore once one observes that an algorithm solving the latter 
problem can easily be used to compute a minimum weight perfect matching.) 

For the structure of distances, see Seb6 (1993). 


4. Circulations and flows 


4.1. Feasible circulations and maximum flows 


In the theory of network flows there are several models which are, on one hand, 
equivalent via elementary constructions. On the other hand, for different type of 
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applications it is convenient to have various models. We are going to survey the 
two basic ones: flows from a source to a sink and circulations. Although 
historically flows came earlier here we start with circulations. For more detailed 
network flow theory, see Ford and Fulkerson (1962), Lawler (1976), Phillips and 
Garcia-Diaz (1981), Lovasz and Plummer (1986), Ahuja et al. (1993), Goldberg 
et al. (1990). 

Throughout this section we work with a digraph D =(V, A). Let f:A->RU 
{-%} be a lower capacity, g: A—RU {+} an upper capacity such that f <g. 
For a vector x:A->R and a subset SCV let 5, (S):= Lew, v): (u,v) EA, 
(u,v) enters S) and let 6° 2 (S ):=6,(V—S). Vector x is called a circulation if the 
conservation rule 6. (v) = ‘a sv) holds at every node v. It is easily seen that, given 
a circulation x, 8 w(X)= §° ~(X) for all XCV. A circulation x is feasible if 


f<x<g. 
Theorem 4.1 (Hoffman 1960). There exists a feasible circulation if and only if 
8; (X)<8)(X) foreveryXCV. (4.1) 


If f and g are integer-valued and (4.1) holds, there is an integer-valued feasible 
circulation. 


Proof. Necessity. If x is a feasible circulation, then 67 (X) — 6; (X) =6,(X) — 
5, (X) =0 and (4.1) follows. 
"Sufficiency. Let y(X) := 57 (X) — 5, (X). Then (4.1) is equivalent to y(X) = 0. 


Lemma 4.2. y(X) + oY) = v(XOY)+y(XUY)+d,_ (X,Y). 
Proof. The contribution of any arc to the two sides is the same. O 


Choose a counter-example for which the number q of arcs with f(a) < g(a) is 
minimum. There is such an arc a= (s, t) since otherwise x := f(=g) is a feasible 
circulation (by (4.1)). Modify f by increasing f(a) as much as possible without 
violating (4.1). By the minimal choice of q the modified f(@) is still smaller than 

g(a). Furthermore, there is a ¢5-set T for which 6; (T)=6 ; (7), that is; y(T) = 0. 
Similarly, reduce g(a) as much as possible without violating (4.1). Again, the 
penne g(a) is bigger than (the modified) f(a) and there is an st-set S for which 
WS) 

Because of arc a the value d,_ ,(S, T) is strictly positive. Thus, by Lemma 4.2 
and by (4.1) we have 0+0= WS) + ¥(T)>yANT)+ (SUT) 2=0+0, a con- 
tradiction. The same proof shows that if f and g are integer-valued, then there is 
an integer-valued feasible circulation. O 


Let D =(V, A) be a digraph with a specified source s and sink t. We assume, 
without restricting generality, that no arcs enter s and no arcs leave ¢. Let 
8:A—R, be a capacity function that is positive everywhere. A vector x: A>R, 
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is called a flow from s to ¢, or an st-flow, if 5, (v)=6,(v) holds for every 
vGV- {s,t}. A flow x is called feasible if O<x<g. 

It can easily be seen that for a flow x and for any sf-set S the netflow leaving S 
defined by 67 (S)—8;(S) does not depend on the choice of S. This common 
value 6 ; (s) is called the flow value of x and denoted by val(x). The value x(u, v), 
(u, v) € A, is called the arc-flow. A flow x is called a path-flow if x is positive only 
along a path from s to t. 

The fundamental theorem of network flows, called the max-flow min-cut 
(MFMC) theorem, is due to Ford and Fulkerson (1956) and to Elias et al. (1956). 


Theorem 4.3. The maximum value of a feasible st-flow is the minimum of 6; (S) 
over all st-sets S. If g is integer-valued, there is an integer-valued maximum flow. 


Proof. Let x be a feasible st-flow and S an si-set. Then we have val(x) = 67 (S) — 
5, (S) <6; (S) from which max < min follows. 

To see the other direction let m denote the minimum in question. Adjoin a new 
arc e = (t,s) to D and define f(e) := g(e):=m. Let f be zero on all old arcs. It is 
easy to see that (4.1) holds for that choice of f and g so by Theorem (4.1) there is 
a feasible circulation (integral if f and g are integral). This circulation, without the 
new arc e, is an st-flow of value m. O 


From the first part of the proof we see that an x is a maximum flow and 6; (S) 
is of minimum if and only if 

(a) x(a) = g(a) for every arc a leaving S and 

(b) x(a) =0 for every arc a entering S. 

These optimality criteria are crucial for the next algorithm due to Ford and 
Fulkerson. This provides an algorithmic proof of Theorem 4.3 for the case when 
the capacity function g is integer-valued (and thus, when g is rational), A 
refinement of that method, due to J. Edmonds and R. Karp and to E.A. Dinits, 
will provide a strongly polynomial algorithm and an algorithmic proof for 
arbitrary capacities. 


4.2. Augmenting paths method 


The algorithm of Ford and Fulkerson starts with an arbitrary feasible st-flow x 
(for example x = 0) and iteratively improves it. To describe one iteration let x be 
a feasible st-flow. Construct an auxiliary digraph D, = (V, A,) as follows. An arc 
(u, v) belongs to A, if either (i) (u,v) EA and x(u, v) <g(u, v) and then this arc 
of D, is called a forward arc, or (ii) (v, vu) € A and x(v, u) > 0 and then (u, v) is a 
backward arc. 

Let § denote the set of nodes reachable from s in D,. 

Case 1: t€S. Since no arc of D, leaves S, optimality criteria (a) and (b) hold 
and the algorithm terminates. 

Case 2: tE S. Let P be any path in D, from s to ¢. 

Let A, := min(g(u, v)—x(u,v): (u,v) is a forward arc of P) and A,= 
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min(x(v, #): (u,v) is a backward arc of P). Let A=min(4,,4,). Then A is 
positive. Call an arc of P critical if A is attained at that arc. 

Update x as follows. If (u,v) is a forward arc of P, increase x(u,v) by A. If 
(u, v) is a backward arc of P, decrease x(v, u) by 4. An easy consideration shows 
that the revised x’ is a feasible st-flow with val(x’) = val(x) + 4. Consequently, if g 
is integer-valued case 2 can occur only finitely many times. 

In case of rational g we can easily reduce the problem by multiplying through 
the components of g by the least common denominator. When g is irrational, the 
algorithm above may not terminate. An example to show this pathological 
situation is given in Ford and Fulkerson (1962) (a simpler one occurs in Lovadsz 
and Plummer 1986). Another drawback of the algorithm is that even for integer 
capacities the number of iterations may be proportional to the largest occurring 
capacity M, as the following example shows, see fig. 4.1. 

If among the possible augmenting paths we always choose the one of three arcs, 
then the flow augmentation A is just one in every step. Therefore the complexity 
of the algorithm is exponential in log M, the size of the input. 


4.3. The method of Edmonds, Karp and Dinits 


To overcome these difficulties Edmonds and Karp (1972) and Dinits (1970) 
proposed to choose a shortest augmenting path at each iteration. This simple 
modification makes it possible to bound the number of iterations in the Ford— 
Fulkerson algorithm by a polynomial of |V| and | A], irrespective of the capacities. 

Let o,(v) denote the distance of v from s in D,. (If there is no path from s to v, 
then o,(v) = ~). Let P be a shortest path in D, from s to t. Then for any arc (x, v) 
of P, a,(v)=o,(u) +1. 


Lemma 4.4. Performing an augmentation along P does not decrease a,(v). 


Proof. Let us consider how an augmentation affects D,. Since the flow has been 
changed only at the arcs of D corresponding to the arcs of P, D, may be changed 
at the arcs of P. Namely, the (possibly) new arcs of D, are the arcs of P in 
teverse Orientation (while the critical arcs of P disappear). The distance of a node 
v from s could decrease only if we adjoin an arc (u, w) for which o,(w) > 0,(u) + 1 
and the lemma follows. O 


The sequence of augmentations can be divided into phases. In one phase o,(¢) 
Temains the same. By Lemma 4.4 there may be at most |V|— 1 phases. 


Figure 4.1. 
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Lemma 4.5. Within one phase at most |A| augmentations may occur. 


Proof. Let o,(v) denote the distance of v from s in the auxiliary digraph at the 
beginning of a given phase /. Call an arc (u, v) i-tight if o,(v) = o,(u) + 1. Within 
phase i only i-tight arcs may be used. By Lemma 4.4 an augmentation eliminates 
at least one i-tight arc from the current auxiliary digraph and no new i-tight arc 
arises. Since an auxiliary digraph may have at most |A| arcs, the lemma 
follows. O 


By Lemma 4.5 the algorithm needs at most |V||A| augmentations. One 
augmentation can be performed in O(|V|) time so the overall complexity is 
O(|V|* |4l). 


Remark. The original augmenting paths method leaves freedom in choosing 
augmenting paths. The modification above imposes certain restrictions but still 
the algorithm may have different runs and, as a result, it may end up with 
different maximum fiows. The final minimum cut 5 *(S), however, provided by 
the augmenting paths method is independent of the run of the algorithm. It is 
easy to show that if both X and Y minimize 8, (Z ) over si-sets, then both XN Y 
and XUY are minimizing sf-sets and therefore there is a unique minimal 
minimizing set S. The augmenting path method ends up with this S. 


Since the algorithm of Edmonds, Karp and Dinits quite a few improvements on 
the complexity of max-flow min-cut algorithms have been devised. See Cherkas- 
skij (1977a), Malhotra et al. (1978), Galil (1980), Shiloach (1978), Sleator (1980). 

Max-flow algorithms using augmenting paths have the characteristic feature that 
the current flow is always changed in a “‘big piece”: along all the edges of an 
augmenting path. An important conceptional development was the introduction 
of preflows by Karzanov (1974). A preflow is a non-negative function x on the 
edges of a digraph so that 5; (v) = 8 (v) holds for every node v distinct from the 
source. A preflow-push algorithm changes the current preflow each time along 
just one edge. This is the basis of the greater flexibility and efficiency of 
preflow-push algorithms. See Shiloach and Vishkin (1982), Goldberg and Tarjan 
(1986), Cheriyan and Maheshwari (1989). Alon (1990) described a deterministic 
version of a randomized algorithm of Cheriyan and Hagerup (1990) whose 
complexity is O(m log n) 

The interested reader may find a much more detailed comparison of the 
max-flow algorithms in the survey paper of Goldberg et al. (1990) and in the 
textbook of Ahuja et al. (1993). 


4.4. Finding feasible circulations 


We have seen how the max-flow min-cut theorem could be derived from 
Hoffman’s circulation theorem. We show now the reverse direction. This way we 
will have a tool by which a feasible circulation can be found (or a set violating 
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(4.1)) with the help of one max-flow min-cut computation (on a slightly larger 
digraph). 

For simplicity we restrict ourselves to finite f and g (the general case is left to 
the reader). For uv EV denote y(v) =5, (v)—6 ; (v). If y is zero everywhere, then 
f is a feasible circulation. Otherwise, the sets S = (vu: y(v) > 0) and T= (vu: y(v) < 
0) are non-empty. Let D’=(V',A') where V'’=VU{s,t} and A’=AU 
(s,v):u ES U((v, t):v ET). Define a capacity function g’ as follows. g‘(s, v):= 
yv) if vES, g'(v,t)=—yv) if uET and g'(a)=g(a)—f(a) if aE A. Let 
M=Y (yv): vES). 


Lemma 4.6. (a) x és an st-flow of value M in D' (with respect to g') if and only if 
f +x (restricted circulation). 
(b) A set X CV violates (4.1) if and only if 5 (xX +s)<M. 


4.5. Other models and applications 


Via elementary constructions flows and circulations are equivalent. There are 
other, more sophisticated variants. For example, one can impose lower bound on 
the arc-flows in the maximum flow problem. Or, lower and upper bounds can be 
requested on the in-flows (6, (v)) at nodes. Moreover, instead of one source and 
one sink multiple sources and sinks can be specified. This generalization is 
sometimes called a transshipment problem. The transportation problem consists of 
finding a minimum cost degree-constrained subgraph of a bipartite graph. With 
relatively simple elementary constructions all these models go back to flows or 
circulations. We refer to the classical book of Ford and Fulkerson (1962). 

Next, we are going to survey some of the combinatorial consequences of the 
flow theory. 


Theorem 4.7 (Menger’s theorem arc-version). Let D =(V, A) be a digraph with 
two specified nodes s and t. The maximum number of arc-disjoint paths from s to t 
is the minimum of 5*(S) over all st-sets S. 


Proof. Apply Theorem 4.3 with g = 1 and notice that every flow from s to ¢ is the 
sum of path-flows from s to ¢ and a non-negative circulation. OO” 


In section 7 some other versions of Menger’s theorem will be discussed. 


Theorem 4.8 (Konig 1915). The maximum number of disjoint edges of a bipartite 
graph G =(V,,V,; E) is the minimum number of nodes covering all the edges. 


Proof. Orient the edges of G from V, to V,. Then extend G by two new nodes s 
and t and new arcs (s,v) (v EV,) and (v, 2)(u EV,). Let the capacities of all the 
new arcs be 1 and the other capacities M, a big number. A maximum st-flow of 
value k corresponds to k independent edges of G. By Theorem 4.7 there is an 


130 A. Frank 


st-set S for which 6 7 (S)=k. No arc of capacity M can leave S$ therefore 
(V, — S) U(V, OS) is a covering of E with cardinality k. O 


4.6. Gomory—Hu trees 


In many applications the ability to compute the minimum cut separating two 
nodes is more important than finding a maximum flow. In undirected graphs the 
minimum cuts separating every pair of nodes has an especially attractive 
structure. 

Let G =(V, E) be a connected undirected graph and g:E—R, a non-negative 
capacity function. Let A(u, v) denote the maximum flow value between u and v. 
We say that a set Z separating u and v is uv-minimal if d,(Z) is minimal over all 
subsets separating u and v. Equivalently, d,(Z) = A(u, v). 

A set X is called critical if X is uv-minimal for some u and v. In order to have 
some insight into the structure of critical sets, one may be interested in a list of 
critical sets that contains a uv-minimal set for each pair u, v. How short can this 
list be? By choosing a separate uv-minimal set for each pair there is a list of 
n(n — 1)/2 sets. But one can do much better. 

Let G, =(V, F) be any tree on node set V (not necessarily a subgraph of G). 
For every edge e in F let m(e):=d,(X,) where X, and V—X, are the two 
components of G, ~ e. G; is called a Gomory—Hu tree of G (with respect to the 
given capacity function g) if (a) for every pair {s,t} of nodes A(s,¢) is the 
minimum of m-values over the edges of the unique path in G; connecting s and t 
and (b) if e is an edge where the minimum is attained, then X, is st-minimal. 

For example, if G = K, , and g = 1, then a star of five edges forms a Gomory-— 
Hu tree (and there is no other one showing that a Gomory—Hu tree cannot be 
chosen, in general, as a subgraph of G). 


Theorem 4.9 (Gomory and Hu 1961). Every graph possesses a Gomory—Hu tree. 


We are going to consider only the case g=1. For general g the proof goes 
along the same line. We need the following terminology. A family ¥ of subsets of 
nodes is called laminar if for any two non-disjoint members of ¥ one of them 
includes the other. We say that ¥ separates nodes u and v if at least one member 
of ¥ separates u and v. 

Let ¥ be a laminar family and {u, v} a pair of nodes not separated by ¥. We 
say that a subset X of nodes separating u and v is uv-minimal with respect to $ if 
# U{X} is laminar and d(X) is as small as possible. Note that such an X can be 
computed by one MFMC computation in a graph obtained from G by contracting 
the complement of the smallest member X of ¥ containing u and v and 
contracting the maximal members of F included in X. 


Proof. Let us construct a laminar family ¥ of m —1 sets as follows. Let #, be 
empty. Suppose we have constructed a laminar family ¥,_, = {A,,...,A,_3} 
for some k=1,...,”%—1. Let {u,,v,} be any pair of nodes not separated by 
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F, 1. oe a set A, that is u,v,-minimal with respect to ¥,_, and let 
F, i= Sec 1U{A,}. 

Let F:=F,_,. Let E,:= {ujv;:i=1,...,n—1}. From the construction we 
see that T,:=(V, E,) is a tree. (This is just an auxiliary tree for the proof.) 
Claim 1. Let X be an xx'-minimal set and Y a critical set. If Y does not contain x 


and x', then either X — Y or X UY is xx'-minimal. If Y contains x and x', then 
either XM Y or Y — X is xx’-minimal. 


Proof. To prove the first statement suppose that Y is yy’-minimal. Then either 
Y — X or YX separates {y, y’}. In the first case one has A(x, x’) + A(y, y’) = 
d(X) + d(Y) = d(X — Y) + d(Y — X) = A(x, x’) + A(y, y’). Hence equality follows 
everywhere showing that X — Y is xx’-minimal. 

In the second case A(x, x’) t ACy, y') = @(X) + d(Y)>d(X UY) + d(XNY)= 
A(x, x') + ACy, y’). Hence equality follows everywhere showing that X UY is 
xx'-minimal, as required for the first statement. 

The second statement follows from the first one if we replace Y by its 

complement. O 
Claim 2. A; € ¥ is u;v,-minimal for each i=1,...,n—1. 
Proof. The claim is clear for i= 1. Suppose we have already proved it for 1, 
2,...,¢-1 and let x:=u,, x’:=v,. By induction on j=0,1,...,i-—1 we are 
going to show that (*) there is an xx’-minimal set X for which ¥ U {X} is 
laminar. From this the claim will clearly follow. 

(*) obviously holds for j =0. Suppose we have already shown (*) for some 
Q<j<i-1. Let X’ be the xx’-minimal set assured by Claim 1 when applied to X 
and Y:= A,. Then ¥ U {X’} is laminar, as required. O 
Claim 3. For every pair {s,t} of nodes there is an st-minimal member of F. 


Proof. Let P be the unique path in 7, connecting s and ¢ and let M:= 
min(A(u,,v;): u,v, an edge of P). By the MFMC theorem A(s, 1) = M. 

Let j be the smallest subscript for which u,v, is an edge of P and A(u,, u; Pi M. 
We claim that A; does not separate any other edge of P. Indeed, if A, separates 
an edge u,v, of P, then i<j by the construction of ¥. Furthermore, M< 
Ma, vu) <= d(A, ) = M and hence A(u,, v,) = M, contradicting the minimal choice of 


Therefore A, must separate {s,1} and hence we have M > A(u,,v,) = d(A,;)> 
A(s, 1) = M. We can conclude that A, is an st-minimal set. O 


Let Ay:=Vand ¥':= FU {V}. For each A, € ¥’ the union of maximal sets of 
F included in A, is precisely one element smaller than A,. Let ¢, denote this 
element and for i>1 let $,:=t, where A, is the unique minimal element of ¥’ 
including A,. Let F’:= ((s,, t,): i=1,...,2-1}. Then G,=(V, F’) is an ar- 
borescence such that each arc of it enters one member of ¥. Let G, denote the 
underlying (undirected) tree. By this construction and by Claim ‘5, G; is a 
Gomory—Hu tree. O 


The following corollary, due to Padberg and Rao (1982), found a nice 
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application in a linear programming approach to matching problems. It basically 
asserts that a Gomory—Hu tree encodes not only a minimum cut separating any 
given pair of nodes but also a minimum T-cut for any even subset T of nodes. 
(For the definition of T-cut and 7-join, see the chapter on matchings.) 


Corollary 4.10 (Padberg and Rao 1982). Let G; be a Gomory-—Hu tree and T an 
even subset of nodes. Then a minimum T-cut can be obtained by choosing an edge 
e of G; for which the cut determined by the two components of G,— eé is a T-cut 
and m(e) is as small as possible. 


Proof. Let C be a minimum T-cut and let J be the set of edges e of G, for which 
the cut C, determined by G,—e is a 7-cut. Clearly J is a T-join and therefore 
there is an edge e=uuGCNJ. We have |C|2=A(u,v) = m(u, v) =(C,|>=|C| 
showing that C, is also a minimum 7-cut. UO 


Another corollary states that if the degree of each node of a graph is at least k, 
then there are two distinct nodes which are connected by at least k edge-disjoint 
paths. Indeed, if u is a node having degree one in the Gomory—Hu tree and v is 
its neighbour in the tree, then uw and v will do. 


5. Minimum cost circulations and flows 


5.1. Min-cost circulations 


In the previous section we got to know how to find feasible circulations. It is not a 
less striking problem to find a possible circulation that minimizes a specified linear 
cost function. To attack this problem let us consider the set Q of all feasible 
circulations. Q forms a polyhedron in R® called a circulation polyhedron and 
denoted by C(D; f, g). (That is, C(D; f, g) = {xER*: 8, (v) =62(v) for vEV 
and f(u, v) =x(u, v) < g(u, v) for (u,v) € A}. @ is said to be integral if f and g are 
integer-valued. 

Theorem 4.1 implies that a (non-empty) integral circulation polyhedron 
contains an integer point. Since the face of an (integral) circulation polyhedron is 
obviously an (integral) circulation polyhedron, we have proved the following. 


Theorem 5.1. Every face of an integral circulation polyhedron Q contains an 
integer point. 


Let Q = C(D; f, g) be a non-empty circulation polyhedron and c: AR a cost 
function. The cost cx of a circulation x is defined by }: (x(a)c(a): a € A). What is 
the minimum cost of a feasible circulation and when does this minimum exist? 

Define a digraph D’ = (V, A’) and a cost function c’ on A’ as follows. An arc 
(u, v) belongs to A’ if either (i) (v, u«) € A and f(v, u) = —@© or (ii) (4, v) E A and 
g(u, v) =~, In case (i) let c’(u, v) = —c(v, wv), in case (ii) let c’(u, v) = c(u, v). 
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Theorem 5.2. The following are equivalent: 
(1a) There is a feasible circulation of minimum cost. 
(1b) There is no negative circuit in D' with respect to c’. 
i (1c) There is a potential 7:V—R such that 


a(v) — w(u) <c(u,v) whenever (u,v) € A, glu, v) =~ (5.1i) 
and 
a(v) — 7(u) 2 c(u,v) whenever (u,v) EA, f(u,v)=-%. (5.1ii) 


Note that a potential satisfying (1c) can be found (if one exists) by a min-cost 
path computation. 

How can we characterize optimal feasible circulations? For x€@Q define a 
digraph D, = (V, A,) and a cost function c, as follows. Let an arc (u, v) belong to 
A, if either (i) (u, v) € A, x(u, v) < g(u, v) and then let c,(u, v) = c(u, v) (forward 
arc of A,) or (ii) (¥,u)€ A, x(v, u) >f(v,u) and then let c,(u,v) = —c(v, u) 
(backward arc of A,). 


Theorem 5.3. For a feasible circulation x the following are equivalent: 
(2a) x is of minimum cost. 
(2b) There is no negative circuit in D, with respect to c,. 
(2c) There is a potential 7:V—R such that 
a(v) — rtu)<c(u,») if x(u,v)<g(u,v)(u, v) © A) (5.21) 
and 


a(v) — 7(u)=c(u,v) if x(u,v) > flu, v)((u, v) € A). (5.2ii) 
We call (5.2i) and (5.2ii) optimality criteria. 


5.2. Min-cost circulation algorithm 


The algorithm (Ford and Fulkerson 1962) starts with a feasible circulation x and a 
potential 7 satisfying (1c). If x and 7 satisfy (2c) as well, we are done. Otherwise, 
let (t,s)EA violate, say, (5.2i), that is x(t,s)<g(t,s) and m(s) — a(t) > c(t, s). 
(The case when an arc violates (5.2ii) is analogous.) , 

Define a digraph D, = (V, A,) and a capacity function g,: A, R, as follows. 
An arc (u,v) belongs to A, if either 


(u,v) EA, rv) — wu) =c(u,v) and x(u,v)<g(u,v) (3i) 
or 
(v,u)E A, r(u)— z(v)<cv,u) and x(v,u)>f(v,u). (3ii) 


Let 8&,(u, v) = g(u, v) — x(u, v) in case (i) and g,(u, v) =x(v, u) — fv, u) in case 
(ii). Let M denote the maximum value of an st-flow and let 4 = min(M, g(t, s) — — 
x(t, s)). With the help of a max-flow min-cut computation determine an st-flow z 
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of value A. Revise x as follows. 
x‘(u,v)=x(u,v)+A_ if (u,v) isa forward arc of D, , 
x'(u,v)=x(u,v)—-A_ if (v,u) isa backward arc of D, . 


Claim. x’ is a feasible circulation. An arc (u,v) € A Satisfies (5.2i) and (5.2ii) with 
respect to x’ if it satisfies with respect to x. 


There may be two cases. 

Case 1: val(z) = g(t, s) — x(t, 5). In this case arc (t,s) no longer violates the 
optimality criteria. 

Case 2: M(=val(z)) < g(t, s) — x(t, 5). Let S be the sf-set determined by the 
max-flow min-cut computation. Let ¢, = min(c(u, v) — m(v) + m(u): (u, v) leaves 
S, x(u, v) <g(u, v)). Let e, = min(a(u) — 7(v) + clu, v): (u, v) enters S, x(u, v) > 
fu, v)). Define ¢ = min(e,, €,) and revise 7 as follows: '(v) = w(v) + « if uF S 
and =n(v) if vES. 

Repeat the procedure with x’ and 7’. 

It can be shown that this algorithm is finite for any cost and capacity functions. 
It is not necessarily of polynomial time, however, even if c, f, g are integer- 
valued. We describe a machinery, the scaling technique, to make the above 
algorithm of polynomial time. The method is due to Edmonds and Karp (1972). 


5.3. Scaling technique 


Suppose that the cost function c is integer-valued and that a feasible circulation x 
and an integer-valued potential 7 satisfying the optimality criteria are available. 
Let c’ be another cost function differing from c on one arc by 1. The basic 
observation is that the algorithm above finds in polynomial time a feasible 
circulation x’ and an integer-valued potential 7’ satisfying the optimality criteria 
with respect to f, g, c’ provided that we start with the available x and 7. Indeed, 
if case 2 occurs, then e = 1 and arc (t,s) violates no longer the optimality criteria. 
In other words, with one max-flow min-cut computation (whether case 1 or case 2 
occurs) the required x’ and 7’ can be obtained. 

Consequently, if c” is an integer-valued cost function differing from c on every 
arc by at most one, then, starting with x and 7, at most |A| max-flow min-cut 
computations yield an x” and 7” satisfying the optimality criteria with respect to 
Coe 

Another, trivial, observation is that if x and a satisfy the optimality criteria 
with respect to c, then so do x and 27 with respect to 2c. 

Assume now for convenience that c is non-negative and let c be given in binary 
base. Let the maximum of c(a) have K digits. Then there are K 0-1 vectors 
Cos--+yCx-, in Z4 such that c= Y (2’¢,:i=0,..., K-1). 

First solve the min-cost circulation problem for cy_,. This needs at most |A| 
MFMC computations. Let the solution be x,_, and ,_,. Starting with x,_,, 
27,_, solve the min-cost circulation problem for 2cx_, + Cx_z. This also needs at 
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most |A| MFMC computations. Continuing this way, after at most K|A| 
applications of the MFMC algorithm, we obtain a feasible circulation x and a 
potential 7 satisfying the optimality criteria with respect to c. 


Remark. Let us draw attention to a small technical difficulty which can, however, 
be easily overcome: it may happen that there is a min-cost circulation with respect 
to c but there is none with respect to an intermediate cost function. 


5.4, Strongly polynomial algorithm 


Comparing the complexity of the scaling technique and the maximum flow 
algorithm of Edmonds, Karp and Dinits there is a significant difference. Namely, 
the complexity of the latter algorithm does not depend on the magnitude of the 
numbers (if we assume that adding and comparing two numbers is one step) and 
in this sense this algorithm is “strongly’’ polynomial while the complexity of the 
scaling technique is proportional to the number of digits. 

Tardos (1985) was the first who constructed a strongly polynomial algorithm for 
finding a min-cost circulation. Since her work many other strongly polynomial 
algorithms have been developed. The fastest one is due to Orlin (1988). Here we 
briefly outline the algorithm of Goldberg and Tarjan (1989) that seems to be 
conceptionally the most attractive. 

Optimality criterion (2b) suggests the following procedure. Start with a feasible 
circulation x. If (2b) holds, x is optimal. Otherwise, choose a circuit C in D, 
violating (2b) and cancel along C. Canceling along C means that we increase 
x(u, v) by A if (u, v) is a forward arc of C and decrease x(u, v) by A if (v, uv) is a 
backward arc of C. Here A is the smaller value of min(g(u,v) —x(u,v): uv a 
forward arc of C) and min(x(u, v) — f(u, v): (v,u) a backward arc of C). 

Clearly, the modified x’ is a feasible circulation and its cost is smaller than that 
of x. The algorithm consists of repeating this canceling procedure as long as (2b) 
is violated. This procedure is not necessarily of polynomial time. However, 
Goldberg and Tarjan proved that the following selection rule makes the algorithm 
strongly polynomial: each time choose a circuit C in G, to be one of minimum 
mean cost. In section 3 we indicated how to compute such a circuit. 

A beautiful feature of the algorithm of Goldberg and Tarjan is that it can be 
considered as a straight generalization of the Edmonds—Karp-Dinits algorithm 
for computing a maximum flow. Indeed, in the proof of Theorem 4.3 it was shown 
how a max-flow problem can be formulated as a minimum cost circulation 
problem. The Goldberg—Tarjan algorithm, when applied to this special min-cost 
circulation problem, yields precisely the Edmonds—Karp-—Dinits algorithm. 


5.5..Minimum cost flows 


Let us be given again a digraph D=(V, A) with a source s and a sink 4. A 
non-negative capacity function g and a non-negative cost function c are given on 
A. We assume that both g and ¢ are integer-valued. We have seen how to compute 
the maximum value M of an st-flow. This time we are interested in finding a 
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minimum cost st-flow of value m for all possible integers m, 0 <m = M. The cost 
of a flow z is defined by cz = }) (c(e)z(e): e € A). We say that an st-flow z is a 
min-cost flow if z has the minimum cost among the feasible st-flows of value 
val(z). 

We have seen the equivalence between the feasible circulation problem and the 
maximum flow problem. Using the same elementary construction the min-cost 
st-flow problem could be solved in strongly polynomial time with the help of a 
strongly-polynomial min-cost circulation algorithm. 

Here we briefly survey a direct algorithm due to Ford and Fulkerson (1962). 
This algorithm is strongly polynomial only if the capacities are small integers. The 
reason why we include this algorithm is that it has a nice combinatorial 
application. 

A flow z is of minimum cost if and only if there is a function 7: V— Z,, called 
a potential, (m(s)=O0<a()<-7(t) for v€V) for which the following two 
optimality criteria hold: 


mv) — w(u) <c(u,v)> z(u,v) =0, (5.3i) 
av) — m(u) > c(u, v) > z(u, v) = gu, v). (5.3ii) 


We use the notation ¢(u, v) = c(u,v) — 7(v) + m(u) for (u,v)E A. The method 
(Ford and Fulkerson 1962) can be considered as a refinement of the max-flow 
min-cut algorithm of Ford and Fulkerson. It constructs a min-cost flow for all 
possible (integer) flow values m. 

The algorithm starts with the identically zero flow and the identically zero 
potential. Then the flow value is increased one by one and the potential is 
appropriately increased so that the optimality criteria are throughout maintained. 
The algorithm terminates when a maximum flow (and a minimum cut) is found. 

Iterative step. At the general step we are given a flow z and a potential, 
satisfying (i) and (ii). Construct an auxiliary digraph D’ = (V, A’) as follows. D’ 
has two types of arcs: forward and backward. An arc (u,v) is a forward arc if 
uv EA, c(u,v)=0 and z(u,v)<g(u,v). An are (u,v) is a backward arc if 
(v, u) EA, cv, vu) =0 and z(v, u) > 0. Let S be the set of nodes reachable from s 
in D’, There are two cases. 

Case 1: t¢S. Define e, = min(¢(u, v): (u, v) € 5 *(S), z(u,v) <g(u,v)) and 
€, = min(—C(u, v): uv €5 *(V— S), z(u, v) > 0) where the minimum is defined to 
be & if it is taken over the empty set. Let « = min(e,, €,). The optimality criteria 
and the construction of S imply that « is positive. 

If ¢ =~, the algorithm terminates since we have & (S) = val(z) and thus the 
current fiow z is maximum and 8*(S) is a minimum cut. 

If e<©, revise a by increasing m(v) for every v GV—S by e. 


Claim. The revised potential and the unchanged flow satisfy the optimality criteria. 


Repeat the procedure. Observe that in the new auxiliary digraph the set of 
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reachable nodes from s is strictly larger than S. Therefore, after at most |V|—1 
occurrences of case 1 either s = © or case 2 occurs. 

Case 2: t€ S. Let P be a path in D’ from s to t. Modify z as follows. Let 
2'(u, v) = z(u,v) +1 if (u,v) is a forward arc of P and let z'(u, v) = z(u, v) — 1 if 
(v, u) is a backward arc of P. 


Claim. The revised flow and the unchanged potential satisfy the optimality criteria. 


What can we say about the complexity of the algorithm? We need roughly M 
flow augmentations. So the algorithm is polynomial if all the M minimum cost 
flows (of value 1,2,...,M) are required. The algorithm is not necessarily 
polynomial if one wants to compute only a min-cost flow of value M since the 
complexity is proportional to M. 

If the maximum capacity is not too big (namely, its value can be bounded by a 
polynomial of |V|, then the algorithm is (strongly) polynomial, irrespective of the 
cost function. 


5.6. An application to partially ordered sets 


Let P={p,, P2,...,p,} be a partially ordered set. Dilworth’s (1950) famous 
theorem asserts that the maximum cardinality a of an antichain is equal to the 
minimum number of covering chains. Another result of this type asserts that the 
maximum cardinality c of a chain is equal to the minimum number of covering 
antichains. In this section we discuss common generalizations of these results. 

For a family @ = {B,, B,,..., B,} denote (J@ = U(B;:i=1,...,k). By a 
chain family €,={C,,C,,...,C,} we mean a set of y disjoint non-empty 
chains. Let C, denote the set of chain families of y chains and C the set of all 
chain families. Let c, = min(|UC,|: € €C,). 

By an antichain family 3, ={A,,A,,..-,A,} we mean a set of a@ disjoint 
non-empty antichains. Let A, denote the set of antichain families of a antichains 
and A the set of all antichains. Let a, = min(|\)#,|: 4, €A,). 

, By Dilworth’s theorem c, =n, by its polar a, =n. What can be said about c, 
(1<y <a) and about a, (1<a <c)? 


Theorem 5.4a (Greene and Kleitman 1976). a, = min(qa + |P— U&,|: €, EC). 
Theorem 5.5a (Greene 1976). c, = min(qy + |P — U,|:0, GA). 


Since a chain and an antichain can share at most one element, a, and c, do not 
exceed the minimum in question. 


Definition. An antichain family #, = {A,,A,,...,A,) and a chain family ©, = 
(Ci5.G ota ey C,} are said to be orthogonal if 


P=(Us#s,)U(U4,) (a) 
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and 

A,NC,#@ forl<i<a,1<j<y. (b) 
The non-trivial parts of Theorems 5.4a and 5.Sa can be reformulated as follows. 
Theorem 5.4b. For every a, 1<a<c, there are 4, EA, and €,EC, for some y, 
which are orthogonal. 


Theorem 5.5b. For every y, 1<y <a, there are €, EC and 4, €A, for some a, 
which are orthogonal. 


A common generalization is due to Frank (1980). 


uae 5.6. There exists a sequence €,|s,,4, ,..., H,.|€,-1,€.-2 5 +++ 
€,- = 1%, +_>-+-+ Which arises as a combination of two sequences 6,1 Cg-19 +++ 
and 4,,A,,...,A,, where € EC and 4, €A, with the property that any member 


of the sequence ‘(whether €, or s,) is orthogonal to the last member of other type 
preceding it. (That is, &,,H,..., 4, are orthogonal to €, and 


¢ a 


€a-11 €a-25- ++» €,_;, are orthogonal to x, , and so on.) 
Proof. Associate a digraph D=(V,A) with P where V:= {s,t,x,,x.,..., 
Xny Vir Y20---Ynt, and A:= {((s,x,):§=1,2,...,n}U {((y,, 9:2=1,2,..., 7} 
U {(x,, y;): if p, = p,}. Define all arc capacities g(a) to be 1, while the costs are 
c(e) =1 if e=(x,, y,) and 0 otherwise. 

Apply the min-cost flow algorithm to this network and let z and 7 be a flow 
and a potential at an intermediate stage of the algorithm. By analyzing the effect 
of a flow augmentation and a potential change and using the optimality eniienta 
the following lemma can easily be proved. 


Lemma 5,7. (a) Either a(y;) = a(x;) or w(y;) = m(x;) + 1. 
(0) If p,>p, and z(x,, y,)=1, then a(x,) = m(y,). 


The arcs (x,, y;) <j) for which 2(x,, y;)=1 correspond to a chain family €, 
yay y =n-—val(z). For a = 2(t) define a ‘family H,={A,,A,,...,A,} where 
A,= {p;: a(x;) + 1 = a(y,) =i}. 
Lemma 5.8. x, is an antichain family and is orthogonal to €,. 


The proof easily follows from the optimality criteria and from Lemma 5.7. 


Now the Ford—Fulkerson algorithm and Lemma 5.8 immediately imply 
Theorem 5.6. O 
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6. Trees and arborescences 


6.1. Minimum cost trees and arborescences 


Given a connected graph G=(V,£) and a cost function c:E—>R, find a 
minimum cost spanning tree. This problem is one of the earliest combinatorial 
optimization problem that has been solved. For an excellent historical survey, see 
Graham and Hell (1985). For algorithmic details, see Tarjan (1983). 

The following property of trees is crucial. 


Lemma 6.1. Let T, and T, be the edge sets of two trees on the same node set. Then 
for any edge e €T, there is an edge f © T, such that both T, -e+ f and T, —f+e 
are trees. 


Proof. If e€7,, then f:=e will do. Suppose that e=st¢T7T,. T,—e has two 
components C, and C,. T, contains a path P connecting s and ¢. Let f be an edge 
of P that connects C, and C,. This f satisfies the requirement of the lemma. O 


Let T be a spanning tree of a graph G =(V, E). A fundamental cut belonging 
to an element e of F is a cut of G determined by the two components of T—e. A 
fundamental circuit belonging to an edge f = uv € E — T is a circuit consisting of f 
and the unique path in T connecting u and v. Clearly, an edge e € T is in the 
fundamental cut of an edge f € E — T if and only if e is in the fundamental circuit 
of f. 

Lemma 6.1 immediately implies the following. 


Theorem 6.2. For a spanning tree T of G the following are equivalent: 
(a) T is of minimum cost. 
(b) c(e)<c(f) for any edge e ET and edge f of the fundamental cut of e. 
(c) c(e)=c(f) for any edge e T and edge e of the fundamental circuit of f. 


* One of the simplest (and earliest) algorithms in combinatorial optimization is 
the greedy algorithm to construct a minimum-cost spanning tree of a connected 
graph G =(V, E). 


Greedy algorithm (Boruvka 1926, Kruskal 1956). The procedure consists of 
building a spanning forest by adding edges one by one. It starts with a forest of 
node-set V that has no edges and stops when the current forest is a spanning tree. 
The general step consists of adding an edge of minimum cost that connects two 
distinct components of the current forest. 


There is another version of the greedy algorithm. 


Dijkstra—Prim algorithm (Dijkstra 1959, Prim 1957). Choose an arbitrary node 
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X,. Starting at x, build a tree edge by edge. At a general step choose a least cost 
edge to be added that has exactly one end in the current tree. 


The following algorithm does not build a tree or forest directly but gets rid of 
edges of big cost and the remaining graph is the desired tree. 


Reverse greedy algorithm. The procedure consists of discarding edges one by one 
so that the remaining graph is connected. At a general step choose an edge of 
maximum cost that is not a cut-edge of the current graph and delete it. The 
algorithm stops when the remaining graph is a spanning tree. 


All of these algorithms can be formulated in a general framework. 


General algorithm. The algorithm consists of applications of the following two 
operations in arbitrary order. The first operation builds a spanning forest F by 
adding edges one by one while the second operations deletes edges one by one. 
More precisely, let F denote the forest already constructed (at the beginning F is 
the forest of no edges.) 

Step 1. If F is a spanning tree, halt. Otherwise, choose an arbitrary cut B 
disjoint from F and let e€ B be a cheapest edge of B. Add e to F. 

Step 2. Choose an arbitrary circuit C (if there is none, step 2 does not apply 
any longer) and let e € C be the most expensive edge of C — F. Delete e from G. 


Theorem 6.3. The final tree of the algorithm is of minimum cost. 


Proof. Any stage of the algorithm can be specified by a pair (F, D) of disjoint 
subsets of E where F denotes the forest constructed so far and D denotes the set 
of edges deleted so far. : 

We prove by induction that at each stage (F, D) of the algorithm there is a 
minimum cost tree T of G for which FC TC E — D. Any min-cost tree will do 
when F = D =@. Suppose we have already proved the statement for (F, D), that 
is, there is min-cost tree T with FC TC E-—D. 

Assume first that step 1 is applied and let e € B be the newly added edge and 
F':=F +e. Ife € T, we are done. Otherwise, let C, be the fundamental circuit of 
e with respect to 7. Edge e is in cut B and in circuit C, therefore there must be 
another edge f in BM C,. Since e,f € B, by the rule in step 1 we have c(e) <c(f). 
Since e, f € C, and T is of minimum cost we have c(f) = c(e). Hence c(e) = c(f) 
and T’:= 7 —f +e is another min-cost tree for which F'C T’ CE — D. 

Second, assume that step 2 is applied and let e€ C be the newly deleted edge. 
If T does not contain e, we are done. Otherwise let B, be the fundamental cut of 
e belonging to T. There is an edge f #e with f €&CNMB,. Since e, f EC, by the 
tule in step 2 we have c(e) >c(f). Since e, f € B, and T is of minimum cost we 
have c(e) <c(f). Hence c(e)=c(f) and T’:= T—f +e is another min-cost tree 
for which FC T’CE-(D+e). O 
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Remark. The above algorithm can be extended to matroids. That is, there is a 
greedy algorithm for finding a minimum cost basis of a matroid. (See chapter 11.) 


‘ Let us turn to a directed counter-part of the minimum cost spanning tree 
problem. Let us be given a digraph D = (V, A) and a cost function c:A—>R,. 
Assume that every node can be reached from a specified node s, that is, D 
includes a spanning s-arborescence. Our next problem is to find a minimum cost 
s-arborescence. 

This problem has been solved by Fulkerson. Note that the min-cost tree 
problem can be reduced to a min-cost arborescence problem: replace each edge of 
G by a pair of oppositely directed arcs. 

Call a set-function z:2" “>R.,, c-feasible if 


c(a) = > (z2(X): aentersX) foreveryaGA. (6.1) 


Theorem 6.4 (Fulkerson 1974). The minimum cost of a spanning s-arborescence is 
max( >, (z(x): X CV—s): z is c-feasible). Furthermore, if c is integer-valued, the 
optimal z can be chosen integer-valued. 


Proof. Let F be a spanning s-arborescence and z a c-feasible vector. We have 
c(F) = > (c(a): @€ F) 
=> (x (z(x): a enters X): a € F) 
=D (2(X): X CV-s) (6.2) 


from which max<min follows. In (6.2) we have equality if the following 
optimality criteria hold. 


c(a) = > (z(X): a enters X) foreveryaEF, (6.3a) 
: 2(X) > Oimplies 6 ,(X)=1. (6.3b) 


The algorithm below finds a spanning s-arborescence F and a feasible.z for which 
(6.3a—b) holds. It consists of two parts. The first part constructs z while the 
second constructs F. In the course of the first part we revise the cost function. The 
current cost function is denoted by c’. We call an arc a a O-arc if c’(a) = 0. 

Part 1. Iterate the following step. Choose a minimal set X CV—s with no 
entering 0-arc. Define z(X):=min(c’(a): @ enters X) and revise c’ as follows. 
c'(a):=c'(a) — z(X) if a enters X. The new c’ is non-negative and its value is zero 
On at least one more arc. 

Part 1 terminates if every set X CV—s has an entering 0-arc, equivalently, 
there is a spanning s-arborescence consisting of 0-arcs. 

Part 2. Starting at s and using only 0-arcs build up a spanning s-arborescence F. 
If, during the building process, there is more than one 0-arc leaving the sub- 
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arborescence already constructed, choose that one which became a 0-arc earliest 
during the first part. 
Obviously the constructed z is c-feasible and (6.3a) holds. 


Lemma. z and F satisfy (6.3b). 


Proof. Let X be a set with z(X)>0. If, indirectly, 6(¥)> 1, there is a moment 
during part 2 when a sub-arborescence F’ is at hand for which 6 ;.(X) = 1 and the 
arc e currently added to F’ enters X. Consider the moment of part 1 when z(X) 
became positive. Then no 0-arc entered X and every proper non-empty subset of 
X had an entering 0-arc. In particular, there is a 0-arc f entering X — V(F') which 
does not enter X. Therefore, when z(X) became positive, f was a 0-arc while e 
was not and we are in a contradiction with the rule of part 2: arc f should have 
been chosen instead of e. O 


A related problem is as follows. Given a digraph D = (V, A) and a cost function 
d on A, find a maximum cost branching. We call a pair (p, y) a covering, where 
p:V—>R,, is a non-negative function on V and y: 2”->R, a non-negative function 
on the subsets of V if d(u, v) <p(v) + X ((B): uve B CV) for every (u,v) € 
A. The value of a covering is )) (p(v):v EV) + y (y(B)(jB| - 1): BCV). 


Theorem 6.5 (Edmonds 1967, Chu and Liu 1965). The maximum cost of a 
branching of D is the minimum value of a covering. If d is integer-valued, the 
optimal covering can be chosen integer-valued. 


Proof. Obviously max = min. To see the other direction, extend D by a new node 
s and new arcs (s,v) (v EV). Define a cost function c on the arcs of the extended 
digraph D’', as follows. Let the cost of the new arcs of D’ be M := max(d(a): 
ae A) and c(a) = M — d(a) (a€ A). By Theorem 6.4 there is a c-feasible vector 
z:2"—R, and a spanning s-arborescence F of D’ satisfying 6.3. Define p(v):= 

M —¥.(2(B): v € B) and for |B|>1 define y(B):=2(B) (BCV). It is straight- 
forward that (p, y) is a covering and its value is equal to the d-cost of the 
branching FN A of D. O 


6.2. Packing and covering 
A basic result on this field is due to Edmonds (1973). 
Theorem 6.6. Given a digraph D =(V, A) with a specified node s, there are k 


pairwise arc-disjoint spanning arborescences rooted at s if and only if 6 (X)2k 
for every XCV—s. 


Proof (Lovasz 1976a, sketch). Starting at s we build up an arborescence F such 
that 


6,-AX)2k—-1 foreveryXCV-s. (*) 
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By induction on k, this will prove the sufficiency of the condition. Suppose that F 
is sub-arborescence satisfying (*) which is not spanning (that is, V(F)~V). Calla 
set XCV~—S critical (with respect to F) if it satisfies (*) with equality. By 
submodularity, if X and Y are critical and XO Y “6, then XN Y and X UY are 
critical. By the hypothesis there is no critical subset of V- V(F). Let M be a 
minimal critical subset for which M — V(F) #9. Then there is an arc (u, v)E A 
such that u€ MN V(F) and vu € M — V(F). This arc does not enter any critical set 
therefore the sub-arborescence F’ = F + (u,v) continues to satisfy (*). (If no 
such an M exists, any arc (u,v) leaving V(F) will do.) O 


Note that Theorem 6.6 of Edmonds immediately implies the directed arc- 
disjoint version of Menger’s theorem. Indeed, adjoin & parallel arcs from ¢ to v 
for every v ©V— {s,t} and apply Edmonds’ theorem. 

The following conjecture is a kind of node-disjoint counterpart of Theorem 6.6. 


Conjecture 6.7. (a) Suppose that, given a digraph D=(V, A) and a specified 
node s € V, there are k openly disjoint paths from s to any other node of D. Then 
there are k arc-disjoint s-arborescences of D such that for any node v €V—s the 
k paths from s to v uniquely determined by the arborescences are openly disjoint. 

(b) Suppose that, given a graph G = (V, £) and a specified node s EV, there 
are k openly disjoint paths from s to any other node of G. Then there are k 
spanning trees of G such that for any node u€@V—s the k paths from s to uv 
uniquely determined by the & trees are openly disjoint. 

(c) The same as (b) except replace “‘openly disjoint” by “edge-disjoint”. 


Conjecture 6.7 (a) easily implies Conjecture 6.7 (b). Whitty (1986) proved 
Conjecture 6.7 (a) for k = 2 while Conjecture 6.7 (b) has been proved for & = 3 by 
Cheriyan and Maheshwari (1988) and by Zehavi and Itai (1989). Conjecture 6.7 
(c) is proved for k = 2 (using the ear-decomposition of 2-edge-connected graphs). 

Recently, A. Huck proved (a) for acyclic digraphs and disproved it for general 
digraphs when k = 3. 

One may be interested in finding k arc-disjoint spanning arborescences which 
need not be rooted at the same node. 


Theorem 6.8. There are k arc-disjoint spanning arborescences if and only if 
> 8 (X,)=k(t-1) 
i=l 


for every family of disjoint non-empty sets X 1) Xq90 0 Xp 
More generally, the arborescence packing problem can be solved when lower 
and upper bounds are imposed at every node for the number of arborescences 
rooted at that node. 
Another extension of Edmonds’ theorem is due to Schrijver (1982). Let 
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D =(V, A) be a digraph and let V,, V, be a bipartition of V. Call a subset B of arcs 
a bi-branching if 6 ,(X)21 for every XCV, and X DV,. 


Theorem 6.9. (Schrijver 1982). There are k arc-disjoint bi-branchings if and only 
if 8 (X) =k for every X CV, and for every X DV,. 


When |V,|=1 we are back at Theorem 6.6. When both V, and V, are 
independent K6nig’s edge coloring theorem is obtained as a special case. (See 
chapter 3.) 

Schrijver used this result to prove the following conjecture of Woodall for 
acyclic digraphs in the special case where each sink can be reached from each 
source. 


Conjecture 6.10. In an acyclic digraph if every directed cut contains at least k 
arcs, there are k disjoint sets of arcs each of which covers all directed cuts. 


A counterpart of Theorem 6.6 is due to Vidyasankar (1978). 


Theorem 6.11. Let s be a specified node of a digraph D = (V, A) with no entering 
arc. The arcs of D can be covered by k spanning s-arborescences if and only if (i) 
8” (v)<k for vEV—s and (ii) k—8 (X)<¥ (k~-8 (v):u E T(X)) for every 
XCV-s where T(X):= {v © X: there is an arc (u,v)EA with uEV— X}. 


Proof, By elementary construction. For every v © V—s adjoin to D a copy of vu, 
denoted by v’, and & parallel arcs from v to v' and k — 6 (v) parallel ares from v’ 
to uv. Furthermore, for (u,v)€A adjoin k parallel arcs from u to v’. Apply 
Theorem 6.6 to the extended digraph D’ and observe that & arc-disjoint spanning 
arborescences in D' correspond to & covering s-arborescences in D. Moreovet, if 
5" (X') <k for some X’CV’ — 5, then X = {vy EX’: uv’ £ X’} violates (ii). O 


Another interesting consequence of Edmonds’ theorem is the following. 


Theorem 6.12. The arc-set of a digraph D = (V, A) can be covered by k branchings 
if and only if (i) 8 (v) <k for every v EV and (ii) |A(X)| = &(|X| — 1) for every 
X CV (where A(X) denotes the set of arcs induced by X). 


Proof. By elementary construction. Adjoin a new node s to V and for vEV 
adjoin k — 8 (v) parallel arcs from s to v. In the new digraph D’ we have 


6’ (X)=6 (X)+ 0 (k-6-(@): vEX) 
=8 (X)-8 (X)-|A(X)|+k|X| 2k 


for every X'CV. By Theorem 6.6 there are k arc-disjoint spanning s-arboresc- 
ences in D'. These determine & covering branchings of D. O 
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For undirected graphs we have the following (historically earlier) theorem by 
Nash-Williams (1964). 


Theorem 6.13. The edge set of an undirected graph G = (V, E) can be covered by 
k forests if and only if |E(X)| = k(|X|—1) for every XCV. 


Proof. Theorem 6.12 and the following easy lemma imply the result. (© 


Lemma. The edges of a graph G = (V, E) have an orientation for which 8 (v) <k 
for every v EV if and only if 


|E(X)| <k|X| foreveryXCV. (+) 


Proof. (Sufficiency) In an orientation of G call a node v bad if 6 (v) > k. Choose 
an orientation where the “badness” )) (6 (v) — k: uv bad) is minimal. If there is 
no bad node, we are done. Otherwise, let t be a bad node and let X be the set of 
nodes from which ¢ is reachable in the current orientation. Then X contains a 
node s with & (s)<k since otherwise |E(X)| =) (6 (v):v EX) >k|X|, con- 
tradicting (*). Reorienting the arcs of a dipath from s to ¢ results in an orientation 
with smaller badness. UO 


(Note that this lemma easily follows from K6nig’s theorem, as well). For 
connected graphs the problems of covering the edge set by k forests or by k 
spanning trees are clearly equivalent. The packing problem of spanning trees was 
solved by Tutte (1961a). 


Theorem 6.14. A connected graph G=(V,E) contains k pairwise edge-disjoint 
spanning trees if and only if e, = k(t — 1) holds for every partition {V,,V,-..,V,} 
of V (V, #®) where e, denotes the number of edges connecting different V,. 


Remark. Edmonds extended Tutte’s theorem to matroids by providing a good 
characterization of the existence of k disjoint bases of a matroid. See chapter 11. 


7. Higher connectivity 


7.1 Connectivity between two nodes 


We start this section with a result which is undoubtedly the central theorem of this 
whole chapter, the Menger (1927) theorem. In what follows s and ft are two 
specified nodes of the graph or digraph in question. A set of paths is called openly 
disjoint if the paths are pairwise disjoint except, possibly, for their end nodes. 


Theorem 7.1. (a) Jn a digraph (graph) the maximum number of arc-disjoint 
(edge-disjoint) st-paths is equal to the minimum number of arcs (edges) covering all 
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st-paths. (Moreover, the minimum is attained on a set of type A*(S) where SCV is 
an st-set.) 

(b) In a digraph (graph) if there is no arc (edge) from s to t, the maximum 
number of openly disjoint st-paths is equal to the minimum number of nodes 
distinct from s and t covering all st-paths. 


Actually this is four theorems according to whether we consider directed or 
undirected and edge-(arc-)disjoint or openly disjoint st-paths. Menger originally 
proved the undirected, openly disjoint version. 


Proof. We have already seen two proofs for the arc-disjoint case (as a conse- 
quence of the max-flow min-cut theorem and a consequence of Edmonds’ disjoint 
arborescences theorem). From this the other three cases follow by elementary 
construction. Namely, in case (a) replace each edge by a pair of oppositely 
directed arcs and observe that if there is a set of k arc-disjoint paths in the 
resulting digraph, then there is one that does not use both arcs assigned to an 
original edge. The same construction yields the undirected openly-disjoint version 
from the directed one. 

To see the directed openly-disjoint version construct a new digraph D’ from D, 
as follows. Replace each node v (v #5, t) of D by a pair of new nodes v’ and uv”. 
Let (v’, v") be an arc of D’ and for an arc (u,v) of D let (u", vv’) be an arc of D’. 
Arc-disjoint st-paths in D’ correspond to openly disjoint paths in D. Moreover, if 
there are k arcs in D’ covering all st-paths, then these arcs can be assumed to be 
of type (v’,v”) and this set of arcs corresponds to a set of k nodes of D covering 
all st-paths. O 


There exist other versions of Menger’s theorem. For example, given a graph 
and two disjoint subsets S, T of its node set, there are k disjoint paths between S 
and T if and only if there are no k—1 nodes covering all such paths. By 
elementary construction this result easily follows from the original Menger 
theorem. Yet another version, sometimes called the fan lemma, is as follows. Let 
s be a node of a graph and 7 a subset of nodes not containing s. There are k paths 
connecting s and some elements of T so that they are disjoint except at s if and 
only if there are no k — 1 nodes in V—s covering all such paths. 

Hoffman found the following unifying approach to the different versions of 
Menger’s theorem. Let S be a finite set and let 9 be a set of ordered subsets of 5. 
We call the members of # paths. Suppose that for any two paths P= 
{Pi, P2,+-+, P,} and T= {t,,t,,...,¢,} sharing an element p; =f, the sequence 

Pry-- +> Pirtjays.-+>t;} includes a path. 


Theorem 7.2 (Hoffman 1974). The maximum number of disjoint paths is equal to 
the minimum number of elements covering all the paths. 


As a consequence, a Menger-type theorem can be formulated for disjoint 
shortest paths. 
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Corollary 7.3. The maximum number of openly disjoint shortest paths from s to t 
is equal to the minimum number of nodes covering all shortest paths from s to t. 


This type of min-max results fails to be true for paths of bounded length in 
general, however we have the following. 


Theorem 7.4 (Lovasz et al. 1978). Let G = (V, E) be an undirected graph with two 
specified non-adjacent nodes s and t. The maximum number of openly disjoint 
st-paths of length at most k (k > 2) is at least 2/k times the minimum number of 
nodes (distinct from s, t) covering all st-paths of length at most k. 


A possible generalization of Menger’s theorem (undirected, openly disjoint) is 
the following. In an undirected graph given a subset of nodes T, what is the 
maximum number of disjoint paths connecting nodes of T. This problem was 
answered by Gallai (see Corollary 8.25). Mader found a min-max formula for the 
maximum number of openly disjoint paths with end nodes in T. (See Theorem 
8.24.) 


7.2. Global connectivity 


Let k be a positive integer. A graph G = (V, E) is called k-connected (sometimes 
k-node-connected) if |V| > and for any subset X CV with less than k elements 
G(V — X) is connected. G is called k-edge-connected if deleting any subset of 
edges of less than k elements leaves a connected graph. This is equivalent to 
requiring d°(X)2=k for any 6¥ X CV. A digraph D=(V, A) is called k-arc- 
connected (often the term strongly k-arc-connected is used) if deleting any subset 
of arcs of less than k elements leaves a strongly connected digraph. This is 
equivalent to saying that 6 "(X) =k for any @4 X CV. We will say that a digraph 
D with a specified node s is k-arc-connected from s if 6 (X) =k for any subset 
X CV~—s. By Menger’s theorem this is equivalent to the property that there are k 
arc-disjoint paths from s to any other node. This is, in turn, equivalent to that 
there are k spanning arborescences rooted at s (Theorem 6.6). Finally, let us call 
a digraph strongly k-connected if deleting any subset of nodes of less than k 
elements leaves a strongly connected digraph. 

These definitions reflect one of our intuitive expectations for a graph to be 

“pretty much connected”: it is not possible to destroy the connectivity by taking 
away a small part of the graph. Another natural definition for high connectivity is 
that there are many disjoint paths between any pair of nodes. The following 
result, due to Whitney, says that these two approaches coincide. (One may have 
other intuitions for high connectivity: for example, if the graph contains k 
edge-disjoint spanning trees. Let us call such a graph k-tree-connected. For a 
characterization of k-tree-connected graphs, see Theorems 6.13 and 7.11). 


Theorem 7.5 (Whitney 1932). A graph on more than k nodes is k-connected if and 
only if there are k openly disjoint paths between any two nodes. A graph (digraph) 
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is k-edge-(arc-)connected if and only if there are k-edge-(arc-)disjoint paths from 
any node to any other. 


Proof. The second part immediately follows from Theorem 7.1. To see the 
non-trivial direction of the first part let s and ¢ be two nodes. If they are not 
adjacent, we are done by Menger. Otherwise let e be an edge connecting s and ¢. 
If there are no k —1 openly disjoint st-paths in G~—e, there is (by Menger) a 
subset of nodes of at most k — 2 elements not containing s and ¢ for which s and ¢ 
belong to different components of G —e—X. Since G has more than k nodes 
either X¥ +s or X +1 is a disconnecting set of k—1 elements. O 


Robbins’ (1939) theorem (Corollary 2.13) described a relation between 2-edge- 
connected graphs and strongly connected digraphs. The following generalization 
is due to Nash-Williams (1960). 


Theorem 7.6. An undirected graph is 2k-edge-connected if and only if it has an 
orientation which is k-arc-connected. 


Actually Nash-Williams proved a much stronger result: 


Theorem 7.7. An undirected graph has an orientation such that for every ordered 
pair (x,y) of nodes there are |A(x, y)/2] arc-disjoint xy-paths where A(x, y) 
denotes the maximum number of edge-disjoint xy-paths. 


One may be interested in the existence of a (strongly) k-connected orientation. 
The (“sufficiency” part of the) following conjecture is open even for k =2. 


Conjecture 7.8. A graph G = (V, E) has a k-connected orientation if and only if 
deleting any subset X of j nodes (Q<j<k-—1) results in a 2(k — j)-edge- 
connected graph. 


In section 2 we saw an ear-decomposition theorem for strongly-connected 
digraphs. This can be interpreted so that every strongly connected digraph can be 
obtained from a node by consecutively adding and subdividing arcs. Subdividing 
an arc (%, v) with a new node z means that we replace arc (u, v) by arcs (u, z) and 
(z,v) where z is a new node. The following generalization is due to Mader. 


Theorem 7.9 (Mader 1982). A digraph D is k-arc-connected if and only if D can 
be obtained from a node by adding arcs (connecting old nodes) and applying 
operation O,: 

Operation O,: Pick up k arbitrary arcs, subdivide them by nodes z,,...,Z, 
and identify z,,...,2, to a new node z. 


Corollary 7.10. A digraph D (with 6 (s) =0) is k-arc-connected from a node s if 
and only if D can be built up from s by repeated applications of the following 
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operation: For some j, 0<j<k, first apply O, and add then k—j new arcs 
entering Z. 


Using this characterization one can easily derive Edmonds’ Theorem 6.6 on 
arc-disjoint arborescences (see Mader 1983). Also we have the following. 


Corollary 7.11. A graph G=(V, E) is k-tree-connected if and only if G can be 
obtained from a node by sequentially adding edges (connecting old nodes) and 
applying the following operation: choose j (0 <j <k) distinct edges and k — j (not 
necessarily distinct) nodes, subdivide the j edges by j nodes, identify these j nodes 
to a new node z and connect z and the k —j old nodes by k — j edges. 


Theorems 7.6 and 7.9 imply a result of Lovasz. 


Theorem 7.12 (Lovasz 1979, Problem 6.53). An undirected graph G is 2k-edge- 
connected if and only if G can be obtained from a node by adding edges and 
applying Operation Q,: pick up k arbitrary edges, subdivide them by nodes 
Z,,---,2%, and identify z,,...,2, into a new node z. 


Notice on the other hand that Theorem 7.12 implies Theorem 7.6. What about 
(2k + 1)-edge-connected graphs? We need two other operations. 

Operation Q{: Proceed as in Q,, then choose a node x of the graph and add a 
new edge joining x and the new z. 

Operation Q7: Proceed as in Q, thereby constructing G’, choose k distinct 
edges ¢},...,e, of G’ not all incident to z, subdivide each e; by a node z,, 
identify the z;’s into a new node z’ and add a new edge joining z and z’. 


Theorem 7.13 (Mader 1978a). A graph G is (2k + 1)-edge-connected if and only uf 
G can be obtained from a node by successive addition of edges and repeated 
applications of Q; and Qj. 


Let us be given a k-edge-connected undirected graph G=(V,£). It is not 
difficult to prove that for k odd the k-element cuts are pairwise non-crossing. 
(Two cuts A(X), A(Y) are called crossing if none of X-Y, Y—X, XNY, 
V—(XUY) is empty). Dinits et al. (1976) showed that the structure of minimum 
cuts can also be described when & is even. Let k = 21. 

Let us call a 2-edge-connected (loop-free) graph T = (U, F) a circuit-tree if each 
block of T is a (possibly 2-element) circuit. Intuitively, T consists of edge-disjoint 
circuits which are joined to each other in a tree-like manner. 

Any minimum cut of T consists of two edges belonging to the same circuit of T. 
If we replace each edge of T by / parallel edges, we obtain a 2/-edge-connected 
graph T’. Clearly, the minimum cuts of T’ correspond to the minimum cuts of T. 

The content of the next theorem is that the structure of minimum cuts of every 
2l-edge-connectcd graph can be described with the help of a circuit-tree. 
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Theorem 7.14 (Dinits et al. 1976). Let G=(V, E) be a 2l-edge-connected graph. 
There exists a circuit-tree T=(U,F) and a mapping ¢:V—U so that for every 
minimum cut of T determined by a partition [X,U—X] of U the cut of -G 
determined by the partition [pg '(X),¢ (U-X)] is a minimum (i.e., of 2l 
elements) cut of G and every minimum cut of G arises this way. 


We close this subsection by mentioning two results on constructing k-edge 
connected graphs and k-arc-connected digraphs. 


Theorem 7.15. (a) (Watanabe and Nakamura 1987) An undirected graph G = 
(V, E) can be made k-edge-connected (k > 2) by adding at most y new edges if and 


only if 
Dd (k -d(X):i=1,..., <2y 


holds for every family {X;} of disjoint non-empty subsets of V. 
(b) (Frank 1992a) A directed graph D = (V, A) can be made k-edge-connected 
(k 21) by adding at most y new edges if and only if 


(k-8°(%):f=1,...,08y and (k-8*°(X%):i=1,.. ,)<y 
holds for every family {X,} of disjoint non-empty subsets of V. 


In Frank (1992a) the first part has been generalized to the case when the 
prescribed edge-connectivity between each pair of nodes is arbitrary (and not 
necessarily the same number k). Also augmentations with minimum node-costs 
are tractable. For a survey see Frank (1994), 


7.3. 3-connected graphs 


eT] 
In the preceding subsection we saw how to construct all the k-edge-connected 
graphs. As far as node-connectivity is concerned there is an ear-decomposition 
result for 2-connected graphs (Proposition 2.6). Tutte (1966) developed a theory 
for decomposing a 2-connected graph into 3-connected “components”. (The 
reader is refered to the original work since even formulating the result needs too 
much space.) 

Unfortunately there are no known analogous constructions for k-connected 
graphs, in general. For 3-connectivity, however, the situation is much better. In 
order to be able to work with 3-connected graphs we must have “‘reductions” that 
preserve 3-connectivity. Two simple reductions are deleting and contracting an 
edge e. We use the notation G—e and G/e for the graphs arising from G by 
deleting and contracting e, respectively. For 2-connectivity we saw (Proposition 
2.7) that any edge of a 2-connected graph can be either deleted or contracted 
without destroying 2-connectivity. For 3-connectivity one has the following. 


Theorem 7.16 (Tutte 1966, Theorem 12.65). if e is any edge of a 3-connected 
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graph on at least four nodes, then either Gle is 3-connected or G-—e is a 
subdivision of a 3-connected graph. 


Proof (Thomassen 1984). Suppose G/e is not 3-connected for an edge e=xy. 
Then there is a node z such that G’ = G — {x, y, z} is not connected. What we 
have to show is that there are three openly disjoint paths between x’ and y' not 
using e for any two nodes x’, y’ distinct from x and y. In G there are three openly 
disjoint paths connecting x’ and y’. If one of these contains e, then x’ and y’ 
belong to the same component of G’ or one of x’, y’ equals z. Then there is a 
component C of G’ not containing x’ and y’ and there is a path P in C + {x, y} — 
e connecting x and y. But now replacing e by P we obtain three openly disjoint 
paths between x’ and y’ not usinge. O 


We say that an edge e of a 3-connected graph is contractible if G/e is 
3-connected. The next result, due to Tutte (1961b) shows that there always exists 
a contractible edge. 


Theorem 7.17. A 3-connected graph G=(V,E) with at least five nodes has a 
contractible edge. 


Proof (sketch, Thomassen 1980b). If an edge xy is not good, there is a node z 
such that {x, y, z} is a disconnecting set of G. Choose xy in such a way that the 
largest component C of G— {x, y,z} is as big as possible. Let C’ be another 
component of G — {x, y,z} and u €V(C’) such that uz € E. The contraction of 
uz leaves a 3-connected graph. UO 


One way to generate new 3-connected graphs is applying the following splitting 
operation (that may be considered as a converse to contracting an edge). Note 
that the graph may have parallel edges. 

Operation S: Choose a node u of degree at least four. Partition the edges 
incident to v into two parts E, and E, so that |{u: wv © E,}|=2 for i=1, 2. 
Replace v by two nodes v, and V2, replace each edge vu € E, by an edge uu 
(i= 1,2) and join v, and v, by an edge. 

Theorem 7.17 provides a kind of converse. 


Corollary 7.18. A (not necessarily simple) graph G is 3-connected if and only if G 
can be obtained from K, by repeatedly adding edges (connecting old nodes) and 
applying operation S. 


A slight drawback of this theorem is that, though parallel edges do not play any 
role m 3- ~comnectivity, it may happen that even if the 3-connected graph G to be 
constructed is simple the graphs occurring in the intermediate steps are not. This 
is the case, for example, if G is a wheel (bigger than K,). (A wheel is a circuit 
Plus an extra node connected to all nodes of the circuit.) In a sense wheels are the 
only essential examples of this type since Tutte (1961b) proved the following. 
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Theorem 7.19. A simple graph is 3-connected if and only if G can be obtained 
from a wheel by repeatedly adding edges connecting non-adjacent old nodes an 
applying operation S. ; 


This result is a reformulation of the following one. 


Theorem 7.20 (Tutte 1961b). A 3-connected graph G is either 
(a) @ wheel or 
(b) contains an edge e for which G — e is 3-connected or 
(c) contains a contractible edge which is not in a triangle. 


Proof (Thomassen 1984). Suppose that neither (b) nor (c) occurs. By Theorem 
7.17 there is a triangle T. Let V(T) = {x, y, z}. We claim that at least two nodes of 
T have degree three. Suppose, indirectly, x has neighbours x,, x, not in T and y 
has neighbours y,, y, not in T. Since G — xy is not 3-connected, there is a node z’ 
such that G' = G — {z,z'} — xy is not connected. Let G, and G, denote the 
components of G’ containing x and y, respectively. Since G — y is 2-connected, 
there are two disjoint paths P,, P, from {y,, y,} to {z’, z}. One of x, and x,, say 
x,, is distinct from z'. In G — x there are two openly disjoint paths Q,, Q, from x, 
to {z’,z}. Since P,, P, are in G,+{z’,z} and Q,,Q, are in G, + {2’,z}, 
P, =(P; + Q, + @Q,) is a path from y to z. Now P,, P, and P, = (yx, xz) are three 
openly disjoint paths between y and z, therefore edge yz satisfies (b), a 
contradiction. 

So T has at least two nodes of degree 3, say x and y. If x’ (#y,z) is a 
neighbour of x, then it is easy to see that G/xx’ is 3-connected. Hence xx' must 
be in a triangle. The third node of this triangle must be z (unless G = K,) and we 
conclude that x’ has degree 3. We then consider the neighbour x” of x’ distinct 
from x and z and continuing this way we see that G is a wheel with center z. O 

4 

Obviously Tutte’s theorem, in turn, implies Theorem 7.17. Theorem 7.17 is a 
highly powerful device in proving results apparently not related to connectivity. 
For example, with the help of it, Thomassen found an easy proof of Kuratowski’s 
theorem on planarity of graphs as well as Tutte’s theorem stating that every 
3-connected planar graph has a convex representation in the plane. Note that 
Tutte originally used a different approach. He relied on the following result. 


Theorem 7.21 (Tutte 1963). Every edge uv of a 3-connected graph is contained in 
two peripherical circuits C, and C, (that is a chordless circuit the deletion of which 
results in a connected graph) for which V(C,)NV(C,) = {u,v} A 3-connected 
graph is planar if and only if every edge is contained in exactly two peripherical 
circuits. 


Note that the first part of the theorem is straightforward for planar graphs as 
the two circuits determined by the faces incident to uv satisfy the requirements. 
Here are three variations of Theorem 7.17. 
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Theorem 7.22 (Thomassen and Toft 1981). Every simple 3-connected graph with 
no triangles contains a circuit C such that every edge of C is contractible. 


Theorem 7.23 (Halin 1969a). Jf v is a node of degree 3 in a 3-connected graph, 
then there is an edge uv for which G/uv is 3-connected. 


Theorem 7.24 (Ando et al. 1987). Every 3-connected graph G=(V,E) with 
|V|>4 has at least |V|/2 contractible edges. 


There is a counterpart of Theorem 7.17. 


Theorem 7.25 (Barnette and Griinbaum 1969, Titov 1975). Every 3-connected 
graph with at least five nodes has an edge e such that G — e is a subdivision of a 
3-connected graph. 


Proof. If there is no such an edge, then by Theorem 7.16 every edge is 
contractible without destroying 3-connectivity. Let G' be a graph obtained by 
contracting an edge st. By induction there is an edge e of G’ such that G’—e isa 
subdivision of a 3-connected graph. If G —e is not a subdivision of a 3-connected 
graph, then there are nodes x, y such that G— {x, y}—e has a component 
consisting of two nodes and ¢ is incident to one of them, denoted by z. Since G is 
3-connected there is an edge f from z to {x, y}. But {z, x, y} is a separating set of 
G so the contraction of f destroys 3-connectivity. O 


Corollary 7.26. A graph G=(V,E) is 3-connected if and only if G can be 
obtained from K, by sequentially adding edges and applying the following 
operations: 

(a) Pick up two non-parallel edges, subdivide them by nodes u, v and join u and 
v by an edge. 

(b) Pick up an edge xy and a node v (#x, y), subdivide xy by a node u and join 
u and v by an edge. 


Barnette and Griinbaum (1969) used Theorem 7.25 to provide a short proof of 
a theorem of Steinitz stating that the 1-skeletons of the 3-dimensional polytopes 
are precisely the 3-connected planar graphs. 

Finally, here is a theorem consisting deletion of nodes rather than edges. 


Theorem 7.27 (Chartrand et al. 1972). Every 3-connected graph of minimum 
degree at least 4 has a node v such that G —v is 3-connected. 


7.4. Preserving connectivity 


In the preceding subsection on 3-connected graphs we have encountered theorems 
Saying that a 3-connected graph remains 3-connected under certain operations 
(Theorems 7.16, 7.17, 7.27). In this part we survey further operations that 
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preserve connectivity properties of graphs. As far as node-connectivity is 
concerned, only a few general reduction results are known. Here is one. 


Theorem 7.28 (Thomassen 1981). A k-connected graph with no triangle contains 
an edge whose contraction results in a k-connected graph. 


The situation is much better for edge connectivity. Let G be a graph (or 
digraph) and let e=uz and f=zv be two edges (or arcs) of G incident to a 
common node z. We say that a pair {e, f} is split off (at z) if we replace e and f 
by a new edge (arc) uv. The resulting graph (digraph) is denoted by G ee 

The following fundamental result is due to Mader (1978a). 


Theorem 7.29. Let G =(V, E) be a (not necessarily simple but loopless) graph and 
z anode of degree at least 4 so that there is no cut-edge incident to z. There exist 
edges e, f incident to z such that A(x, y; G) = A(x, y; G*) for every pair of distinct 
nodes x, y different from z. (Here A(x, y;G) denotes the maximum number of 
edge-disjoint paths in G connecting x and y). 


A relatively simple proof can be found in Frank (1992b). Mader used it to 
derive Nash-Williams’ orientation result (Theorem 7.7). Theorem 7.13 was also 
obtained from this result. Theorem 7.6 follows already from a weaker form of 
Theorem 7.29 due to Lovasz (1979, Problem 6.53): If z is a node of even degree 
and A(x, y: G) = k whenever x, y €V — z, then for any edges e incident to z there 
is an edge f incident to z such that A(x, y: G’) =k for every x, y (¥z). While the 
directed analogue of Theorem 7.29 is not true in general the counterpart of 
Lovasz’ version holds, as follows. 


Theorem 7.30 (Mader 1983). Let D = (V, A) be a digraph and z a node for which 
5 (z)=8"(z). Suppose that A(x, y: D)=k for every x, yEV-—z. For any arc 
entering z there is an arc f leaving z such that A(x, y:D%)=k for every x, 
yEV—-z. 


This result is one ingredient to Theorem 7.9. The other one will be mentioned 
later (Theorem 7.41). 

If we restrict ourselves to Eulerian digraphs (that is, 5 (z) =6*(z) for every 
node z) then the counterpart of Theorem 7.28 does hold (although it is much 
easier). 


Theorem 7.31 (Frank 1989, Jackson 1988). Let D =(V, A) be an Eulerian digraph 
and z a node. For any arc e entering z there is an arc f leaving z such that 
A(x, y:D) = A(x, y: D“) for every x, yEV—z. 


Sometimes we can maintain connectivity properties under “‘bigger’’ reductions. 


Theorem 7.32 (Thomassen and Toft 1981). Every 3-connected graph with mini- 
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mum degree at least 4 contains a circuit whose contraction results in a 3-connected 
graph. 


Theorem 7.33 (Jackson 1980). A simple 2-connected graph of minimum degree at 
feast 4 contains a circuit C such that the removal of its edges leaves the graph 
2-connected and, in addition (Thomassen and Toft 1981), G — V(C) is connected. 
The first part holds for not necessarily simple planar graphs (Fleischner and 
Jackson 1985). 


Theorem 7.34 (Thomassen and Toft 1981). Every simple 2-connected graph G of 
minimum degree at least 3 contains an induced circuit C (that is a circuit without 
chords) such that G — V(C) is connected. 


The same conclusion was proved by Tutte for 3-connected graphs (see Theorem 
7.21). 


Theorem 7.35 (Thomassen and Toft 1981). If G is a 2-connected graph with 
minimum degree at least 5, then G contains an induced circuit such that G— V(C) 
is 2-connected. If G is 3-connected with minimum degree at least 4, then G has a 
circuit such that G — V(C) is a block. 


For higher node connectivity we have the following results. 


Theorem 7.36 (Thomassen 1981). Every (k +3)-connected graph G contains an 
induced circuit C such that G — V(C) is k-connected. 


Theorem 7.37 (Egawa 1987). Every (k+2)-connected triangle-free graph G 
contains an induced cycle C such that G — V(C) is k-connected. 


Theorem 7.38 (Mader 1974a). Every k-connected graph G with minimum degree 
at least k +2 contains a circuit C such that G — E(C) is k-connected. 


For edge-connectivity Mader proved the following. 
Theorem 7.39 (Mader 1985a). For every pair of nodes s, t of a connected graph 
there is a path P connecting s and t such that deleting the edges of P the local 
connectivity A(x, y), for any pair x, y of nodes, can decrease by at most two. 


A directed counterpart of this result is also due to Mader. 


Theorem 7.40 (Mader 1981). For every pair of nodes s, t of a k-arc-connected 
digraph there is a path P from s to t such that deleting the arcs of P leaves the 
digraph (k — 1)-arc-connected. 
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7.5. Minimal and critical graphs 


In graph theory it is a typical way to prove things by starting with a graph critical 
(or minimal) with respect to a certain property. For example, if this property is 
“having no perfect matching” we arrive at the concept of factor-critical graphs 
(see chapter 3). Therefore, it is a general program to investigate “critical” graphs. 
Typically, we use the adjective ‘‘minimal” (resp. critical) if deleting any edge 
(resp. node) destroys the property considered. 

Call a graph (digraph) G minimally k-edge-connected (k-arc-connected) if G is 
k-edge-connected (k-arc-connected) but G-—e is not for each edge (arc) e. 
Similarly, a graph G is minimally k-connected if G is kK-connected but G — e is not 
for each edge e. Strongly minimally k-connected (or, briefly, minimally k- 
connected digraphs) are defined analogously. 

A k-connected (k-edge-connected) graph is called critically k-connected (criti- 
cally k-edge-connected) if deleting any node destroys k-connectivity (k-edge- 
connectivity). The corresponding notions for digraphs are defined analogously. 

Actually there are eight classes to be investigated corresponding to the possible 
choices: directed or undirected graph, edge- (arc-)connectivity or node-connectivi- 
ty, critical or minimal. There are interesting results concerning each but one of 
these classes (critically k-arc-connected digraphs have not yet been investigated). 
Here we list only the most important theorems. The starting point is a theorem 
due to Halin. 


Theorem 7.41. (a) (Halin 1969a) Every minimally k-connected graph has at least 
one node of degree k. 

(b) Every minimally k-edge-connected graph G (with at least two nodes) 
contains a node of degree k. 


Mader extended these results. “ 
Theorem 7.42 (Mader 1972), Every minimally k-connected graph contains at least 
k + 1 nodes of degree k. Furthermore, every circuit of G contains a node of degree 
k (Mader 1971b). Every minimally k-edge-connected simple graph contains at least 
k +1 nodes of degree k. 


Another interesting generalization of Halin’s result is also due to Mader (1973). 


Theorem 7.43. In a simple graph if every degree is at least k +1, there are two 
adjacent nodes s and t which are connected by k + 1 openly disjoint paths. 


Let us see critical graphs. 
Theorem 7.44 (Mader 1986). Every critically k-edge-connected simple graph G 


contains a node of degree k. Furthermore, G contains a node x such that G — x i 
(k — 1)-edge-connected. 
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To formulate results on critically k-connected graphs we need the following 
concepts. In a k-connected graph G a set X is a separating set if G—X is not 
connected. A subset C of nodes called an end of G if C is one of the components 
iniG—X for a k-element separating set X. An atom is an end with smallest 
cardinality. The name “atom” is justified by the following. 


Theorem 7.45 (Mader 197la). Jf C is an atom and K is an end, then either 
KNC¥#¥86 or CCK. 


It is not true, in general, that a critically k-connected graph G has a node of 
degree k (or, equivalently, the atoms of G are of cardinality one). But one has the 
following. 


Theorem 7.46 (Mader 1985c). Every critically k-connected (not complete) graph G 
contains two disjoint ends with cardinality at most k/2. Furthermore, G contains 
four disjoint ends. 


Corollary 7.47 (Chartrand et al. 1972). Every critically k-connected simple (not 
complete) graph G contains a node of degree at most |3k/2] and this bound is best 
possible. Actually, G contains at least two such nodes (Hamidoune 1980 and 
Veldman 1983). 


What about directed graphs? 


Theorem 7.48 (Mader 1985b). Every minimally k-connected digraph contains at 
least k nodes of in-degree k and at least k nodes of out-degree k. 


Conjecture 7.49 (Mader 1979). Every minimally k-connected digraph contains a 
node of in-degree and out-degree k. 


The k-arc-connected version of this statement is true and plays a central role in 
constructing all k-arc-connected digraphs (Theorem 7.9). 


Theorem 7.50 (Mader 1974b). Every minimally k-arc-connected digraph contains 
at least two nodes having both in-degree and out-degree k. 


_ For critical strongly connected digraphs it is true again that there is a node of 
in-degree 1. More specifically, the following holds. 


Theorem 7.51 (Mader 1989). Every critical strongly connected digraph with at 
least four nodes contains four distinct nodes x,, x», Y;, ¥, for which 6 (x,)= 
8 °(y,) =1 @= 1,2). 


Theorem 7,52 (Mader 1991). Every critically k-connected digraph G contains a 
node s for which 5 (s) <2k — 1 or 8 *(s) <2k — 1. If G is antisymmetric (that is, if 
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(x, y) is an arc, then (y,x) is not), then G contains a node s for which 5 (s) < 
[3k —4] or 5*(s)< [3k —4]. These bounds are best possible. 


Actually this theorem is a consequence of a result of Mader that can be 
considered as a directed counterpart of Theorem 7.46. It is not true that the 
minimum in-degree in a critically k-connected digraph (k = 2) is at most 2k — 1. 


7.6. Connected subgraphs 


By Whitney’s Theorem 7.5 we know that a k-connected graph contains k openly 
disjoint paths connecting two specified nodes. It is a natural feeling that a highly 
connected graphs must contain some other type of subgraphs. In this subsection 
we briefly summarize such results. The starting point is a theorem by Dirac. 


Theorem 7.53 (Dirac 1960). In a k-connected graph G every subset of k nodes is 
included in a circuit. If, in addition, G is non-bipartite, every subset of k — 1 nodes 
is included in an odd circuit (Bondy and Lovasz 1981). 


Theorem 7.54 (Mesner and Watkins 1967). in a k-connected graph (k 23) a 
subset H of k +1 nodes is included in a circuit if and only if there is no set 
X CV—H with |X| =k such that each node in H belongs to a different component 
of G-X. 


Suppose we want much more: find k(k — 1)/2 openly disjoint paths between k 
specified nodes s,,...,5, (one path for one pair). Such a configuration can be 
considered as a subdivision of K, with principal nodes s,,... ,5,. With sufficiently 
high connectivity this property can also be guaranteed. 


Theorem 7.55 (Jung 1970, Larman and Mani 1970). If G is 2““~"? connected, 
then for distinct nodes s,,...,5, there is a subdivision of K, in G having 
S,,...,5, as principal nodes. 


Theorem 7.56 (Haggkvist and Thomassen 1982). In a k-connected graph every 
subset of k —1 independent edges is included in a circuit. 


Conjecture 7.57 (Lovasz). In a k-connected graph every subset of k independent 
edges is included in a circuit unless k is odd and the k edges disconnect the graph. 


For k = 3 this was shown by Lovasz. The following result, due to Lovasz (1977) 
and Gyori (1978), is about partitions of graphs or digraphs into connected parts of 
given size. 


Theorem 7,58. In a digraph (graph) let S = {s,,5,,...,5,} be a set of k nodes 
and n,,n,,...,n, positive integers such that ¥,n,=|V|. Suppose that for any 
v GV—S there are k paths from S to v pairwise disjoint except at v. There is a 
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partition {V,,V,,...,V,} of V into k parts such that V,O'S = {s,}, |V,| =n, and the 
digraph (graph) induced by V, contains an arborescence rooted at s, (is connected). 


7,7. Extremal results 


In this last subsection we briefly mention some extremal-type results concerning 
connectivity. More detailed accounts are found in Mader (1979) and in Bollobas, 
(1978). Let us start with a result on digraphs. 


Theorem 7.59 (Dalmazzo 1977). A minimal k-edge-connected digraph D = (V, A) 
on n nodes has at most 2k(n — 1) arcs. 


Proof. Let s be an arbitrary node of D. By Edmonds’ Theorem 6.6 there are k 
edge-disjoint spanning arborescences of root s. Let G, denote the union of these 
arborescences. Similarly, there are k edge-disjoint spanning co-arborescences of 
root s (a co-arborescence of root s is a directed tree such that re-orienting all of its 
edges results in an arborescence of root s). Let G, denote the union of these 
co-arborescences. Clearly, both G, and G, have k(m — 1) arcs and their union is 
k-edge-connected from which the result follows. O 


Note that the bound in the theorem is sharp as is shown by a digraph obtained 
from any tree by replacing each edge uv by 2k parallel arcs among which k are in 
one direction and the other k are in the other direction. 

Theorem 7.59 immediately implies that a minimal k-edge-connected graph on n 
nodes has at most k(n —-1) edges. Indeed, if we replace each edge by two 
oppositely directed arcs, we obtain a minimal k-edge-connected digraph and then 
Theorem 7.59 applies. 

Mader proved that for simple graphs a better bound exists. 


Theorem 7.60 (Mader 1974b). A minimal k-edge-connected simple graph on n 
nodes has at most kn — k(k + 1)/2 edges. 


Since a minimal k-edge-connected graph must not have a (k + 1)-edge-con- 
nected subgraph, Theorem 7.60 is an immediate consequence of the following. 


Theorem 7.61 (Mader 1974b). Every simple graph on n nodes with more than 
kn — k(k + 1)/2 edges has a (k + 1)-edge-connected subgraph. 


_Again, the bound is sharp as is shown by a graph constructed from a complete 
bipartite graph K, ,,-, by adding all the possible edges in the k-element part. 


8. Multicommodity flows and disjoint paths 


8.1. Problem formulation 


In this section we address the following problem, called the disjoint paths 
problem. Given a graph or a digraph and k pairs of nodes (s,,t,), 
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(S,,f2),---+(Sg,%&), find k pairwise openly disjoint paths connecting the corre- 
sponding pairs (s,, f,). If we are interested in finding edge-disjoint paths we speak 
about the edge-disjoint paths problem. In the book ‘Paths, Flows, and VLSI- 
Layout” (B. Korte et al., eds., Springer 1990), several survey papers are included 
related to the material of this section (Frank 1990, Schrijver 1990, Robertson and 
Seymour 1990]. __ 

A capacitated version of the edge-disjoint paths problem is the following. For 
every edge of the graph a non-negative capacity is specified and, similarly, for 
every pair to be connected a non-negative demand is given. The inveger 
multicommodity flow problem is that of finding as many paths between the 
corresponding terminals as their demands are so that every edge occurs in at most 
as many paths as its capacity. If we allow fractional paths as well, we speak about 
the multicommodity flow problem or, in short, multiflow problem. That is, a 
multicommodity flow is defined by paths P,, P,,...,P, and non-negative 
numbers A,,... A, such that each path P, is a path from s, to t, and for each edge 
e the sum of coefficients assigned to paths using e is at most the capacity of e¢. 

Actually this kind of problem can be considered as a feasibility problem. The 
maximization problem is that when no demands are specified and one is interested 
in finding a maximum number of paths connecting the corresponding terminal 
pairs. 

Sometimes it is convenient to mark the terminal pairs to be connected by an 
edge. The graph H=(U, F) formed by the marking edges is called a demand 
graph while the original graph G = (V, F) is the supply graph. Let us call a circuit 
of G+H a good circuit if it contains precisely one demand edge. In this 
terminology the edge-disjoint paths problem is equivalent to seeking for |F| 
edge-disjoint good circuits. 

The multiflow problem can be formulated as a linear program. One way to do 
so is as follows. Let A be a 0-1 matrix the rows of which correspond to the edges 
of G the columns correspond to the good circuits. An entry (i, j) is 1 if the édge 
corresponding to i is in the circuit corresponding to j and 0 otherwise. Similarly 
let B be a 0-(—1) matrix the rows of which correspond to the edges of H, the 
columns correspond to the good circuits. An entry (/,/) is —1 if the edge 
corresponding to i is in the circuit corresponding to j and 0 otherwise. (The 
Structure of B is simple: every column has exactly one non-zero entry.) The 
multiflow problem is equivalent to the following linear inequality system. Ax <1, 
Bx = —1, x20, where 1 and —1 are appropriately sized vectors of 1s and —1s, 
respectively. 

By Farkas’s lemma this system has no solution if and only if there is a vector w 
in R® and a vector z in R” such that }) (w(e):e EE) — » (2(f): f EF) <0 and 
such that ); (w(e): e € C —f) — z(f) 20 holds for every demand edge f and every 
circuit C for which CN F = {f}. Obviously, if there is such a w and z, then z can 
be chosen so as to satisfy z(f) = dist, (u,v) where f= uv and dist, (u,v) is the 
minimum w-weight of a path in G connecting the end nodes of demand edge f. 
We obtain the following. 


Theorem 8.1 (Iri 1970, Onaga and Kakusho 1971). The multiflow problem has a 
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solution if and only if the 
Distance criterion: >, (dist, (u,v): uu EF) > (w(e):e EE) (8.1) 
holds for every w€& R®. 


As general linear programs can be solved in polynomial time, so is the 
multiflow problem. Since the constraint matrix above has entries 0, +1 there is a 
strongly polynomial algorithm as well (Tardos 1986). This is why we concentrate 
only on integer multicommodity flows or disjoint paths. 

First we survey results concerning undirected graphs. 


Theorem 8.2 (Karp 1975). The undirected (edge-) disjoint path problem (when k is 
a part of the input) is NP-complete. 


Even et al. (1976) proved that the problem is NP-complete even in the special 
case when the demand graph consists of two sets of parallel edges. In other 
words, the integer 2-commodity flow problem is NP-complete. On the other hand 
we have the following very difficult result. 


Theorem 8.3 (Robertson and Seymour 1986b). For fixed k the undirected (edge- 
disjoint) disjoint paths problem can be solved in polynomial time. 


8.2. Characterizations for edge-disjoint paths 


First, let us concentrate on edge-disjoint paths. A natural necessary condition is 
the cut-criterion: 


Cut-criterion: d,(X)=d,(X) forevery X CV. 


Note that the cut-criterion is a special case of the distance-criterion. We call the 
difference d,(X) — d,,(X) the surplus of cut A(X) and denote it by s(X). A cut 
A(X) is called tight if s(X) =0. 

The cut criterion is not sufficient, in general, as the simple example.in fig. 8.1 
shows. It is sufficient, however, if the demand graph is a star (that is, the demand 
edges share a common endpoint). (This immediately follows from the undirected 
edge version of Menger’s theorem.) 


8, So 
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The next two simplest demand graphs are 2K, (a graph on four nodes with two 
disjoint edges) and C, (a triangle). The following characterization for 2K, is due 
to Seymour (1980c) and Thomassen (1980a). 


Theorem 8.4. Let G be a graph such that no cut edge separates both of the two 
terminal pairs (s,,t,) and (s,,t,). There are no two edge-disjoint paths between the 
corresponding terminals if and only if some edges of G can be contracted so that 
the resulting graph G’ is planar, the four terminals have degree two while the other 
nodes are of degree 3 and the terminals are positioned on the outer face in this 
order: S,,S25t, ty. 


Figure 8.2 shows a typical example where the two edge-disjoint paths do not 
exist. 

If we want k; paths between s, and f, (i= 1,2) the problem becomes NP- 
complete. The situation is much better for the other special H mentioned above. 


Theorem 8.5 (Seymour 1980c). Jf the demand graph H consists of three sets of 
parallel edges between three nodes v,, v, and v,, the edge-disjoint paths problem 
has a solution if and only if the cut criterion holds and 


qv, UV, UV,) <s(V,) + s(V,) + s(V5) 


for every choice of disjoint sets V, with v,; EV, (i= 1,2,3) where s(X) denotes the 
surplus and q(X) denotes the number of components C in G—X for which 
dg(X) + d,,(X) is odd. 


This result is a rather easy consequence of ‘ale of Mader (Theorem 8. 23 
below) on edge-disjoint T-paths (when |7| = 

Let us call a set X, given G and H, an ea set and the cut A(X) an odd cut 
(with respect to G+ H) if d,(X) + d,,(X) is odd (or equivalently, the surplus 
s(X) is odd). A basic feature of odd cuts is that in any solution to the 
edge-disjoint paths problem an odd number of edges of an odd cut, in particular, 
at least one edge, will not be used. 

What if there are no odd cuts at all, that is, G+H is Eulerian? The cut 
criterion is still not sufficient as is shown in fig. 8.3. Even worse, Middendorf and 
Pfeiffer (1990) proved that the edge-disjoint paths problem is NP-complete even 
if G+ H is Eulerian. 

However, in the special cases listed below the cut criterion proves to be 
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sufficient. Given a demand graph H =(V, A), H’ will denote the graph arisen 
from H by replacing each set of parallel edges by one edge. 


Theorem 8.6. Suppose that G+H is Eulerian. In the following cases the cut 
criterion is necessary and sufficient for the solvability of the edge-disjoint paths 
problem. 

(a) H’ is 2K, (Rothschild and Whinston 1966b). 

(b) H’ consists of two stars (Papernov 1976, Seymour 1980a, Lomonosov 
1985). 

(c) H’ is K, (Papernov 1976, Seymour 1980a, Lomonosov 1985). 

(d) H’ is C, (Lomonosov 1985). 

(e) G is planar and each terminal is on one face (Okamura and Seymour 1981). 

(f) G is planar and there are two faces F,, F, such that each demand edge 
connects two nodes of either F, or F, (Okamura 1983). 

(g) G+H is planar (Seymour 1981). 

(h) G is planar and there are two specified inner faces C, and C, of G. The 
demand edges s,t,,...,5,t, are positioned in such a way that each s, is on C,, each 
t; is on C, and their cyclic order is the same (Schrijver 1989). 


Note that part (g) of this theorem immediately follows (by planar dualization) 
from a theorem by Seymour asserting that a +1 weighted bipartite graph (planar 
or not) has no circuit of negative total length if and only if the edge set can be 
partitioned into cuts such that each cut contains at most one negative edge. This is 
an equivalent formulation of Seymour’s theorem on the maximum number of 
disjoint 7-cuts. 

As we mentioned earlier the cut criterion is not sufficient, in general, even if 
G +H is Eulerian. Sometimes the stronger distance criterion (8.1) helps. 


Theorem 8.7 (Karzanov 1987). Suppose that G+ H is Eulerian and the demand 
edges form a graph arising from K, by adding parallel edges. Then the distance 
criterion is necessary and sufficient for the solvability of the edge-disjoint paths 
problem. (in other words, if there is a fractional solution, there is an integral one.) 


Theorem 8.8 (Karzanov 1994). Suppose that G is planar, G+ H is Eulerian and 
each demand edge connects two nodes of one of three specified faces of G. Then 
the distance criterion is necessary and sufficient for the solvability of the edge- 
disjoint paths problem. 


Theorem 8.5 provided an example where parity played a basic role in a good 
characterization. Here are two more cases. 


Theorem 8.9 (Frank 1990). Suppose that G +H is planar and the demand edges 
are on two faces of G. The edge-disjoint paths problem has a solution if and only if 
the cut criterion holds and dg, (XY) is even for every pair of tight sets X, Y. 


164 A. Frank 


This theorem is a generalization of an earlier theorem of Seymour (1981) where 
H consisted of two sets of parallel edges. Seb6 (1993) proved that if G+ A is 
planar and the number of demand edges nodes is bounded by a constant, then 
there is a polynomial time algorithm to solve the integer multiflow problem. 
Another result, due to Schnijver (1990) asserts, that if G + #7 is planar and the 
number of faces covering all the terminal nodes is bounded by a constant, then 
the edge-disjoint paths problem is polynomially solvable. On the other hand, the 
problem is NP-complete if there is no such a bound (Middendorf and Pfeiffer 
1990). 


Theorem 8.10 (Frank 1985). Suppose that G is planar, the terminals are on the 
outer face and the degree of every node not on the outer face is even. The edge 
disjoint paths problem has a solution if and only if Y s(C,) 2 4/2 for every family 
(C,,C2,..-,C,) of cuts (k <|V|) where q denotes the number of odd components 
in G—C,-—C,— +++ —C, and s(C) is the surplus of C. 


To close this subsection we mention a theorem by van Hoesel and Schrijver 
(1986) where topology plays a role. 


Theorem 8.11. Let G be a planar graph embedded in R’. Let O denote the interior 
of the unbounded face and I the interior of a specified bounded face. Let C,, 
C,,..., C, be curves in R? — (IU O) each of which connects a node on IU O with 
a node on IUO so that for each node v of G the degree of v has the same parity as 
the number of curves ending at v. Then there exist pairwise edge-disjoint paths P,, 
P,,...,P, in G so that P, is homotopic to C, in R’ — (OUT) (i= 1,2,...,&) if 
and only if for each dual path Q from IU O to 1U O the number of edges in Q is 
not smaller than the number of times Q necessarily intersects the curves C,. 


Note that this theorem generalizes part (e) of Theorem 8.6. It is an open 
problem to find a common generalization of Theorems 8.6(f) and 8.11. This last 
theorem is a prototype of theorems belonging to the area one may call homotopic 
paths packing. An excellent survey of this topic occurs in Schrijver (1990). 


8.3. Sufficient conditions for edge-disjoint paths 


We call a graph k-linked on the edges if for any choice of k pairs of terminals 

there are k edge-disjoint paths connecting the corresponding terminal pairs. 
Theorem 8.4 implies that a 3-edge-connected graph is 2-linked on the edges. 

Actually such a graph is 3-linked as the following even stronger result shows. 


Theorem 8.12 (Okamura 1984). In a graph three terminal pairs (s,, t;) (i= 1, 2, 3) 
are specified such that for each i there are three edge-disjoint paths connecting 5; 
and t,. Then there are s,t,-paths (i = 1,2,3) pairwise edge-disjoint. 
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Okamura’s theorem is an answer to the following conjecture of Thomassen, 
when & = 3. 


Conjecture 8.13 A k-edge-connected graph is X-linked on the edges if k is odd 
and (k — 1)-linked on the edges if k is even. 


Note that by Tutte’s Theorem 6.14 a (2k)-edge-connected graph always has k 
edge-disjoint spanning trees and therefore it is k-linked on the edges. 
The following theorem gets very close to the conjecture. 


Theorem 8.14 (Huck 1991). A k-edge-connected graph is (k —1)-linked on the 
edges if k is odd and (k — 2)-linked on the edges if k is even. 


We note that Thomassen’s conjecture is open for k = 5. 
In certain cases the cut condition is not strong enough to ensure the existence of 
the required paths but the demands can almost be met. 


Theorem 8.15 (Korach and Penn 1992). Suppose that G+ H is planar, for each 
terminal pair (s,,t,) an integer demand d, is given and the cut condition holds with 
respect to d. Then there are d,— 1 paths connecting s, and t; ‘for each terminal pair 
such that all these paths are pairwise disjoint. 


Theorem 8.16 (Itai and Zehavi 1984). Assume that in a graph G (s,,1,) are 
terminal pairs (i = 1,2) such that there are k edge-disjoint paths connecting s, and t, 
(i=1,2). Then for each m, 0<m<k there are k edge-disjoint paths P, S,, 
$35-++5Sns Qi, Qo,--->Qz-m-; Such that each S, connects s, and t,, each Q, 
connects s, and t, and P connects either s, and t, or S, and t,. 


8.4. Node-disjoint paths 


We call a graph k-linked if for any choice of k pairs of terminals there are k 
openly disjoint paths connecting the corresponding terminal pairs. A counter-part 
of the cut-condition is: 

Node-cut condition: No subset S of nodes can separate more than |,S| terminal 
pairs. 

This condition is sufficient if the terminal pairs share a common node (a version 
of the node-Menger theorem) but not in general. 


Theorem 8.17 (Thomassen 1980a, Seymour 1980c). Let G be a graph such that no 
cut node separates s, from t, ands, from t,. There are no disjoint paths between s, 
and t, and between s, and t, if and only if G arises from a planar graph G', where 
the four terminals are on the outer face in this order s,, S>, t,, t2, by placing an 
arbitrary graph into some faces of G' bounded by two or three edges. 


The problem was solved algorithmically by Shiloach (1980). 
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Corollary 8.18 (Jung 1970). A 4-connected non-planar graph is 2-linked. A 
6-connected graph is 2-linked. 


Here the second statement follows from the first one since a planar graph 
always has a node of degree at most 5. Note that there is a 5-connected planar 
graph that is not 2-linked. For higher connectivity we have the following. 


Theorem 8.19 (Jung 1970, Larman and Mani 1970). A 2°“ connected graph is 
k-linked. 


It is not known if 2* can be replaced by a linear bound. The natural 2k + 2 is 
not enough as can be seen from a K;,_, with edges x, y,,...,X,y, removed (an 
example due to Strange and Toft 1983). 

The following pretty result is not difficult to prove. 


Theorem 8.20 (Robertson and Seymour 1986a). Suppose that G is planar and the 
terminals are on the outer face. The disjoint paths problem has a solution if and 
only if the node-cut condition holds and there are no two “‘crossing”’ terminal pairs 
(that is, any two pairs (s,,t,) and (s,,t,) are in this order on the outer face; 
$15 ty) 59,¢). 


Robertson and Seymour also found a characterization for the disjoint paths 
problem when G is planar and the terminals are positioned on two specified faces. 


8.5. Maximization 


So far we have studied multicommodity flow problems of feasibility type. One can 
also be interested in the maximization form: Given a graph G=(V, £) with 
non-negative integer capacity function c on the edges and a set of terminal pair 
(S,,¢,),..-5(S,,¢,), find flows between s, and ¢, (i =1,2,...,) that maximize 
the sum M of flow values under the condition that for each edge e the sum of 
edge-values of flows on this edge is at most c(e). Let us denote by M, the 
maximum sum of flow values if we restrict ourselves to integer flows. 

We will use the notation d,(v) for the sum of capacities of edges incident to v. 
Let V,,V,,...,V, be a family 7 of disjoint subsets of V such that each demand 
edge connects different V,. By a multicut defined by we mean the set of edges 
uv of G such that u € V,, vu  V, for some i. The capacity of a multicut is defined to 
be DD d(V,)/2. Let m denote the minimum capacity of a multicut. Let m, denote 
the minimum capacity of a cut separating each terminal pair (if there is any). 
Obviously, m, =m >M=>M,. If k=1, then m, = M, by Menger’s theorem. 


Theorem 8.21 (Hu 1963). If k=2, m,=M. If k=2 and d,(v) is even for each 
non-terminal node, then m, = M, (Rothschild and Whinston 1966a). 


Theorem 8.22 (Lovasz 1976b, Cherkasskij 1977b). If the demand edges form a 
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complete graph induced by T (T CV), then m= M. In addition, if d,(v) is even 
for v €V-—T, then m= M,. 


‘Generalizing this result to non-Eulerian graphs, Mader (1978b) found the 
following characterization for M,. 


Theorem 8.23. Let G=(V, E) be a graph and T a specified subset of nodes. The 
maximum number of edge-disjoint paths connecting distinct elements of T is 
min[» d(V,) — qo(UV,)]/2 where the minimum is taken over all collections of 
disjoint subsets V,, V,,...,Vjr, for which IV; T|=1. (Here d(X) denotes the 
edges leaving X and q,(X) denotes the number of components C of G— X for 
which d(C) is odd.) 


To formulate a node-disjoint version of Theorem 8.23 suppose that T is 
independent. For a subset X of V and a subgraph G’ of G let b(X; G'):=|{x EX: 
there is an xy € E(G') with y ¢ X}|. E(X) denotes the set of edges induced by X. 


Theorem 8.24 (Mader 1978c). The maximum number of openly node-disjoint 
paths connecting distinct members of T is equal to min(|Vo| + LBV; G-—V,)/2}) 


where the minimum is taken over all collections of disjoint subsets V,,V,,...,V, of 
V—T (k20) (where only V, can be empty) such that G—V,— U(E(V,): i= 
1,...,k) contains no path connecting distinct nodes of T. 


This result can be regarded as a common generalization of Menger’s theorem 
and the Berge-Tutte theorem. An immediate corollary of Theorem 8.24 is a 
result of Gallai (1961). 


Corollary 8.25. The maximum number of disjoint paths having end nodes in T is 
min(|K| + 3 [||CN T|/2]: K CV) where the sum is taken over the components C 
of G-— K. 


Let us turn back to edge-disjoint paths. A common generalization of Theorems 
8.21 and 8.22 is as follows. 


Theorem 8.26 (Karzanov and Lomonosov 1978). Let H=(T,F) denote the 
demand graph. If the maximal independent sets of H can be partitioned into two 
classes such that both classes consist of disjoint sets (which is equivalent to saying 
that the complement of H is the line graph of a bipartite graph), then m= M. In 
addition, if d.(v) is even for v EV — T, then m = M, (Karzanov 1985). 


A proof relying on the polymatroid intersection theorem of Edmonds can be 
found in Frank et al. (1992). Let us continue our survey with two results where no 


Parity restrictions are imposed. 


Theorem 8.27 (Lomonosov 1983). Suppose that k =2 and G+ H is planar. Then 
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M, is either m—1 or m. M,=m—1 if and only if there are three cuts of value m 
(each separating both (s,, t,) (i= 1, 2)) the union of which includes a cut B of odd 


i 


capacity so that B does not separate (s,,t;) (i= 1,2). 
Note that in Theorem 8.27 m= M immediately follows from Theorem 8.6(g). 


Theorem 8.28 (Kleitman et al. 1970). if every node of vEV—(s,,...,5,)— 
(t;,...,¢,) és adjacent to a member of at least k—1 terminal pairs and the 
terminals form an independent set of G, then m= M,. 


8.6. Directed graphs 


All the results we have considered so far in this section concerned undirected 
graphs. Let D = (V, A) be a digraph and let (s,,7;) @=1,2,...,) be (ordered) 
pairs of terminals. The problem is to find (arc-) disjoint paths from s, to t,. Let 
H =(U, F) denote the demand graph, where F = {(¢,,5,):i=1,2,...,k}. We call 
a directed circuit of D + H good if it contains precisely one demand arc. Then the 
arc-disjoint paths problem is equivalent to finding & arc-disjoint good circuits of 
D+H. 

Unfortunately, much less is known about directed graphs. One negative result 
is as follows. 


Theorem 8.29 (Fortune et al. 1980). The (arc-) disjoint paths problem is NP- 
complete for k =2. 


Notice that the corresponding undirected problem is tractable (see Theorem 
8.3). In what follows we briefly list some special cases when good characteyiza- 
tions and/or polynomial time algorithms are available. 

The following criterion is clearly necessary in the arc-disjoint case: 

Directed cut criterion: 5;(X)26,,(X) for every X CV. If s,= ++: =s, and 
{,= ++: =¢,then the directed cut criterion is sufficient as well (directed arc- 
version of Menger’s theorem). A counter-part of Theorem 8.8(a) is also true. 


Theorem 8.30. If H consists of two sets of parallel arcs and D + H is Eulerian (that 
is, the in-degree of any node is equal to the out-degree), then the directed cut 
criterion is necessary and sufficient for the solvability of the arc-disjoint paths 
problem. 


Proof. Assume that H consists of a, arcs from ft; to s; @=1,2). By Menger’s 
theorem, it follows from the hypothesis of the theorem that there are a, 
arc-disjoint paths in D from s, to ¢,. If we leave out these paths and the a, 
demand edges from D+ H we obtain an Eulerian digraph. This partitions into 
arc-disjoint circuits, and hence it contains a, edge-disjoint paths from s, tot,, O 
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Figure 8.4. 


In the example in fig. 8.4 the directed cut criterion is satisfied but there is no 
solution to the directed edge-disjoint paths problem. 

The reason is that the following necessary condition, called covering criterion, is 
violated: the directed circuits of D + H cannot be covered by less than k arcs. 


Theorem 8.31. When D +H is planar and D is acyclic the directed edge-disjoint 
paths problem has a solution if and only if the covering criterion holds. 


Proof. By planar dualization we obtain from the Lucchesi-Younger Theorem 
2.15 that in a planar digraph the maximum number of directed circuits is equal to 
the minimum number of arcs covering all the directed circuits. Since D is acyclic 
the set A(H) of demand arcs covers all directed circuits of D+ H. By the 
covering criterion this is a minimum covering of directed circuits and hence there 
are k =|A(H)| arc-disjoint circuits in D + H which must be good circuits. O 


In the following theorem D may be non-planar but the number k of demand 
arcs is considered as a constant. 


Theorem 8.32 (Fortune et al. 1980). Jn acyclic digraphs the (arc-) disjoint paths 
problem can be solved in polynomial time if k is fixed. 


In the special case k = 2, Thomassen (1985) found a complete description of 
acyclic digraphs having no solution to the disjoint paths problem. The core of his 
result is as follows. 


Theorem 8.33. Let us be given an acyclic digraph D =(V, A) and terminal pairs 
(81,1), (t,,%)) such that |V|>=5, & (v), &*(v) 22 for each non-terminal node v 
and & (s,)=6 (s,)=6"(t,) =8 *(t,) =0. If there are no disjoint paths from s, to 
t, and from s, to t,, then D is planar and has a plane representation in such a way 
that s,, t,, t}, $, are on the outer face occurring in that cyclic order. 


Ibaraki and Poljak (1991) solved the arc-disjoint paths problem when k = 3 and 
D +H is Eulerian. Let D be an Eulerian digraph with three distinct specified 
nodes a, b, c, called terminals. The three-terminal problem consists of finding 
(altogether three) arc-disjoint paths from a to b, from b to c and from c to a. 
Clearly, this is a special case of the three arc-disjoint paths problem but Ibaraki 
and Poljak observed that, conversely, the three arc-disjoint paths problem can 
also be easily reduced to the three-terminal problem. 

Suppose that D is a planar digraph with no cut-nodes that has a plane 
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representation such that each face is bounded by a directed circuit, the terminals 
have degree 2, and they lie in one face where their order with respect to the 
orientation of the face is a, c, b. It is easy to see that in such case the 
three-terminal problem has no solution. Therefore we call such a representation 
bad. 


Theorem 8.34 (Ibaraki and Poljak 1991). Given an Eulerian digraph D with 
terminals a, b, c, the three-terminal problem has a solution if and only if D cannot 
be contracted to a planar digraph that has a bad plane representation. 


As far as the maximization problem is concerned for digraphs we mention the 
following (rather easy) counter-part of Theorem 8.22. 


Theorem 8.35 (Frank 1989). In an Eulerian digraph D=(V, A) the maximum 
number of arc-disjoint paths connecting distinct nodes of a specified subset T of V 
is equal to the minimum of Y8 (V,) over all families of disjoint subsets 
V,,V2,---,Vir) of V for which |V,NT|=1 G=1,2,...,|T]). 


We close this section by mentioning an interesting sufficient condition by 
Shiloach (1979). Let us call a digraph D = (V, A) k-linked on the arcs if for any 
choice of k pairs {s,,f,},...,(S,,¢,} of (not necessarily distinct) terminals there 
are arc-disjoint paths P, from s, to ¢; (@=1,...,). Obviously such a digraph is 
strongly k-arc connected (that is every non-empty proper subset of nodes has k 
entering arcs.) 


Theorem 8.36. A strongly k-arc connected digraph is k-linked on the arcs. 


Proof. Add a new node r to D and new arcs (r,s,) ((=1,2,...,) and apply 
Edmonds’ disjoint arborescence Theorem 6.9. O . 


The theorem also easily follows from Theorem 7.30 of Mader. 
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1. Introduction and preliminaries 


A matching in a graph G = (V, E£) is a set of edges (links) no two of which have a 
common end. The most basic problems of matching theory involve establishing 
the existence of matchings with sufficiently many elements. The size of the largest 
matching is denoted by »(G). A node v is saturated by a matching M if some edge 
of M is incident with v. A matching which saturates all nodes of G is called 
perfect. 

This problem of constructing maximum matchings, or determining whether 
there exists a perfect matching has been one of the most extensively studied 
problems of graph theory. This is due in part to the wide variety of extensions and 
applications which exist, in part to the amount of structural information this 
subject provides about a graph, in part to the branches of combinatorics for which 
it serves as a prototype (e.g., polyhedral combinatorics) and in part to its inherent 
tractibility. 

A stable set (or node packing) in a graph G = (V, E) is a set of nodes, no two of 
which are adjacent. The size of a maximum stable set is denoted by a(G). 
Initially, the problems of determining a(G) and »(G) sound deceptively similar. 
In one case we pack nodes, in the other we pack edges. However, the problem of 
packing a maximum number of edges can be solved in time which only grows 
polynomially with the size of the graph (see section 3) but the maximum stable set 
problem is one of the original NP-hard problems. Consequently much of the 
research on this latter topic has dealt with special classes of graphs for which the 
problem is more tractible. In this chapter we focus our attention on matchings, 
but also discuss stable sets, particularly with respect to their connections with 
matching theory. They are also treated in chapter 4. 

A node cover in a graph is a set T of nodes such that every edge has at least one 
end in T. An edge cover is a set of edges such that every node is incident with at 
least one edge of the set. Then 7(G) and p(G) denote the sizes of smallest node 
covers and edge covers respectively of G. Since S is a stable set if and only if V\S 
is a node cover, 


. 


a(G)+7(G)=|Vi=n. (1.1) 


Less obviously, the same relation holds for edges. 
Proposition 1.2. »(G) + p(G) =|V| =n, provided that G has no isolated nodes. 


Proof. Let M be a maximum matching, and let U be the set of nodes not 
Saturated by M. For each u € U we can choose an incident edge e(x), let S be the 
set of these edges. Then M US is an edge cover, so |[M| +|M US| =2|M|-+ |S|= 
(lV| —|U|) +|U| =|V], so »(G) + p(G) <n. Conversely, let C be an edge cover 
of cardinality p(G). Minimality of C ensures that C is the edge set of a forest, so 
if k is the number of components, then k =n — p(G). Each component contains 
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an edge; choosing one edge from each component yields a matching of cardinality 
k, so o(G) + p(G) =n. Combining these inequalities gives Proposition 1.2. O 


Relation (1.1) and Proposition 1.2 are called the Gallai identities (see Gallai 
1959 and Lovasz and Plummer 1986). 

There is another dual relationship between these parameters. If T is a node 
cover and M is a matching, then no node of 7 can cover more than one edge of 
M, so 


WG) <1(G). (1.3) 


Similarly, no edge of an edge cover can meet more than one node of a stable set, 
so 


p(G)>=a(G). (1.4) 


Matching theory is much easier for bipartite graphs than for nonbipartite 
graphs. In the next section we discuss a fundamental theorem of Konig which 
states that we have equality in (1.3) and (1.4) if G is bipartite. However, since 
v(K,) = 1 but 7(K,) =2 and p(K,) = 2 but a(K;) = 1 we see that equality need not 
hold for nonbipartite graphs. It does hold for some nonbipartite graphs, for 
example, suppose we add a new node u adjacent to any node of K,;. Then 
equality again holds in (1.3) and (1.4). Those graphs for which equality holds are 
called Kénig—Egervary graphs, and are discussed in section 6, in the context of 
2-matchings. 

Suppose now we wished to modify (1.3) so that equality holds for all graphs. 
First we could try to obtain an equation which holds for »(G). This was obtained 
by Edmonds (1965a). Let # be a family of pairwise disjoint odd cardinality 
subsets of V. We say that % is an odd set cover if every edge either is incident with 
a node belonging to a singleton component of %, or else has both ends in a 
nonsingleton component. The weight w(S) for SEX is defined by w(S)=1 if 
[S| = 1; w(S) = ({S| — 1)/2 if |S| =3, and w(%) = Veg w(S). Let 7(G) denote the 
minimum of w(%), over all odd set covers. Again, trivially, »(G)<7(G), and 
moreover we have the following. 


Theorem 1.5 (Edmonds 1965a). For any graph G, v(G)= 7(G). 


We discuss this result in section 3, in connection with the matching algorithm as 
it was in this context that it was developed. 

The analogous problem for 7 would be to try and pack something more general 
than edges to obtain an expression for 7(G). Unless NP=coNP, no such 
computationally tractible condition exists for general graphs. Nevertheless, some 
interesting special classes do occur, the best known of which is the class of perfect 
graphs. Suppose we define the weight w(K) of a clique K in G to be |K|—1. We 
let #(G) be the maximum of {D. xc, W(K): M is a set of pairwise disjoint cliques 


Matchings and extensions 183 


in G}. Then G is perfect if and only if 7(G’) = »(G'), for every node induced 
subgraph G’. This topic is discussed in chapter 4. 

We mention one more relationship between the matching and stable set 
problems. Recall that the node set of the line graph L(G) is the edge set E of G, 
and two nodes of L(G) are adjacent if and only if the edges have a common end 
in G. Then MCE is a matching in G if and only if M is a stable set in L(G). 
Thus line graphs provide a class of graphs for which a can be computed in time 
which grows polynomially with the size of the graph. An interesting extension of 
this result was obtained independently by Minty (1980) and Sbihi (1980). A graph 
is the line graph of a simple graph if and only if it does not contain any of nine 
node-induced subgraphs, the simplest of which is the claw, K,,. A graph is 
claw-free if no node induced subgraph is isomorphic to K, ,. Minty and Sbihi 
showed that a(G) could be calculated in polynomial time for claw-free graphs, 
see section 8. However, at present there has not been obtained the breadth of 
results concerning stable sets for claw-free graphs that has been obtained for 
matchings. 

One of the fundamental ideas of matching theory is that of an augmenting path. 
A path or circuit P in a graph is said to be alternating with respect to a matching 
M if its edges alternately are in and not in M. (An alternating circuit will have an 
even number of edges.) If the ends of a path P are not saturated by M, then P is 
called an augmenting path, for M'=M AP is a larger matching then M. This 
notion provides one characterization of v(G). 


Theorem 1.6 (Berge 1957). A matching M of G is maximum if and only if there 
exists no augmenting path with respect to M. 


Proof. We have seen that the existence of an augmenting path implies that M is 
not maximum. Conversely, if there exists a matching M’ such that |M’|>|M|, 
then M A M' consists of a number of alternating paths and circuits with respect to 
M and at least one path must be augmenting. DO 


This result has an attractive analogue for maximum stable sets in claw-free 
graphs, which is discussed in section 8. 

Note the difference between Theorem 1.5 and Theorem 1.6, both of which 
characterize maximum matchings. The former provides a means of showing that a 
matching is maximum, which can be checked in polynomial time, but does not 
indicate how to find such a matching. The latter indicates an approach to finding a 
maximum matching, but does not provide an obvious means of proving maximali- 
ty. (How can one show that no augmenting path exists?) In section 3 we will 
Present an algorithm which combines these to efficiently compute »(G). 

The best general reference for matching is the book Lovdsz and Plummer 
(1986). Most books on graph theory and integer programming contain an 
introduction to matching theory, especially the bipartite case. Good references 
are Lawler (1976), Papadimitriou and Steiglitz (1982), and Nemhauser and 
Wolsey (1988). Derigs (1988) discusses algorithmic and polyhedral issues and 
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Burkard and Derigs (1980) provide descriptions of algorithms as well as FOR- 
TRAN codes. 


2. Bipartite matching 


2.1. Konig’s theorem and consequences 


Let G=(V, UV,, E) be a bipartite graph with colour classes V, and V,. In this 
case the parameters », p, 7, a can all be efficiently computed and their values are 
closely related. This fundamental result is due to K6nig (1916a,b). 


Kénig’s Theorem 2.1. if G =(V, UV,, E) is bipartite, then v(G)=7(G). (Or, a 
maximum matching and minimum node cover have the same cardinality.) 


Proof (D. de Caen, private communication). We first show that in every bipartite 
graph with edges, at least one end of each edge is saturated by every maximum 
matching. For let « and v be adjacent nodes and suppose there exist maximum 
matchings M“ and M” which leave u and v, respectively unsaturated. Each of u 
and v is the end of a path whose edge set belongs to M“ A M”. Since M“ and M° 
are of maximum cardinality, and hence equicardinal, these paths must be of even 
length. If these paths are different, then their union plus the edge uv forms an 
odd length path joining a node unsaturated by M” and a node unsaturated by M”. 
If we replace the edges of M“ belonging to this path with the path’s odd 
numbered edges, we get a matching larger than M“, a contradiction. Therefore 
these paths are identical and have an even number of edges, so this path, plus the 
edge uv, forms an odd cycle which contradicts G being bipartite. 

Now the result follows by induction. If G has no edges, then v(G) = 7(G) = 0. 
Otherwise, choose a node v saturated by every maximum matching. Then 
v(G — v) = o(G) — 1 and since C\{v} is a cover of G, for any cover C of G-v, 
1(G)<1(G —v)+1. By induction »(G —v)=7(G—v), so 7(G)<(G), which 
together with (1.3) gives the result. O 


There are many other proofs of this theorem, see, e.g., Lovasz and Plummer 
(1986). It also follows directly from the matching algorithm, which we present in 
the next section for nonbipartite graphs. It also follows from the characterization 
of the perfect matching polytope, discussed in section 5S. 

There is an important difference between the relationships between vy and 7 
established by K6nig’s Theorem 2.1, which asserts that v(G)=7(G) for a 
bipartite graph G, and the Gallai identities (1.1) and Proposition 1.2. Let M be 
any matching. If |M| is maximum, then by K6nig’s theorem there must exist a 
node cover C such that |C|=|M|. Moreover, in view of (1.3), the existence of 
such a C provides a short proof of the maximality of |M|. Any interested observer 
can readily convince himself of this fact by verifying that M is a matching, C is a 
node cover and |M| ={C|. 

However, suppose we have found a matching M and an edge cover R such that 
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|M|+|R|=|V|. We have not established the maximality of |M| or the minimality 
of |R|. It must be independently established for either M or R and then it will 
follow for the other. Similarly, the existence of a stable set S and a node cover C 
such that |S|+|C|=|V| establishes nothing about a(G) or 7(G). In section 3 we 
describe a polynomially bounded algorithm which will compute »(G) for an 
arbitrary graph G in time which only grows polynomially with the size of G. 
However, unless P=NP (see chapter 29), no such polynomially bounded 
algorithm can exist for computing a(G) or 7(G) for an arbitrary graph G. 

One striking feature of bipartite matching theory is the number of equivalent, 
but different forms of this fundamental relationship. One of the best known is the 
so-called SDR theorem of P. Hall. Let (Q;: i €/) be a finite family of subsets of a 
finite set §. A system of distinct representatives (SDR) is a function f : [> S, such 
that for each i€ J, f(i)€Q, for each i, jE, and i+; implies f(i) ¥ f(j). The 
element f(i) is called the representative of the set Q,. 


SDR Theorem 2.2 (Hall 1935). The family (Q,: i€1) has an SDR if and only if 
\U(O;;i€ H)|>|H|, foreveryH CI. (2.3) 


Note that the necessity of this condition is obvious — if there is to exist an SDR, 
then the union of every subfamily of the sets must contain at least as many 
elements as exist in the subfamily. We will show that the sufficiency follows from 
Theorem 2.1, by first modelling the problem as a bipartite graph. Construct a 
graph G = (S UJ, E), where there is an edge in E joining s€ S toi G/ if and only 
if s€ Q,. Then an SDR corresponds to a matching in G which saturates all nodes 
of J and condition (2.3), called ‘‘Hall’s condition”, becomes |N(H)|=|H| for all 
H Cl. Thus Theorem 2.2 is equivalent to the following. 


Theorem 2.4. A bipartite graph G =(V, UV,, E) has a matching which saturates 
all nodes of V, if and only if, for every H CV,, |N(A)|= HI. 


Proof. As we have seen, the necessity of the condition is easy. Conversely, 
suppose »(G)<|V,|. Let T be a node cover of size u(G), ie., |T|<|V,|. Let 
S=V\\T. Then MS)CV,OT, so |S|=|V,)-|TNV,|=|V,|—-IT| + ¥en T]> 
|N(S)|, so the condition is violated. O : 


Corollary 2.5 (The marriage theorem, Frobenius 1917). A bipartite graph G = 
(V, UV,, E) has a perfect matching if and only if |V,| =|V,| and |H|<|N(H)| for 
all HCV,. 


There is an interesting connection between SDRs and matroid theory. Suppose 
that (Q;: i€ 1) is a finite family of subsets of a finite set S. Let H = {J CI: (Q;: 
!€ J) has an SDR}. Edmonds and Fulkerson (1965) showed that /7 is the family 
of independent sets of a matroid defined on J. If we consider the model of the 
SDR problem as a bipartite matching problem on the graph G=(S UJ, E) 
described previously, then we can see that this is equivalent to the assertion that 
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the subsets of the nodes of / saturated by some matching of G form the 
independent sets of a matroid. By reversing the roles of S and J we can obtain 
one more equivalent form. A set 7 CS is called a partial transversal of (Q;: i€ I) 
if there exists a function f: T— J such that for each sET, f(s) is distinct and 
s€Q,,). Then we have that the family of partial transversals is the family of 
independent sets of a matroid of the set S. See chapter 9 of this volume. 

A well-known theorem of Menger (see also chapter 2) provides an example of a 
theorem less obviously equivalent to Theorem 2.1. Let s and t be distinct nodes of 
a graph G (digraph G). We say that two s—t-paths (dipaths) are internally disjoint 
if they have no nodes in common other than s and t. A set X CV \{s, t} is called 
an s—t-separator if s and ¢ belong to different components of G —X. 


Menger’s Theorem 2.6 (Menger 1927). For any graph (digraph) G and any two 
distinct nodes s and t such that (s, t) is not an arc of G, the maximum number of 
pairwise internally disjoint s—t-paths (dipaths) equals the minimum cardinality of 
an s—t-separator. 


Proof. We prove the directed theorem. From this the undirected result can be 
deduced by replacing each edge uv with arcs (u,v) and (v, u). 

Construct a bipartite graph G* from G by replacing each node uv of G other 
than s and ¢ with two new nodes A(v) and t(v). Node s is replaced with deg*(s) 
new nodes and ¢ is replaced with deg’ (t) new nodes. For each arc (s, u) of G, we 
construct an edge joining ‘(u) and a distinct new node corresponding to s. For 
each arc (u, t) we construct an edge joining A(u) and a distinct copy of ¢. Finally, 
nodes h(u) and t(v) of G* are adjacent if and only if «=v or G contains an arc 
(u,v). See fig. 2.1. 

Let M be a maximum matching in G*. Then for each node u EV \{s, t}, at least 
one of A(u), t(u) must be saturated. In fact if only one is saturated, we can replace 
the incident edge of M with h(u)t(u), and get another maximum matching of G* 
which saturates both. Then the arcs of G for which the corresponding edges of G* 
are in M decompose into internally disjoint circuits and s—i-paths in G, and the 
number of s-t-paths in this collection is |M| — |V\{s, ¢}|. By Theorem 2.1 there is 
a cover C in G* with |C| =|M| and we can assume that only nodes A(u) or t(u) 
belong to C. Let W be the set of nodes u of G for which both A(u) and t(u) are in 
C, If any s—t-dipath in G avoided all nodes of W, then some edge of G* would not 
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be covered by C, a contradiction. So W is an s—t-separating set of size |M|— 
lV\{s, |. O 


2.2. Term rank and the Birkhoff-Von Neumann theorem 


Another corollary of Theorem 2.1 can be stated in terms of matrix theory. Let A 
be an m Xn matrix, all of whose entries have value 0 or 1. The term rank of A is 
the size of the largest subset R of the entries having value 1, such that no two 
members of R belong to the same row or column. The term rank of A is the same 
as the linear-algebra rank of the matrix obtained by replacing the 1s in A by 
distinct indeterminates (over an integral domain). (See Edmonds 1967.) 


Theorem 2.7. The term rank of a (O—-1)-matrix A equals the minimum size of a set 
of rows and columns which contain every nonzero entry of A. 


Proof. Let / and J be the sets of row and column indices of A. Construct a 
bipartite graph G = (J UJ, E) where there exists an edge joining i€ J and j € J if 
and only if a,,= 1. Then »(G) is the term rank and 7(G) is the size of a smallest 
cover of the 1s with rows and columns. O 


This result does have an interesting consequence. An nx” matrix A=(q;,;: 
i= 1,2, Ee Soe elt Oe a) is doubly stochastic if a,,>0 for all i and j and 
yi -1 4, = 1 for all j, and ¥);., @,, = 1 for all i. We define an assignment to be a set 
of n entries of A, no two of which belong to the same row or column. 


Lemma 2.8. A doubly stochastic matrix has an assignment contained in its nonzero 
elements. 


Proof. Let A be obtained by replacing each positive element of A with a 1. Then 
A has an assignment contained in its nonzero elements if and only if A has term 
rank n. Let H and K be subsets of the rows and columns of A which contain all 
nortzero elements of A. Since each row sum and column sum of A is 1, the sum of 
all its entries is n, which must be less than or equal to the sum of the entries in 
each of the rows of H plus the sum of the entries in each of the columns of K. 
Therefore |H|+|K|2n, and so A has term rank n. O 


Using Lemma 2.8 we can obtain a basic theorem of polyhedral combinatorics, 
for bipartite graphs. An n xn (0-1)-matrix A is called a permutation matrix if 
the nonzeros comprise precisely an assignment. A matrix A is a convex 
combination of matrices A’, A’,...,A* if there exist nonnegative reals Aj, 
An,...,A, such that A= an A,A’ and ie A, = 1. 


Birkhoff-Von Neumann Theorem 2.9 (Birkhoff 1946, Von Neumann 1953). Every 
doubly stochastic matrix is a convex combination of permutation matrices. 
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Proof. It is sufficient to show that each (nm x n) nonnegative matrix A having all 
row and column sums equal to some positive value can be expressed as a 
nonnegative linear combination of permutation matrices. We proceed by induc- 
tion on the number of nonzero entries. Since all row and column sums are 
positive, there are at least n nonzero entries. If there are exactly n, then A is just 
a positive multiple of a permutation matrix. Suppose there are more than n. By 
Lemma 2.8 (applied to A suitably scaled), there is an assignment T contained in 
nonzero entries of A. Let A, be the smallest entry of 7, and let A’ be the 
corresponding permutation mate. Let A’=A-A,A’'. Then A has constant row 
and column sums, but fewer nonzero entries. By induction, there exist A,, 

Ay, Ay > O and permutation matrices A’, A®,...,A* such that A= 


y wo Therefore A= ¥,., 4’ as required. O 


There are many different proofs of this result. It is an easy corollary of the 
matching polyhedron characterizations of section 5. It is also an immediate 
consequence of the total unimodularity of the node—edge incidence matrix of a 
bipartite graph. (See chapter 30.) 


3. Nonbipartite matching 


Matching theory for nonbipartite graphs is significantly more complicatedi%han for 
bipartite graphs. As we saw, K, provides an example of a graph G for which 
v(G)<7(G). Nevertheless, the matching problem for nonbipartite graphs is as 
well solved as it is for the simpler bipartite case. 


3.1. Theorems of Petersen and Tutte 


The following theorem is due to Petersen (1891). s 


Petersen’s Theorem 3.1. Let G be a cubic (regular of degree three) graph with at 
most three isthmuses. Then G has a perfect matching. 


We will show later how this follows from characterizations of Tutte and 
Edmonds. 

This theorem had its roots in the attempt by Tait to prove the four-colour 
conjecture. Tait showed in 1880 that this conjecture was equivalent to the 
assertion that a cubic three connected planar graph could be edge 3-coloured, i.e., 
have its edge set partitioned into three perfect matchings. Moreover, he claimed 
that this latter fact followed from the hamiltonicity of planar cubic 3-connected 
graphs, which indeed it would have, were this latter “fact” true. Only much later, 
Tutte (1946) constructed a counterexample to Tait’s claim. Note that this did not 
rule out the existence of the required partition into three perfect matchings, just 
one method of obtaining the partition. 

Petersen apparently was sceptical concerning Tait’s “proof”, and constructed 
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an example of a cubic isthmus-free graph whose edges could not be partitioned 
into three perfect matchings. This is the now-famous “Petersen graph’. (Of 
course, this graph is nonplanar.) His Theorem 3.1 did show that planar or not, a 
cubic isthmus free graph must have at least one perfect matching. See Biggs et al. 
(1976) for a fascinating account of these events. 

Tutte (1947) was able to completely characterize those graphs having perfect 
matchings. This result may be the most fundamental result of matching theory. 
For any graph G=(V,E), we let @(G) denote the number of connected 
components of G which have an odd number of nodes. 


Tutte’s Theorem 3.2. G =(V, E) has a perfect matching if and only if, 
‘for every X CV, 0(G — X) <= |X|. (3.3) 


Proof. The necessify of the condition is immediate. For if G has a perfect 
matching M, then there must be at least one edge of M joining each odd 
component of G — X to X. If J is the set of all such edges, then 0(G — X) <|J|< 
|X|. 

We prove the sufficiency by induction on the size of G. Suppose that (3.3) 
holds. By taking X =@ we see that every component of G has an even number of 
nodes and |V| is even. Therefore |X| and @(G — X) have the same parity, for all 
X CV. Consequently if |X| >6(G — X) for every nonempty X CV, we can delete 
any pair u, vu of adjacent nodes from G; (3.3) will still hold; by induction there 
exists a perfect matching M’' of G— {u,v} and M' U {uv} is a perfect matching of 
G. 

Suppose now that there exists @# X CV such that |X| =0@(G — X). Choose X 
maximal with this property. Let S be the set of nodes of V\X belonging to even 
components of G — X. 

First note that G[S] must have a perfect matching, for if not, by induction, 
there exists X’ C S such that G[S] — X’ has more than |X’| odd components. But 
then X UX’ violates (3.3) for G. 

Second, let K be the node set of an odd component of G — X, and let v be any 
node of K. We claim that G[K\{v}] has a perfect matching. For if not, by 
induction, there exists X C K\{v} such that G[K\(X U {v})] has at least |X| +2 
odd components, and so deleting X U X U {v} from G must result in at least 
IXUXU {v}| odd components, contradicting our choice of X. 

Finally, let G be the bipartite graph obtained from G — S by deleting all edges 
of E(X) and contracting each odd component of G— X to form a pseudonode. 
Let W be the set of these pseudonodes, some of which may just be single nodes of 
G. If G has a perfect matching we are finished, for we can combine a perfect 
matching of G with a perfect matching of G[S] and perfect matchings of G’ — v, 
for each odd component G’ of G — X, for a suitable node vu of the component. If 
G does not have a perfect matching, then by Hall’s Theorem 2.4 there exists 
Ww'Ccw such that |N,(W’)|<|W’'|. But then if we let X*=N,(W’)CX, we 
Violate (3.3), a contradiction. O 
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Tutte’s original proof of the sufficiency of his condition made use of the notion 
of the Pfaffian of a matrix. (See section 7.) The proof we gave here, which makes 
use of Hall’s theorem, is due to Anderson (1971). Later in this section we will 
provide another proof by describing a polynomially bounded algorithm which 
either constructs a perfect matching of G or else finds a set X CY such that 
6(G— X)> |X|. For another proof, and references to still more proofs, see 
Lovasz and Plummer (1986). 

At first Tutte’s condition (3.3) may seem of limited use, because it is clearly 
impractical to check all sets X CV to see if it is satisfied. However, just as K6nig’s 
Theorem 2.1 provides us with an efficient means of establishing the value of »(G), 
it does provide us with a very simple way of proving that a graph does not have a 
perfect matching — simply display X CV such that |X|<@(G—X). In the lan- 
guage of complexity, it proves that the class of all graphs mot possessing perfect 
matchings is in NP (see chapter 29). (Trivially the class of graphs possessing 
perfect matchings is in NP -—a perfect matching provides a succinct certificate.) 
Compare this with Berge’s alternating path Theorem 1.6. 

Petersen’s Theorem 3.1 follows immediately from Theorem 3.2. All we need 
verify is that (3.3) holds for a cubic two-connected graph G = (V, E). Let X CV 
and let J be the set of edges joining nodes of X to nodes of odd components of 
G — X. For any SCY, such that |S| is odd, |5(S)| will also be odd, and if G is 
two-connected, then |5(S)| = 3. Therefore, |/| = 30(G — X). But since G is cubic, 
3|X|=|J|. Therefore |X|20(G-—X) and so G has a perfect matd@ing as 
required. 


3.2. Edmonds’ perfect matching algorithm 


We now consider the problem of determining whether a graph has a perfect 
matching. This was solved for bipartite graphs by Kénig (1916a,b) and Egervary 
(1931), for both weighted and unweighted problems. Their method was termed 
“the Hungarian method” by Kuhn. Edmonds was able to generalize this method 
to nonbipartite graphs. The unweighted problem was treated in Edmonds (1965a) 
and the weighted in Edmonds (1965b). We discuss the unweighted version here 
and postpone treatment of the weighted problem until section 5. 

The perfect matching algorithm starts with any matching. If all nodes are 
saturated, then the matching is perfect and it terminates. Otherwise, it proceeds 
to either find a larger matching or else show that no perfect matching exists. It 
chooses an unsaturated node r and proceeds to grow a tree rooted at r which will 
enable it to find an augmenting path with one end equal to r, if such a path exists. 

An alternating tree T is defined relative to a matching M. It is rooted at one 
node r and the nodes are called even or odd depending on whether the distance to 
r in the tree is even or odd. It satisfies the following: 

(i) r is not saturated by M, all other nodes are saturated by M; 

(ii) for each odd node v, deg;(v) = 2; 

(iii) for each even node u, the first edge in the path in T from u to r is in M. 
(See fig. 3.1). 
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© EVEN NODE 
@ ODD NODE 


Figure 3.1. An alternating tree. 


A feature of the algorithm is that it shrinks certain subsets of nodes. Let 
G=(V,E) be a graph and let SCV. The graph G XS obtained from G by 
shrinking S is obtained by replacing all nodes in S$ with a single pseudonode, 
called “S”. All edges of E(S) are deleted, and any edge which joined u EG V\S 
and v€S in G is replaced by an edge joining u and S in GS. See fig. 3.2. 


Matching AlgorithA 3.4. 


Input. 


Output. 


Step 0. 
Step 1. 


Step 2. 


Step 3. 


Step 4. 


A graph G =(V, E). 

Either a perfect matching M, or a set X CV such that |X]<6(G — X). 

Let M be any matching, e.g., M=9. 

If every node is saturated by M, then M is perfect; terminate. Other- 

wise, choose an unsaturated node r and begin growing an alternating 

tree T rooted at r. Initially T consists of the single even node r. 

If every edge of G incident with an even node of T has as its other end 

an odd node of T, then let X be the set of odd nodes of T and terminate. 

Otherwise, let uv be an edge of G such that u is an even node of T and uv 

is not an odd node of T. There are three possibilities: 

Case 1. If v is not in T and is saturated, we go to Step 3 where we grow 
T. 

Case 2. If v is in T, then it must be an even node of T. Go to Step 4 
where we shrink. 

Case 3. If v is not in T, and is not saturated, we go to Step 5 where we 
augment. 

(Grow tree). Let uw be the edge of M incident with v. Grow T by 

adding v, w, uv and uw. Note that v becomes odd and w becomes even. 

Go to Step 2. 

(Shrink). Both u and v are even nodes of T. Let p be the frst common 

node in the paths in T from u to r and uv to r. Then p will be an even 

node of T. (Possibly p = u, v or r.) The edge uv plus the paths in T from 


Figure 3.2. Shrinking. 
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Step 5. 


Step Sa. 
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BEFORE SHRINKING AFTER 
Figure 3.3. 


u and v to p form an odd cycle C. Let S be the nodeset of C. Replace G 
by GxS. Replace M by the restriction of M to the edges of GxS. 
Replace T by T X S. Go to Step 2. (See fig. 3.3.) 

(Augment). The edge uv plus the path in T from u to r comprises an 
augmenting path. Enlarge the matching using this path. (See Theorem 
1.6.) 

(Expand pseudos). If there exists pseudonodes (formed by shrinking in 
step 4), we proceed to expand them in the order reverse to that in which 
they were formed. Let S$ be the last (unexpanded) pseudonode formed 
and let C by the odd polygon whose nodes were shrunk to form S. There 
will be exactly one edge of M joining a node u of S to a node not in S. 
There will be exactly one node p of C not saturated by an edggof M. 
Let P be the even length path contained in C which joins u and p. 
Replace M with M AP. The result is a matching in the graph obtained 
by expanding S. (See fig. 3.4.) If there remain unexpanded pseu- 
donodes, repeat this process. If no pseudonodes remain, go to Step 1. 


Now we show that the algorithm performs as claimed. It is easy to see that after 
each augmentation, and subsequent sequence of expansions, we are left ‘with a 
matching exactly one edge larger than the one with which we began. 

Suppose we terminate in Step 2 unable to grow the tree. The tree always has 
exactly one more even node than odd node, for it begins with a single even node 
and each time we grow the tree, we add one of each. Each time we shrink, we 
reduce the number of odd and even nodes by the same amount. Moreover, each 
even node of T, real or pseudo, will form a distinct odd component of G — X, 
where X is the set of odd nodes. Therefore |X| = 0(G — X)- 1, and so (3.3) is 


violated, 


as required. 


Figure 3.4, 
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Finally, we establish a bound on the running time of the algorithm. The 
algorithm can perform at most |V|/2 augmentations. Between augmentations, it 
can grow the tree at most |V|/2 times and shrink at most |V|/2 times. Using 
standard data structures, it is straightforward to implement tree growth and edge 
selection in constant time, shrinking in time O(|V|) and augmentation with 
expansion in time O(|V|*). This gives a total running time of O(|V|*). 

Hopcroft and Karp (1973) showed that the maximum matching problem for 
bipartite graphs could be solved in O(|V|°’’), by systematically using all possible 
shortest augmenting paths in each case. This was generalized to the nonbipartite 
case by Even and Kariv (1975). See also Vazirani (1994). For a good survey of the 
issues involved in implementing both weighted and unweighted matching algo- 
rithms, plus some improvements to the above bounds, see Galil (1986). This 
reference also discusses some of the issues involved in parallel computation. See 
also Derigs (1988) for a comprehensive discussion of matching algorithms in both 
the weighted and gnweighted cases. 

Note how the algorithm simplifies in the case that G is bipartite. Case 2 of Step 
2 cannot occur (because there are no odd cycles). Consequently we never need 
shrink or expand. This algorithm is the “‘Hungarian method” for constructing a 
maximum matching in a bipartite graph. 

Moreover, note that, as claimed, the algorithm does provide a second proof of 
the sufficiency of Tutte’s condition (3.3). However, it has the advantage that it 
either finds a perfect matching or else a set X which violates (3.3) in time which 
only grows polynomially with the size of the graph. 


3.3. Maximum matching and calculation of v(G) 


It is easy to modify the algorithm so that it will construct a maximum cardinality 
matching. When we reach the state in Step 2 of being unable to grow the tree, we 
say that the tree has become Hungarian. At this point we remove all nodes of the 
tree, or contained in pseudonodes of the tree, from the graph and resume trying 
to saturate another unsaturated node, if one exists. We repeat this process until all 
nodes not in Hungarian trees are saturated. 

We will reach the following situation: We have a set of Hungarian trees called a 
Hungarian forest. The nodes of G can be partitioned into DU AU C, where D 
consists of all nodes of G which are even nodes, or contained in even pseu- 
donodes of a Hungarian tree; A is the set of odd nodes of Hungarian trees and C 
is the set of nodes in no Hungarian tree. No edge joins a node of D to a node of 
C. An edge joins nodes of D only if they belong to the same pseudonode. The 
current matching induces a perfect matching on G[C], and each node of D is 
matched to a node of A. (See fig. 3.5.) Let ¢ be the number of Hungarian trees. 
Ther the number of nodes unsaturated by the final matching M will be exactly 1. 
Moreover, G— A has |A|+¢ odd components. Since the value 6(G — X)—|X| 
provides a lower bound on the number of nodes which must be left unsaturated 
by any matching, for any X CV, this shows that M is maximum. We thereby 
obtain a generalization due to Berge (1958) of Tutte’s theorem. 
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Figure 3.5. Hungarian forest. 
Theorem 3.5. For any graph G, 


xorg (LAG 


The deficiency of a matching M of G is the number of nodes left unsaturated by 
M. We let d(G) denote the deficiency of G, namely the deficiency of a maximum 
matching. Berge’s Theorem 3.5 is equivalent to the following. 


Corollary 3.6. For any graph G = (V, E) 


d(G) =max {0(G — X) — |X|}. ” 


It is possible to deduce Theorem 3.5 and Corollary 3.6 directly, see Lovasz and 
Plummer (1986). Also, they clearly imply Tutte’s theorem. However it is also 
easy to deduce them directly from Tutte’s theorem. Let d = maxy-y(@(G — X) — 
|X|}. Suppose we add d new nodes to G, each adjacent to every node in V. Then 
it is easy to see that Tutte’s condition holds in the new graph G’, so G’ has a 
perfect matching M’. But at most d edges of M’ are incident with the new nodes, 
so the edges of M’ belonging to E form a matching having deficiency d as 
required. 

Finally, we note how Edmonds’ Theorem 1.5 follows from the algorithm for 
constructing a maximum matching. The final matching M produced will contain 
(|V| — ¢/2) edges, where ¢ is the number of Hungarian trees. The following is an 
odd set cover of E. Initially, define X# by 


K = {{v}: v€ A} U{S: S is the set of real nodes contained 
in an even pseudonode of a Hungarian tree} . 


If C4, then choose some 6 EC and add to # the sets (G} and C\{v}. Let e 
and o denote the number of even and odd nodes of the Hungarian trees 
respectively. Then the weight of X is 4(|V|—e+0)=4(|V|— 94), establishing 
Theorem 1.5. 

In section 5 we will look at these results once again and see that they are special 
cases of some general weighted theorems. 
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4. Structure theory 


A rich structure theory for graphs has been developed, based upon its matchings. 
Not only does this provide information concerning which edges and sets of edges 
belong to maximum matchings, but in addition, yields decomposition results 
which are useful in proving other theorems. 


4.1. Ear decompositions of bipartite and critical graphs 


One common type of theorem establishes a so-called ear decomposition. It asserts 
that all graphs in a particular class can be constructed by starting with some 
“simple” structure and then adding “simple” appendages. As an example, we say 
that a graph G is matching covered if every edge belongs to a perfect matching. 
(Note that we can test whether an edge uv belongs to a perfect matching by 
testing whether Gg u —v has a perfect matching.) Sometimes matching covered 
graphs are called 1-extendable. More generally, a graph is k-extendable if it has a 
perfect matching and every matching of cardinality k is contained in a perfect 
matching. 

An ear with respect to a subgraph G = (V, E) of G consists of a path having an 
odd number of edges, such that the two end nodes are in V, but no internal nodes 
belong to V. (See fig. 4.1.) 


Theorem 4.1 (Hetyei 1964, see Lovasz and Plummer 1986). Let G=(V UW, E) 
be a connected bipartite graph. Then G is matching covered if and only if there 
exists a sequence Gy, G,, G,,...,G,=G of graphs such that 

(i) Gy is K,; 

(ii) fori=1,2,..., p, G, is obtained from G,_, by adding a single ear joining 
two nodes belonging to different parts of the bipartition of G,_,. 


We call this a bipartite ear decomposition. 


Proof. We first prove the necessity. Let M be a perfect matching of G and let 
G =(V, E) be a maximal subgraph of G such that 

(i) G has a bipartite ear decomposition; 

(ii) M induces a perfect matching of G. 


o-----* 
p——4 


Go GS, Ge Gz =G 


Figure 4.1. Ear decomposition. 
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First note that E = E(V), for if any edge of E(V) has been omitted, it could be 
added as an ear. Second, suppose that V # V. Let u EV\V be adjacent to uv € V. 
By (ii), edge uv is not in M. Let M be a perfect matching containing uv. Then 
M AM contains an even cycle containing edge uv whose edges alternately belong 
to M and to M. Let P be the maximal portion of this cycle containing uv, but 
containing no edges of F. Then the first and last edges of P are not in M, and P 
provides an ear which contradicts the maximality of G. 

Conversely, suppose that G has a bipartite ear decomposition. Then it is 
straightforward to prove that G is matching covered by induction on the number 
of ears, if we add to the induction hypothesis the following property: for all nodes 
u,u belong to different parts of the partition, G-—u-—v has a perfect 
matching. O 


A graph G is called (factor) critical or hypomatchable if G —v has a perfect 
matching for every node v. Such graphs must have an odd number of nodes and 
be connected, and it is easy to see that they must also be nonbipartite. Critical 
graphs play important roles in matching theory. They have ear decompositions 
very similar to those of matching covered bipartite graphs. 


Theorem 4.2 (Lovasz 1972b). Let G be a graph. Then G is critical if and only if 
there exists a sequence G,, G,,...,G,=G of graphs such that 

(i) Gg is an odd cycle; ) 

(ii) fori=1,2,..., p, G;, is obtained from G,_, by adding an ear joining two 
(not necessarily distinct) nodes of G,_,. 


This theorem can be proved analogously to Theorem 4.1. 

We call the sequence G,, G,,..., G, a critical ear decomposition of G. Note 
that critical graphs may contain cutnodes. In this case, some ears become odd 
cycles attached to single nodes of the previous graph. However, if G is 2- 
connected, then these degenerate ears can be avoided. This fact will make 
possible a simple inductive proof of a theorem characterizing some of the essential 
inequalities for the matching polyhedron (Theorem 5.18). 


Theorem 4.3 (Cornuéjols and Pulleyblank 1983, see also Lovdsz and Plummer 
1986). If G is a 2-connected critical graph, then G has a critical ear decomposition 


Gp, G,,..-, G, such that each G, is 2-connected. 
Proof. Suppose this is false; let Gy, G,,...,G, be an ear decomposition such 
that 


(i) the smallest t for which G, contains a cutnode v is as large as possible; and, 
subject to this, 

(ii) the first s for which v is no longer a cutnode of G, is as small as possible. 

For each i€ {1,2,..., p}, let C, be the ear added to G,_, to form G,. By our 
choice of s and ¢, each ear C, for t<i<s — 1 has neither end in G,_,, but C, must 
have one end in G,_, and the other must be an internal node u of some C, for 
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tsi <s — 1. The node u splits C; into an even length path P° and an odd length 
path P’. We can concatenate P° onto C, and obtain an ear C; which we can add 
instead of C,. Then we can add P' as a separate ear. Then we can add the ears 
Cisy,-++sC,_,. We will now have constructed G,, so we can complete the ear 
decomposition. But now vu stopped being a cutnode after adding C;, andi<s, a 
contradiction. O 


The matching algorithm shrunk certain subgraphs of G. We say that a graph is 
shrinkable if the maximum matching algorithm, when applied to it, shrinks it to a 
single pseudonode. That is, G is shrinkable if either G is a single node or else G 
contains an odd cycle with nodeset C such that G x C is shrinkable. It follows 
from Theorem 4.2 that all critical graphs are shrinkable. It is easy to show by 
induction that for any node v of a shrinkable graph G, G-—v has a perfect 
matching, that is, G is critical. Then G is critical if and only if it is shrinkable. 


4.2. Barriers and the canonical partition 


It is possible to give an ear decomposition theorem for nonbipartite matching 
covered graphs, but if we want every intermediate graph to be matching covered, 
we cannot do it by adding ears one at a time. Rather we must sometimes add ears 
in pairs. Also, care must be exercised when adding single or double ears to ensure 
that each graph in the sequence is matching covered. In order to specify which 
pairs of nodes are suitable as attachment points for ears, we introduce the 
canonical partition of V for any matching covered graph G = (V, E). Most of the 
material in this section is based on Hetyei (1964), Kotzig (1959a,b, 1960) and 
Lovasz (1972a), see Lovasz and Plummer (1986). 
First we observed two preliminary facts. 


Lemma 4.4, Let M be a perfect matching of a connected matching covered graph 
G =(V, E). For every pair u, v of distinct nodes, there is an alternating path from 
u to v, the first edge of which is in M. 


Proof. Let u € V and let S be the set of nodes reachable from u by an alternating 
path which begins with an edge of M, plus u itself. If S@V then there is st € E 
such that s€§ and tEV\S. If st M, then ¢ must also be reachable from u, so 
!€S, a contradiction. Therefore st ¢ M, but there exists a perfect matching M of 
G containing st. The symmetric difference M A M contains an alternating cycle C 
which uses st, and includes nodes of S. Let P be a minimal alternating path from u 
to a node r of C, which starts with an edge of M, if u is not in C. Then P plus one 
of the two paths in C from r to ¢ provides an alternating path from u to ¢ as 
Tequired, contradicting the definition of S. Therefore S=V. O 


__ An alternating path with respect to a matching M is called a decrementing path 
if the first and last edges belong to M. Then G — u—v has a perfect matching if 
and only if u and v are the ends of a decrementing path. (The sufficiency is trivial, 
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the necessity follows by considering MAM, for a perfect matching M of 
G—u-—v.) We define a relation ~ by u ~v if and only if G — u — v does not have 
a perfect matching. 


Lemma 4.5. Let G=(V,E) be connected and matching covered. Then ~ is an 
equivalence relation. 


Proof. All we need show is transitivity. Let M be any perfect matching of G. 
Suppose u~v and v ~ w but u % w. Then u and w are joined by a decrementing 
path P. By Lemma 4.4 there is an alternating path from v to u, which begins with 
an edge of M. Let P’ be a minimal such alternating path from v to any node s of 
P. Then P’, plus the portion of P from s to u or s to w is a decrementing path 
from vu to u or v to w, contradicting either u~v oru~w. O 


The canonical partition of G, denoted by A(G), is the set of equivalence classes 
of this relation. If G is nonempty, bipartite and matching covered, then there are 
exactly two equivalence classes —- the two colour classes. A graph G =(V, E) is 
bicritical if G — u — v has a perfect matching for each u, v EV. Equivalently, G is 
bicritical if every class of A(G) is a singleton. 

A barrier (Lovasz and Plummer 1986) in a connected matching covered graph 
G is a set XCV such that |X|=0(G —X). For each vEV, {v} is trivially a 
barrier, but we are more interested in the maximal barriers. ? 


Lemma 4.6. Let X be a maximal barrier in a connected matching covered graph 
G=(V,E). Then every component of G — X is critical. 


Proof. Since G is matching covered, all components of G — X are odd. Suppose 
that J is the nodeset of a component of G — X which is not critical. Then there 
exists v€S such that G[S]—v has no perfect matching. By Tutte’s theorem, 
there exists YCS\{v} such that G[S]—(YU{v}) has more than |¥| odd 
components. Therefore G — (X UY U {v}) has at least |X|+|¥|+1 odd com- 
ponents, and since G is matching covered, it must have exactly this number. 
Therefore X U Y U {u} is a barrier, contradicting the maximality of X. O 


The following is the main result relating barriers to the canonical partition. 


Theorem 4.7 (Kotzig 1959a,b, 1960 and Lovasz 1972a). For a connected matching 
covered graph G, the maximal barriers of G are precisely the classes of P(G). 


Proof. If u and v belong to a barrier X then G-—u-—v can have no perfect 
matching, since deleting X\{u,v} leaves |X| odd components. Therefore every 
maximal barrier is contained in a single class of A(G). 

Now let X be a maximal barrier and let v €V\X. By Lemma 4.6, v belongs to 
a critical component G[S]} of G\.X. Let M be a perfect matching of G. Some node 
w€S is joined to a node uE X by an edge of M. Remove uw from M and 
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replace the matching on G[S] with a matching which leaves only v unsaturated. 
This yields a perfect matching of G — {v, u}. Hence v “u, so X must be a class of 
YP(G). O 


Now we can describe the ear decomposition of matching covered graphs. 


Theorem 4.8 (Lovdsz and Plummer 1975). Let G =(V, E) be a connected graph. 
Then G is matching covered if and only if there exists a sequence Gg, 
G,,...,G, =G of graphs such that 
(i) G, is a Kp, 
(ii) fori=1, 2,..., p, G;, is obtained from G,_, by either 
(a) adding a single ear joining nodes belonging to different classes of 
P(G,_,) or 
(b) adding two ears, such that one joins distinct nodes u,, u, belonging to 
some class df P(G,_,) and the other joins distinct nodes v,, v, belonging to 
a different set of P(G,_,), and G — u, — u, — v, — ¥, has a perfect matching. 


Proof. See Lovasz and Plummer 1986. O 


The hard part of the proof is showing that we never require more than two ears 
to be added at a time. An elementary proof of this was recently obtained by Little 
and Rend! (1989). 


4.3. The Edmonds—Gallai partition 


Now we describe another type of decomposition of graphs which is useful for 
graphs which do not have perfect matchings. The idea is to decompose the graph 
into simpler graphs, such that the maximum matchings of the original graph can 
be constructed from those of the smaller graphs. For any graph G = (V, E), let 
D(G) be the set of all nodes v such that some maximum matching of G leaves v 
unsaturated. Let A(G) be the nodes not in D(G), but adjacent to nodes of D(G). 
Let C(G)=V\(D(C)U A(G)). (“D” stands for “deficient”, “A” stands for 
“adjacent” and “C” stands for ‘‘covered”.) The partition V= DV) VA(G)U 
C(G) is called the Edmonds—Gallai partition of G. 


Theorem 4,9 (Gallai 1963, 1964, Edmonds 1965a). Let G = (V, E) be an arbitrary 
graph and let D(G), A(G) and C(G) be defined as above. Then 


every component of G[D(G)] is critical ; (4.10) 
a matching M of G is maximum if and only if 


M induces a perfect matching of G[C(G)]; (4.11a) 
M induces a near-perfect matching of G[K], for every component K of 
G[D(G)] , (4.11b) 
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for each v€ A(G), there is an edge uv © M such that u belongs to 
D(G). (4.11¢) 


(Note that (4.11b) and (4.11c) imply that each node of A(G) is matched to a 
unique component of G[D(G)]. Also note that if G has a perfect matching, then 
V=C(G). If G is critical then V= D(G). The interesting cases are when G 
neither has a perfect matching nor is critical.) 

This result follows easily from linear programming duality, and we postpone the 
proof to the next section. However, we should note that the maximum matching 
algorithm of the previous section actually constructed the partition. The set D(G) 
consists of all nodes of V which are even nodes, or contained in even pseu- 
' donodes, of the Hungarian trees. The set A(G) consist of the odd nodes of the 
Hungarian trees and C(G) consists of the nodes not belonging to Hungarian trees. 
Consequently, constructing the Edmonds—Gallai partition is no more difficult 
than finding a maximum matching. 


4.4. Brick decompositions 


Suppose that G = (V, £) has a perfect matching. As we have already noted, in 
this case the Edmonds—Gallai theorem 4.9 gives no information. If we wish to 
decompose G based on the maximum (perfect) matchings, then edges not in any 
perfect matching are of no interest, so assume that G is matching covereg. We 
now describe the so-called brick decomposition of G. A brick is a bicritical 
3-connected graph. This process decomposes an arbitrary matching covered graph 
into a set of bricks and matching covered bipartite graphs. 


Brick Decomposition Procedure 4.12. Let G =(V, EZ) be a connected matching 
covered graph. 
(1) If G is a brick or is bipartite then it is indecomposable. 
(2) If G is nonbipartite and not bicritical, then let _X be a class of the canonical 
partition A(G) for which |X|>2, and let S be the nodeset of a component of 
G—X such that |S|>3. Let G, = Gx S. Let G, = G[S UX] x X. See fig. 4.2. 
A brick decomposition of G is the union of brick decompositions of G, and G2. 


Figure 4.2. 
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Figure 4.3. 


(3) If G is bicritical, but not 3-connected, then let {u,v} be a two node cutset, 
let S be the nodeset of a component of G — u — v and let T=V\(S U {u, v}). Let 
G, =G x (SU {u}) and let G, = G x (TU {v}). (See fig. 4.3.) A brick decompo- 
sition of G is the union of brick decompositions of G, and G,. 


Note that in bath (2) and (3) above, the coboundary of the shrunk nodes is a 
tight cut of G, i.e., each perfect matching uses exactly one edge. Moreover, the 
perfect matchings of G can be obtained by composing the perfect matchings of G, 
and G,. 

When performing a brick decomposition, we have considerable freedom in our 
choice of X and S in Step 2 and of u, v and S in Step 3. Surprisingly the final list 
of bricks and bipartite graphs obtained is (almost) independent of these choices. 


Theorem 4.13 (Lovasz 1987). The list of bricks and bipartite graphs obtained by a 
brick decomposition is independent (up to multiplicity of the edges) of the choices 
made in Step 2 and 3). 


In the next section we will see that the brick decomposition determines the rank 
of the incidence vectors of the perfect matchings of G, over the field of real (or 
rational) numbers. 


5. Weighted matchings and polyhedra 


5.1. Maximum weight matchings 


Suppose we have a graph G =(V, E) and a vector c =(c,: e EE) of edge costs. 
We wish to find a matching, or perfect matching, for which the sum of the edge 
costs is maximized. Both these problems can be solved almost as easily as the 
maximum cardinality matching problem, by exploiting a connection with linear 
programming. 

We first formulate these problems as integer programs. We introduce a 0-1 
variable x, for each e € E with the interpretation that x, = 1 if e is in the matching 
and x, = 0 otherwise. For any vector x =(x,;: jEJ) and any K CJ we let x(K) 
denote yy (x;:; jE K). Recall that dv) denotes the set of edges incident with v. 
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The maximum weight matching problem becomes: 


maximize cx 
subject to x20, integer, 
x(6(v)) <1 for allvEeV. 


Let 0 = {SCV: |S|=3, odd}. For any S$ € 0, if x is the incidence vector of a 
matching, then x(E(S)) < (|S| — 1)/2. Now consider the following linear program: 


maximize cx 


subject to x20 (5.1) 
x(6(v)) <1 for alluEV, (5.2) 
x(E(S)) < ({S| - 1)/2 for al SEO. (5.3) 


We have already noted that all incidence vectors of matchings satisfy (5.1)—(5.3). 
We will see that every extreme solution to (5.1)—(5.3) is the incidence vector of a 
matching. This we will prove by showing that for any vector c of edge weights, 
there exists an optimum solution to this linear program which is the incidence 
vector of a matching. 

The dual linear program is the following: 


minimize > y,+ > ((|S|—1)/2)z, a 
veEV seo 

subject to y, =O for allvEV , (5.4) 
zs2O0 forall SEO, (5.5) 


y ty, +2 (z5;u4,v ESEO)2c,, for all uvEE. (5.6) 


We will prove optimality by constructing both an incidence vector x* of a 
matching, and a feasible dual solution y*, z* which satisfy the complementary 
slackness conditions for optimality. Namely, x* satisfying (5.1)-(5.3) and y*, z* 
satisfying (5.4)—(5.6) are optimal if and only if 


x*,>Oimplies y* + y* +> (z¥:u,vE€SEO)=c,, for all uv EE, 


(5.7) 
z* >0 implies x*(E(S)) = (|S|— 1)/2 for al SEO, (5.8) 
y, >0 implies x*(8(v)) = 1 for allvEV. (5.9) 


We illustrate the use of these conditions by establishing the Edmonds-Gallai 
Theorem 4.9 from the output of the maximum cardinality matching algorithm (see 
section 3). Recall that when this algorithm terminates it has constructed a 
(maximum) matching with incidence vector x* and also a number of Hungarian 
trees. Recall that D is the set of nodes of G which are even nodes, or contained in 
even pseudonodes, of the Hungarian trees; A is the set of odd nodes of the 
Hungarian trees and C=V\(A UD). We let c, = 1 for all e © E, and consider the 
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following dual solution: 


1 ifvEA, 
y?=40 ifveD, 
4 ifvec, 


pseudonode of a Hungarian tree , 


1 if S is the nodeset of an even 
Zz 5 = 
0 otherwise. 


It is easy to see that (5.1)-(5.9) hold, which establishes the optimality of y*, z* 
(and x*). Moreover, since the incidence vector of every maximum matching must 
satisfy (5.7)—-(5.9) with respect to this y* and z*, we deduce the following. First, 
by (5.9), only nodes of D can be left unsaturated by any maximum matching. It is 
easy to construct a maximum matching of G deficient at any desired node of D 
(using the Hunga#ian tree structure) so D=D(G) (of the Gallai-Edmonds 
theorem). Since nodes of D are only adjacent to nodes of A, we have A = A(G) 
and also C = C(G). We saw in section 4 that a graph is critical if and only if it is 
shrinkable, implying (4.10). Since we have y7 +y*>1=c,, for all w€ A and 
v€ AUC, (5.7) implies (4.11a and c). Finally, (4.11b) follows from (5.9). Thus 
we have both proved Theorem 4.9 and shown how the maximum matching 
algorithm finds D(G), A(G) and C(G). 

Now we describe how to find a maximum weight perfect matching in a graph. 
From this, it is easy to find a maximum weight matching, which is described 
following the algorithm. 

For the case of perfect matchings, we must make the following changes to our 
linear programming formulations. Condition (5.2) becomes the equation 


x(S(v)) = 1 for all vEV. (5.2') 


Consequently, dual feasibility condition (5.4) and complementary slackness 
condition (5.9) disappear. 

At all times, the algorithm maintains a feasible dual solution y, z. It attempts to 
construct a perfect matching whose incidence vector £ satisfies (5.7) and (5.8). If 
successful, it terminates. If unsuccessful, y and z are changed and the process 
repeats. : 

A family ¥ of subsets of V is called nested if whenever S, TE ¥ satisfy 
SOT #9, either SCT or TCS. Let y, z be a feasible dual solution. We let 
O'={S€EQ:z,>0}. At all times, the algorithm will ensure that ©’ is a nested 
family. Also, we let 


E-={wek:y,+y,+¥ Gu vEesee) =cut. (5.10) 


For every SE 0’, let G* denote the graph obtained from G[S] by shrinking all 
maximal members of ©’ properly contained in $. Then we will have 


G* is spanned by an odd cycle, all of whose edges are in E~. — (5.11) 
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Thus G* — (E\E*) is critical (cf. Theorem 4.2). 


Weighted Perfect Matching Algorithm 5.12. 


Input. 


A graph G = (V, E), a vector c = (c,:e GE) of edge weights. 


Output. Either a set X CV for which |X|<6(G — X) (establishing the nonexist- 


Step 0. 
Step 1. 


Step 2. 


Step 3. 


Step 4. 


ence of a perfect matching) or the incidence vector x* of a maximum 
cost perfect matching and an optimum dual solution y*, z*. 

(Construct feasible dual solution). Let y satisfy y,+y,2c,, for all 
uv € E, let z; =0 for all SEO (i.e., 0’ =8). 

(Construct current equality subgraph). Let E~ be defined as in (5.10). 
Let G=(V,£) be the graph obtained from G~ = (V, E~) by shrinking 
all maximal members of 6’ = {S € 0: z, >0}. (If 0’ =, then G=G~.) 
(Solve maximum cardinality problem on G). Apply the maximum 
matching algorithm of section 3 to G. If a perfect matching is found, go 
to Step 4. If not let D and A be the sets of all nodes of G which are 
equal to or contained in even and odd nodes respectively of the 
Hungarian trees in G. Let C be the other nodes of G. Go to Step 3. 
(Modify the dual solution). Let 


6, =min{z;: SEO’ and S$ is the set of nodes belonging to an odd 
pseudonode of a Hungarian tree} ; 

6, =min{y, + y, —¢,,: ww E E\E ,u€D andvEC}. 9 

6, =min{y, + y, —¢,,: uv E\E™ and u and v are nodes of D which do 
not belong to the same pseudonode of a Hungarian tree} . 


By convention, if any of these sets are empty, we let the corresponding 
6, =. Let 6=min{6,/2, 6,,5,/2}. If 6=« then G has no perfect 
matching as @(G — A) >|A|, and we terminate. Otherwise, let 


y,-6 forvEeD, 
ylL=4qy, +8 forvEea, 
y, forvEC; 


z,+26 if S is the set of nodes 
contained in an even pseudonode , 
: Z;—26 if S€6" is the set of nodes 
contained in an odd pseudonade , 
Zs otherwise . 


N 
an 
Il 


By our choice of 6, the dual solution y’, z’ is feasible. Moreover, any 
edge in the forest or in the current matching will still be in E, 
recomputed for y’ and z’. Replace y, z with y’, z’ and go to Step 1. 

(Extend matching in pseudonodes). Using (5.11), it is easy to see that the 
incidence vector £ of the perfect matching of G obtained can be extended 
to a perfect matching of G, whose incidence vector x* satisfies (5.7) and 
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(5.8). Let y* = y and z* =z. Then x*, y* and z* are the desired optimal 
solutions. Terminate. 


i [t is straightforward to verify that when the algorithm terminates, it has found 
either the required set A CV such that |A| <@(G — A) (Step 3) or has constructed 
optimal y*, z* and x*. What remains to be shown is that the running time is finite, 
and in fact, polynomial in the size of the graph. To show this, we must establish 
an upper bound on the number of times Step 3 (dual change) can be performed. 
Recall that d(G) denotes the deficiency (minimum possible number of unsatu- 
rated nodes) of G. Let G and G’ be the graphs obtained before and after a dual 
variable change. Let D and C be the sets of nodes of G belonging to D(G) and 
C(G) respectively. Let D’ and C’ be defined similarly for G’. We show that 


d(G')<d(G) 
and ° 

if d(G’) = d(G) then |D’| =|D| 
and 

if |D'| =|D| then |C’|>|C]. 


From this it will follow that we can perform at most |V|* dual changes. 
When Step 3 is performed, we have one or more of the following situations: 
(6 =5,/2) A pseudonode, belonging to A, is expanded. 
(5 =5,) An edge joining a node u€ D and vEC enters E. 
(5 = 5,/2) A node joining distinct even nodes u, v of the Hungarian trees enters 


It is easy to see that none of these can increase d(G), so suppose d(G’) = 
d(G). Again, we can still find maximum matchings leaving any node of D(G) 
unsaturated, so |D’| > |D|. We now try to construct a matching deficient at a node 
of G which does not belong to D(G), which will show |D‘| > |D]|. 

. lf 6 =6,/2 we can start with a maximum matching of G deficient at u. If v is 
not saturated, then adding uv to the matching gives d(G')<d(G’). If v is 
incident with an edge uw in the matching then add uv and remove vw. We now 
have a maximum matching deficient at w € A(G’) so |D'| >|D]. 

If 5 =6, then start with a maximum matching of G which leaves u, or the 
pseudonode containing u, unsaturated. Some edge vw is in the matching, where 
v, w€C(G). Now remove uw and add uv. Then wE D’, so |D’|>|D|. 

If 6 =6,/2 then either some node of the expanded pseudonode enters D’ or 
else all go into C. Thus if |D’|=|D| then |C’|>|C|, and we are done. 

This shows that a maximum weight perfect matching can be found by solving 
O(|V|*) maximum matching problems. The algorithm as presented by Edmonds 
(1965b) was slightly different — when a Hungarian tree was found, a dual variable 
change would be made and the algorithm would continue with that tree, which 
would no longer be Hungarian. This resulted in an overall running time of 
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Q(|V|*). Lawler (1976) showed that this could be reduced to O(|V|*) by the use 
of suitable data structures. This is still the best known bound for a weighted 
nonbipartite matching. 

Each time we make a dual variable change, the value of the dual solution 
strictly decreases. All this requires is that we have identified a subset X of V(G) 
such that 0(G — X)> |X|. However if we were to make such changes based on 
arbitrary X rather than A(G), we could not even guarantee finite termination of 
the algorithm for real costs! This even applies in the bipartite case (Araoz and 
Edmonds 1985). 


5.2. Matching polyhedra, dual solutions and facets 


The correctness of the preceding algorithm provides a proof of one of the 
fundamental theorems of polyhedral combinatorics. The perfect matching poly- 
tope P(G) of a graph G is defined to be the convex hull of the incidence vectors of 
the perfect matchings of G. 


Theorem 5.13 (Edmonds 1965b). P(G) is the set of all x E R® satisfying 


x,20 forallecE, 
x(6(v))=1 forallveEeV, 
x(E(S)) <(\S|—1)/2 forall SEO. ] 


Proof. Let P denote the polyhedron defined by the above linear system. We have 
already noted that the incidence vector of any perfect matching satisfies these 
inequalities so P(G) C P. We complete the proof by showing that every vertex of 
P is in P(G), i.e., is the incidence vector of a perfect matching. If P(G) #9 and if 
£ is a vertex of P, then there is a vector c of edge weights such that £ is the unique 
member of P maximizing cx. Apply Algorithm 5.12 to G, with edge weights c. We 
have seen that it will terminate with the incidence vector x* of a perfect matching, 
if one exists, plus an optimum dual solution y*, z*. Since they are feasible and 
satisfy complementary slackness, they are optimal solutions to the linear program 
maximize cx for x€P. Therefore x*=4%. If P(G)=9, then the algorithm 
terminates in Step 3, with 6 =. This means that the objective function of the 
dual linear program can be made arbitrarily negative, so P=@ by the weak 
duality theorem of linear programming. O 


See chapter 30 or Lovdsz and Plummer (1986) for alternate nonalgorithmic 
proofs. 

It is interesting that Theorem 5.13 has provided the basis for a cutting plane 
approach to the maximum weight matching problem, which has proved to be 
successful on quite large problems (see Grétschel and Holland 1986). The idea is 
to begin with a small subset of the inequalities, in this case just the nonnegativity 
and degree constraints. First solve the relaxed problem, obtained by maximizing 
cx subject to these constraints. If the optimum solution is 0-1 valued, then we 
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stop with the incidence vector of a maximum weight matching. If not, we find one 
or more violated constraints of the third type, add them to our constraint set so as 
to obtain a stronger relaxation, and resolve. This is then repeated until an 
optimum matching is obtained. (See chapter 30 for a thorough discussion of 
separation based algorithms.) 

The crucial step in this process is to find a violated blossom inequality (5.3) if 
one exists. This problem was solved by Padberg and Rao (1982), who showed 
how the procedure of Gomory and Hu (1961) for constructing a flow equivalent 
tree (see chapter 2) could be adapted to find a minimum capacity odd cut. 

The previous algorithm also makes it easy to deduce an important result 
concerning discreteness of optimal dual solutions. 


Theorem 5.14 (Edmonds 1965a). If G has a perfect matching and c is integral, 
then there exists an optimum dual solution y*, z* such that 
] 


y*=0 (mod) forallveEV, (5.15) 
z;=0(mod1) forallSEO. 


Proof, Begin the algorithm with z =0 and y integral. Then (5.15) holds, as well 
as the following: 


y, =y, (mod 1) for all u, v€ D(G). (5.16) 


(Since every v € A(G) is adjacent with some u € D(G), we have y, =y, (mod 1) 
for all u, v€ D(G)UA(G).) It is straightforward to see that these properties 
imply that 5, and 4, will be integer and 4, will be half-integer. Therefore 6 will be 
half-integer, which ensures that (5.15) holds for the new solution. Finally, we can 
see that (5.16) continues to hold, for any new node of D(G’) must be joined by a 
path of edges in E~ to a node previously in D(G), so since all costs are integer, 
this gives the result. O 


.This dual discreteness property has useful combinatorial consequences which 
we discuss later. First, however, we note that the problem of finding a maximum 
weight (not necessarily perfect) matching in G reduces easily to -the perfect 
matching case. Let G’ = (V', E’) be a second disjoint copy of G = (V, E) and join 
each v' EV’ to the corresponding v EV. Let each new edge not in E have cost 
zero. Now a maximum weight perfect matching in the expanded graph will induce 
a maximum weight matching in G, since any matching of G can be extended to a 
perfect matching in the extended graph having the same weight. Moreover, 
suppose that y*, z* is an optimal dual solution. Define a new objective function ¢ 
by interchanging the costs of the corresponding edges of E and E’. For any 
SCV UV’, we define S’= {vu EV: v' ES} U {v’ EV’: vES}. Let y’ and z’ be 
the optimal dual solutions for € obtained by letting y; =y* and y;,=y; for all 
i€V, and z,= 25. for all SCV UV’. Then y=y*+y’, Z7=z* +2’ is an optimum 
dual solution to the perfect matching problem on the expanded graph, where 
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c; =¢; for all JEL’ and all new edges not in E’ have weight zero. Moreover, 
y,=y, and Z,=2Z,. for all v€V and SCV UV’. Also, y, z has the same 
discreteness properties as y*, 2*. ; 

Suppose there exists SCV UV’ such that z,>0 and VON S4¥6#V'NS. Let 
T=(VNS)\(V'NS)' and let T=(V'NS)\(VNS)'. One of T, T, say T, has odd 
cardinality. Let W=(SQNV)N(SOV’'). Redefine y and z by letting y, := y, + Z; 
for al vEWUW’, y,:=y, +42, for all UE TUT’, Z,:=2,.:=Z, and Z,:= 
Z,.:=0. Then the new y, z is optimal and has all the discreteness properties of 
the old, but fewer components z,>0 for which S intersects both V and V’. 
Repeating this we can remove all such components, at which point it is immediate 
that the restriction of y, z to G is an optimal dual solution to the maximum 
weight matching problem, which satisfies (5.15). The matching polytope M(G) is 
defined to be the convex hull of the incidence vectors of the matchings of G. We 
obtain the following generalization of Theorem 5.13. 


Theorem 5.17 (Edmonds 1965a). M(G) is the set of all x ER® satisfying 


x,20 foralleEE, 
x(6(v))=1  forallvEV, 
x(E(S)) = (|S]—1)/2 forall SEO. 


This is a generalization since P(G) is the face of M(G) obtained by replacing 
the inequalities x(5(v))<1 for v EV by equations, and all faces of integral 
polyhedra have integral vertices. 

We also obtain a dual discreteness result, analogous to Theorem 5.14, which 
can be strengthened as follows. 


Theorem 5.18. Let G=(V, E) be a graph and let c be a vector of integral edge 


weights. Then there exists an integral optimal solution to the dual problem 
(5.4)-(5.6). 


Proof. ie y*, z* be an optimal dual solution which satisfies (S. 15). Let F={veE 
V: y? 1 mod 1}. Since the value of y*, z* is integer, |F| is even. Choose any 
bEF and let F = F\{é}. Let ye = y? i for all i€ F, yi := y3 +1 and let 
Zp ‘= zp +1. The new solution is the required integral optimal solution. OU 


An identical argument applies to P(G). A linear system such as (5.1)—(5.3) 
which has the property that there exists an integral optimal dual solution, for 
every integral objective function c which has a maximum, is called totally dual 
integral. See chapter 30. Such theorems have useful combinatorial applications. 
For example, suppose we are only interested in finding a maximum cardinality | 
matching. This is equivalent to maximizing cx for x € M(G), where c, = 1 for all 
JEE. Let y*, z* be an integral optimal dual solution. Then (see (5.6)) each 
component must have value 0 or 1. Let H={{v}: yf =1}U{SCV: zz =1}. 
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Then # is an odd set cover and w(#) = max{cx: x © M(G)} = o(G). Thus we 
derive Theorem 1.5 as a special case. 

In general, not all constraints (5.2) and (5.3) are required to define M(G). For 
S € 6, constraint (5.3) is essential if and only if F, = {x € M(G): x(E(S)) = (|S| - 
1)/2} induces a facet of M(G), i.e., has dimension one less than that of M(G). 
(See chapter 30 for definitions of these terms.) This is equivalent to F, containing 
|E| affinely independent points. The essential constraints (5.3) correspond 
precisely to those critical S CV for which G[S] contains no cutnode. 


Theorem 5.19 (Pulleyblank and Edmonds 1974). Let SE@© be such that the 
inequality x(E(S)) <(|S| — 1)/2 is not of the form (5.2), for some v EV. Then this 
inequality is essential for M(G), i.e., Fs is a facet of M(G), if and only if G[S] is 
critical and nonseparable. 


Proof. We descfibe how to construct |E| affinely independent members of F,. It 
will be sufficient to construct |E(S)| independent members of F, all of which are 
zero for all edges of E\E(S). For once this is done, we can consider each 
e © E\E(S) in turn. Since at most one end of ¢ is in § and G[S] is critical, there 
exists a matching of G which contains e and some maximum matching of G[S]. 
The incidence vector of this matching belongs to F,, and since it is the only vector 
x we construct for which x, #0, it is affinely independent of the others. 

By Theorem 4.3, G[S] has a critical ear decomposition Gy, G,,...,G, such 
that each G, is 2-connected. We prove by induction that each G, has |E(G;,)| 
matchings, each of cardinality (|V(G,)|—1)/2, whose incidence vectors are 
affinely independent, which will complete the proof. Since G, is an odd cycle, the 
|E(G,)| maximum matchings of G, suffice for the case i= 0. Now suppose the 
result is true for i<k, and consider G,. Let 7 be the (odd length) ear added to 
G,_, to form G, and let u, v EV(G,_,) be the ends of the ear. 

First, by induction there are |E(G,_,)| independent maximum matchings of 
G,,_,. Each can be extended to a maximum matching of G, by adding the 2nd, 
4th, ... edges of the ear. Then we can obtain |E(2)| — 1 additional matchings by 
considering each w € V(7)\{u, v} and constructing a matching of the ear which 
leaves only w and one of u, v, say u, unsaturated. Since G,_, is critical, we can 
extend it to a maximum matching of G, by adding a perfect matching of G,_, — v. 
This is the only matching constructed so far which does not saturate w, so it is 
independent of all others. Finally, we get our last matching by starting with a 
matching consisting of the 1st, 3rd, ... edges of 7. We combine this with a perfect 
matching of G — v from which the edge incident with u has been removed. This is 
the only matching constructed so far, which does not contain (|V(G,_,)| — 1)/2 
edges of G,_,, so it is independent of the others. We therefore have |E(G,)| 
maximum matchings of G, whose incidence vectors are independent as required. 
The result follows by induction. O 


For another proof, of a more polyhedral nature, see Lovdsz and Plummer 
(1986). Cunningham and Marsh (1978) showed that Theorem 5.18 remains true 
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even if we restrict ourselves to a minimal defining system for M(G). That is, such 
a system is totally dual integral. 

The Birkhoff-Von Neumann Theorem 2.9 also follows directly from Theorem 
5.19. As we remarked, no bipartite graphs are critical, so we have the following. 


Corollary 5.20. If G is bipartite, then M(G) is given by (5.1) and (5.2). 


In general, not all degree constraints (5.2) are necessary either. Let N(v) 
denote the set of all nodes adjacent to v (not including v). If |N(v)| = 1, then the 
inequality x(5(v)) <1 is implied by the constraints x =0 and x(6(w)) <1, where 
N(v) = {w}. If |N(v)|=2, then the inequality x(6(v))<1 is implied by the 
constraints x =0 and x(E(M(v) U {v})) <1, and this latter constraint is essential if 
the two nodes of M(v) are adjacent. Pulleyblank and Edmonds (1974) show that 
in all other cases, the constraints (5.2) are essential. 


5.3. Dimension and the matching lattice 


The matching polytope M(G) is of full dimension, since all unit vectors plus the 
zero vector belong to it. However, it is a nontrivial problem to compute the 
dimension of P(G), the perfect matching polytope. For a bipartite graph G = 
(V, E), an upper bound is |E| — |V| — 1 and, using the ear decomposition theorem 
4.1 it is easy to show that this is the correct answer, if G is matching coyered. 

If G is nonbipartite, then let 0’ = {S € ©: every perfect matching of G contains 
(|S| —1)/2 edges of G[S]}. We say that a nested subfamily ¥ of ©’ has the odd 
cycle property if for each SE ¥ U {V}, wen we shrink all maximal members of # 
contained in $, the resulting graph is nonbipartite. Naddef (1982) showed that all 
maximal such nested ¥ with the odd cycle property have the same cardinality 
r(G) and proved the following. 


Theorem 5.21. Let G=(V,E) be a matching covered nonbipartite graph. The 
dimension of P(G) is |E|— |V|-r(G). 


Edmonds et al. (1982) showed that the dimension was determined by the 
number of bricks in the brick decomposition, described in section 4. 


Theorem 5.22. Let G=(V,E) be a matching covered nonbipartite graph. The 
dimension of P(G) is |E|-—|V|+1—k, where k is the number of bricks in the 
brick decomposition of G. 


Another algebraic structure related to the perfect matching polyhedron is the 
matching lattice of a graph, introduced by Seymour (1979). It consists of all 
integer linear combinations of the incidence vectors of perfect matchings. A tight 
cut in a matching covered graph G = (V, E) is a cut 6(S) for $#S CY, |S| odd, 
such that every perfect matching contains exactly one edge of 6(S). For each 
v EV, d(v) is a trivial tight cut, but there are often others for which 3 <|S| <|V| — 
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3. Clearly, for every vector x in the matching lattice, there exists an integer k such 
that x(6(S)) =k for every tight cut. In the case of bipartite graphs this character- 
izes the matching lattice, even when we restrict ourselves to the trivial tight cuts. 
However, for nonbipartite graphs the situation is more difficult. 

For example, suppose that G is the Petersen graph, and we define x, to be 0 for 
the edges j belonging to two disjoint pentagons and 1 elsewhere. Then x satisfies 
the above property but is not in the matching lattice. Lovasz showed that in a 
sense this was the only exception. (Recall that the brick decomposition procedure 
of section 4 constructs a nested family ¥ of odd cardinality subsets of V such that 
6(S) is a tight cut for every SES.) 


Theorem 5.23 (Lovasz 1987). Let G be a matching covered graph, w an integral 
vector defined on the edges of G and & the family constructed by the brick 
decomposition. Then w is in the matching lattice of G if and only if 
(a) w(C) is te same value for every C= 6(v) for v EV or C= 6&(S) for SEF; 
(b) for every brick which is a Petersen graph and for every pentagon in that 
brick, the sum of the entries corresponding to the edges of G mapped onto this 
pentagon is even. 


This also enabled Lovasz to describe a polynomial algorithm for constructing a 
GF(2)-basis of the incidence vectors of the perfect matchings of G, and so 
determine the GF(2)-rank of this set. 


6. Variations and extensions 


A remarkable feature of matching is its ‘‘self-refining’’ property. There are many 
combinatorial problems which appear more general and yet can be reduced to a 
matching problem. At the same time, several innocent appearing extensions of 
matchings turn out to be NP-complete. We discuss a variety of such problems in 
this section. 


6.2. f-factors 


Let f = (f,: v EV) be a vector of positive integers. An f-factor is a set.F of edges 
of G such that |8(v) N F| =f, for fall v EV. If f, = 1 for all v EV, then it is simply 
a perfect matching, sometimes called a 1-factor. If f, =2 for all v, then F is the 
edge set of a family of node disjoint simple cycles on G which contain every node, 
commonly called a 2-factor. 

Tutte (1954) showed that by node splitting, an f-factor problem could be 
transformed into a perfect matching problem. Each node v of G is replaced by a 
set B(v) of f, new nodes, such that the sets B(v) are pairwise disjoint. Each edge j 
is replaced by two new adjacent nodes u, and v, such that, if u, v are the nodes 
incident with j, then u, is adjacent to all nodes ‘of B(u) and vu, is adjacent to all 
nodes of B(v). See fig. ‘6.1. There is a natural correspondence between matchings 
in the new graph and f-factors in G. If we wish to solve a weighted f-factor 
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3 


Figure 6.1. Transformation of f-factor problem to perfect matching problem. 


problem, we define c/,,=c,,,, for all w€ B(u), for all EE, and c, =0 for all 
other new edges. (Here c is the original vector of edge costs.) The weights of the 
f-factors and the corresponding perfect matchings will agree. 

Now we describe Tutte’s f-factor theorem, which can be deduced, although not 
trivially, from the above construction. Let G = (V, E) be a graph and let f=(f,: 
uv EV) be the vector of positive integral degree constraints. Let SCV and let 
T CV\S. For each v ET, we let d> denote the degree of v in G — S. We let 
Q(S, T) = ¥ (d’ -f,: v€T). Finally, we consider the graph G — (S UT) and let 
Q(S,T) denote the set of components of G—(S UT) for which, if K is the 
nodeset, then f(K) + |5(K) M6(T)| is odd. (Note that each such K, plus the edges 
joining K to T corresponds to an induced subgraph of G.) 


Theorem 6.1 (Tutte 1952, see also Cook and Pulleyblank 1987 and LoWisz and 
Plummer 1986). The graph G=(V, E) has an f-factor if and only if 


b(S) + Q(S, T) =|B(S, T)| forall SCV, forall TCV\S. 

This result was further generalized by Lovasz (1970). Instead of specifying a 
single “target” degree f, for each node v, he permitted the specification of a set 
F(v) of possible degrees. He defined a gap to be any maximal sequence of 
consecutive integers not in F(v), but for which the lower and higher values are in 
F(v). Then he generalized Theorem 6.1 to the case that no gap has length greater 
than one. This includes, for example, the f-factor problem and the case that a 
consecutive interval of degrees is permitted, for each v € V. We will see later that 
if gaps of length two or more are permitted, then the problem of existence of an 
appropriate factor becomes NP-complete. Recently Cornuéjols (1988) developed 
a polynomially bounded extension of the blossom algorithm for this problem 
when no gap of size greater than one is permitted. However, the corresponding 
weighted problem has not yet been solved. 

Edmonds and Johnson (1970) gave a linear system sufficient to define the 
convex hull of the incidence vectors of the f-factors. It is a special case of the 
more general Theorem 6.2 presented in the next section. 

Suppose we have nonnegative integers ¢,<f, defined for each node uv. A 
(g, f)-factor is a set F CE such that g, <|FN 8(v)|<f, for all ve V. Hell and 
Kirkpatrick (1993) describe efficient methods for finding (g, f)-factors of mini- 
mum deficiency. 
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6.2. Capacitated and uncapacitated b-matchings 


Another generalization of the f-factor problem is the capacitated perfect b- 
matching problem. In this case, each node v has a positive integral demand b, 
and each edge j has capacity a,. It is desired to assign an integer x, satisfying 
0 <x, <a, to each edge j such that x(6(@)) = b, for all vu EV. (Alternatively, we 
may require x(&(v)) = b, for some nodes v EV). The f-factor problem is just the 
special case that a, = 1 for all j &E and f= b. However, the b-matching problem 
can also be reduced to the f-factor problem by replacing each edge with a, parallel 
edges and again letting f= 5. Theorem 6.1 generalizes easily to this case; we 
define d* to be the sum of the capacities of edges of G — S incident with v and we 
let BS, T) consists of those K for which b(K) + a(6(K)M&(T)) is odd. Edmonds 
and Johnson described how to generalize the weighted matching algorithm to 
handle this problem and (see also Ardoz et al. 1983) showed that the matching 
polyhedron theorem can also be generalized to this case. 


Theorem 6.2 (Edmonds and Johnson 1970). The convex hull of the incidence 
vectors of the capacitated perfect b-matchings of G is the solution to the following 
system. 


O<x,<a, forall jEE (6.3) 
x(6(v))=6, forallvEV (6.4) 


b(S)+a(J)-1 forall SCV and J C&S) such that 
MES) RIO) tS) a(S) odd 
(6.5) 


(It can also be shown that if we replace (6.4) with x(5(v)) < b,, we get the convex 
hull of the “‘not necessarily perfect” capacitated b-matchings.) 

If all capacities are infinite, we get a simpler generalization of the matching 
problem. We wish to assign a nonnegative integer x, to each edge j of G = (V, E) 
in such a way that x(5(v)) = b, for all vy EV, and cx is maximized, for a vector c of 
edge weights. This problem is called’ the maximum weight (uncapacitated) perfect 
b-matching problem. The characterization of when a perfect uncapacitated b- 
matching exists is much simpler than Theorem 6.1. 


Theorem 6.6 (Tutte 1954). A graph G=(V,E) has a perfect uncapacitated 
b-matching if and pry if 


b(S) = b(@(G — S)) + |0,(G—S)| forall SCV. 


Here 6(G — S) is the set of nodes of G which are singleton components of G — S 
and 6,(G — S) is the set of nodesets K of components of G — S which contain at 
least three nodes and for which b(K) is odd. 

Theorem 6.2 also has a simpler analogue in this case. 
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Theorem 6.7 (Edmonds and Johnson 1970, see also Pulleyblank 1973). The 
convex hull of the uncapacitated perfect b-matchings of G is the solution to the 
following system: 


O<x, forall jEE, 
x(5(v)) = 5, = allvEV, 


x(E($)) < 22-1 


for all SCV such that b(S) is odd. 
See Cook and Pulleyblank (1987) for a discussion of dual integrality results for 
b-matching polyhedra. 


6.3. Bidirected matching problems 


A still more general problem, which also reduces to this problem is the matching 
problem on a so-called bidirected graph. In this case, each edge has one or two 
“ends”, each of which may be designated as a “head” or a “‘tail’’. Some edges, 
called arcs, have a head and a tail and others, called links, have two heads or two 
tails. A slack has one end, which may be either a head or a tail. For each node v, 
let 5 *(v) denote the set of edges having a tail incident with v and let 5 (v) denote 
the set of edges having a head incident with v. A b-matching is a vector of 
nonnegative integers which satisfies 9 


x(5*(v))- x(6~(@))=5, forallvEV, (6.8) 


where b, may be positive, negative or zero. A capacitated b-matching must also 
satisfy (6.3). Note that the presence of slacks which may have a head or a fail 
permits us to effectively replace the “=” in (6.8) with “=<” or “2”. Again we 
may ask whether a perfect (capacitated) b-matching exists, or ask for one which 
maximizes (or minimizes) )),<; ¢; x;, where c is a vector of edge weights. 

A 6-matching problem on a bidirected graph can be transformed to a b- 
matching problem on an undirected graph by node splitting. Basically, each node 
v is replaced with copies v" and v_, and all edges having a head incident with v 
are made incident with v’, and a tail with v*. Then v* and v are joined with a 
new edge and we define b,- = B,, b,. = B, +b,, where B, is some value larger 
than the maximum possible values of x(5*(v)), for all b-matchings. (If there are 
capacities, then B, is trivially obtained. If not, it can be shown that setting 
B,=% (jb): 6€ Vv) for all v EV is sufficient.) Finally, a new node w* is created, 
and all slacks are given a second end incident with w*. We can define b,. = 
% (|o,|: i€V), and if a perfect b-matching problem is desired, attach a loop to 
w*. (See fig. 6.2.) 

If capacities and/or weights are present they are transferred to the corre- 
sponding edges of the new graph, and all new edges are given zero cost and 
infinite capacity. (See Ardoz et al. 1983 or Lawler 1976 for more details.) 


Theorem 6.9 (Edmonds and Johnson 1970, see Ardoz et al. 1983). The convex 
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Figure 6.2. Transformation from bidirected to undirected. 


hull of the b-matchings of a bidirected graph G = (V, E) with integral capacities a, 
and integral degree constraints b is given by (6.3), (6.8) and 


x(8(S)\J) — x) =1-alJ), for SCV, J C&S) such that 
” b(S) + a(J) is odd . (6.10) 


Note that the constraint (6.10) can be put in a form analogous to that of (6.5) 
by adding one half of the sum of the degree constraints (6.8). This theorem also 
permits edges to have infinite capacities, i.e., to be uncapacitated. In this case 
a, =~ and so such edges can never appear in sets J in the above inequalities. 


6.4. Edge covers 


A special case of the bidirected matching problem is the minimum edge cover 

problem. In this case we wish to select a minimum cardinality or weight subset of 

the edges of G=(V, E) such that each node is incident with at least one. To 

reduce this to the bidirected b-matching problem we make each edge a link with 

two tails, and capacity 1, define 6, =1 for all iG V and required “=” in (6.8). 
Theorem 6.9 specializes to the following: 


Theorem 6.11. The convex hull of the incidence vectors of the edge covers of a 
graph G =(V,E) is given by 


O<x,<1 forall jEE, 
|S] +1 
2 
(It does require a simple argument to show that those constraints (6.10) with 


J #§ are redundant in this case.) See also Gamble and Pulleyblank (1989), Murty 
and Perin (1982) and White and Gillenson (1975). 


x(E(S)) + x(5(S)) = for all SCV such that |S| is odd . 


6.5. “T-joins and the Chinese postman problem 


Let TCV have even cardinality. A T-join is a set F C E such that IFN &(v)| is 
odd if v€ T and even if vEV\T. If |V| is even and T=V, then a minimum 
cardinality T-join is a perfect matching, if one exists. An application of T-joins is 
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provided by the Chinese postman problem: Suppose we are given a graph 
G =(V, E) and a vector c of edge weights. We wish to find a closed walk in G 
which passes through each edge at least once, and for which the sum of the edge 
costs is minimized. If G is Eulerian, then any Euler tour will suffice. Otherwise, 
we must find a minimum cost set of edges which when duplicated will yield an 
Eulerian graph. This is just a minimum cost 7-join, in the case that we let T be 
the set of odd degree nodes of G. 

Edmonds and Johnson (1973) proposed two different methods for solving the 
minimum cost 7-join problem. In the first, which works when the sum of the costs 
is nonnegative for every cycle, a minimum cost path is computed between each 
pair (u, v) of nodes in T — let d,,, be the cost of this path. Then a minimum weight 
perfect matching is computed for the complete graph with nodeset T, where we 
let d,,, be the cost of the edge uv. The edge sets of the corresponding paths form a 
minimum weight T-join. 

The second method reduces the problem to a single capacitated b-matching 
problem. For each v €V, define 6, to be the largest odd integer less than or equal 
to |S(v)| if v€T, and the largest even integer less than or equal to |8(v)| if 
uv EV\T. Define the capacity a, to be one for each edge j of E. Finally, add a loop 
i, having both ends equal to v, for all v EV. We let I, have infinite capacity. 
(Purists may wish to subdivide each of these loops twice, introducing new nodes w 
where b,, = |b, /2], to remain simple.) There is a bijection between 7-joins in G 
and perfect b-matchings in the expanded graph. 9 

It is worth noting that by lowering b, for some node v by some even amount, 
we effectively impose an upper bound on the degree of uv in the 7-join. By 
making the capacity of the loop /, equal to & for some integer k, we are setting 
the minimum allowable degree of node vu in the T-join to be b, — 2k. Note too 
that this reduction enables us to solve the minimum weight 7-join problem for 
positive or negative edge weights. (This general case can also be reduced to the 
case of positive edge weights — see Lovasz and Plummer 1986). 

The shortest path problem in an undirected graph is a special case of the 7-join 
problem. If we wish to find a shortest path joining nodes s, t, we define T = {s, t} 
and construct a minimum weight 7-join. Note that this will work even for 
negative weight edges. If there are no negative weight cycles, then the optimum 
solution will consist of a shortest s—t-path, plus, possibly, some zero weight 
cycles. If there are negative weight cycles, then at least one will show up in the 
minimum weight 7-join. (Note that the simpler method of solving this problem — 
replace each edge with two parallel, oppositely directed arcs each having the 
same weight as the edge and use Dijkstra’s directed graph algorithm — only works 
if all edge weights are nonnegative.) 

A T-cut is a cut 6(S) in G, for SCV such that |SMT| is odd. Clearly 
|F N8(S)| 1 for every 7-join F, and T-cut 5(S) of G. A double packing of 
T-cuts is a family of 7-cuts such that each edge is used at most twice. 


Theorem 6.12 (Edmonds and Johnson 1973). The minimurn size of a T-join 
equals one half the maximum number of sets in a double packing of T-cuts. 
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Suppose that each edge j of G has a nonnegative integral weight c,. Replace j 
with a path of length c,. (If c,, = 0, then identify uv. The resulting node is in T if 
and only if both or neither of wu, v were in 7.) Then a minimum weight T-join in 
G corresponds to a minimum cardinality 7-join in the new graph. Similarly, if we 
allow each edge of G to appear in at most 2c, T-cuts, then a maximum such 
capacitated packing of T-cuts in G corresponds to a maximum double packing in 
the new graph. Thus we obtain the following. 


Theorem 6.13 (Edmonds and Johnson 1973). The convex hull of ail vectors 
x€R?* such that x > u, where u is the incidence vector of a T-join is given by the 
system 


x(JJ)21 for every T-cut J of G. (6.15) 


Moreover, if c is integral, then there exists an optimal solution to the dual linear 
program of minimizing cx subject to (6.14), (6.15) for which each component is 
half-integer valued. 


Seymour (1981) showed that for bipartite graphs, a stronger result is true: The 
minimum size of a T-join is equal to the maximum number of 7-cuts which can be 
packed. Moreover, in Theorem 6.13, we can find an integer optimal dual solution. 
Further, by subdividing edges, Seymour’s extension can be seen to be valid 
whenever c is integral and has even sum around every cycle. 

Seymour (1981) also showed that if we restrict the set of nodes which can 
comprise the set 7, then we can obtain another integral result. We say that 
G=(V, E) is T-contractible to K, if there exists a partition V, UV, UV, UV, of V 
such that, for each i, G[V,] is connected and |T  V,| is odd, and each pair of parts 
of the partition is joined by an edge. 


Theorem 6.16. if G is not T-contractible to K,, then the minimum size of a T-join 
equais the maximum number of pairwise disjoint T-cuts. 


An easy corollary of this is that the integral result holds for series parallel 
graphs (graphs not contractible to K,) for all choices of T. Gerards (1992) showed 
that the integral result also holds for all choices of T for a class of graphs which 
properly includes all bipartite graphs and all series-parallel graphs, namely those 
graphs which do not contain an odd subdivision of K,. (An odd subdivision of a 
graph H is a subdivision G of H such that circuits in G coming from odd (even) 
circuits in H are odd (even) circuits in G.) 

Seb6 (1990) showed how the Gallai-Edmonds theorem can be extended to the 
case of T-joins. He also developed the connections between T-joins and so-called 
conservative weightings of graphs. A vector of edge weights is said to be 
conservative if the sum of the weights is nonnegative for the edgeset of every 
cycle. If we consider the special case that all weights are 1 or —1, then a weighting 


218 W.R. Pulleyblank 


is conservative if and only if the edges assigned the value —1 form a minimum 
cardinality T-join for some set T. 

Frank (1993) considered the problem of finding a conservative (+1, —1)- 
weighting for which the sum of the weights was minimized. This is equivalent to 
finding an even cardinality set T of nodes for which the size of a minimum 
cardinality 7-join is maximized. He established the following surprising con- 
nection between this problem and the problem of finding an ear-decomposition of 
a 2-edge connected graph which now permits even length ears, but uses as few of 
them as possible. (It is well known that a graph has an ear-decomposition, using 
both even and odd length ears, if and only if it is 2-edge connected.) 


Theorem 6.17 (Frank 1993). The minimum of w(E), taken over all conservative 
(+1, —1) weightings of a graph G=(V,E), equals the maximum number of odd 
length ears in an ear-decomposition of G. 


Let G =(V, E) be a planar graph and let G°=(V",E?) be the planar dual of 
G. Then F CE is a cut in G (i.e., F = 5(S) for some S CV) if and only if the set 
F? of corresponding edges of G” decomposes into the union of circuits, i.e., has 
even degree at every node. The general problem of finding a minimum weight cut 
in a graph (when negative weights are permitted) is NP-hard, but this shows that 
for planar graphs it can be solved polynomially by solving a minimum weight 
T-join problem in the dual (Hadlock 1975). See Barahona (1990) forga com- 
prehensive discussion of the connections between T-joins, the maximum cut 
problem and multicommodity flows. 


o 


6.6. Fractional matchings, 2-matchings and Kénig-Egervary graphs 


One additional special case of b-matching is the uncapacitated 2-matching 
problem. In this case, each edge of G is to be assigned an integer x, € {0, 1, 2} 
such that x(5(v)) =2 (or x(6(v)) <2) for all vEV. In this case, it follows from 
Theorem 6.2 (take a, =2 for all jE) that no constraints (6.5) exist, so any 
extreme solution ¥ to {x =0, x(6(v)) =2 for all v€V} will be integer valued. 
(This is also easy to show directly.) Therefore the edges for which %,=1 form 
pairwise disjoint circuits in G, and since < is extreme, each has an odd number of 
edges. Thus a perfect (uncapacitated) 2-matching in G corresponds precisely to a 
set of pairwise disjoint odd circuits and K,’s which contain every node. In this 
case Tutte’s Theorem specializes to the following. 


Theorem 6.18. G=(V,E) has a perfect 2-matching if and only if, for every 
X CV, G-X has at most |X| singleton components. 


We say that a graph G = (V, E) is 2-bicritical if G — v has a perfect 2-matching 
for all yGV. These graphs were introduced when studying a certain integer 
programming formulation of the maximum stable set problem, described in the 
next section. They can be characterized in terms of stable sets as follows. 


Matchings and extensions 219 


Theorem 6.19 (Pulleyblank 1979). For a graph G=(V,E) the following are 
equivalent: 
(i) G ts 2-bicritical; 
(ii) | <|N(W)| for every stable set I, 
(iii) G is nonbipartite, and for every j € E, there is a perfect 2-matching x for 
which x, > 0. 


We let N(J) denote the set of nodes not in /, but adjacent to one or more 
members of /.) 

These graphs have ear decompositions similar to those described in section 4. 
See Bourjolly and Pulleyblank (1989). 

Suppose we wish to study the stable sets of a graph from a polyhedra) point of 
view, in a manner similar to the way we have considered matchings. For any 
graph G=(V,E), we let S(G)CR"” denote the convex hull of the incidence 
vectors of the sfable sets of G. We call 5(G) the stable set polytope of G. The 
incidence vector x of every stable set satisfies the following constraints: 


x20, (6.20) 
x,+x,S1 forevery wEE. (6.21) 


The following is easy to prove. 


Proposition 6.22. S(G) = {x ER": x satisfies (6.20) and (6.21)} if and only if G is 
bipartite. 


In general, a basic optimum solution £ to the problem of maximizing cx, 
subject to (6.20) and (6.21) will have components receiving values 0, 4 and 1. 
Nemhauser and Trotter (1974) showed that there always exists a stable set S in G, 
for which c(S) is maximum and such that S$ contains all nodes v for which £, = 1 
and none for which £, = 0. Pulleyblank (1979) showed that the 2-bicritical graphs 
are precisely those for which the unique optimum solution < to maximizing cx, 
subject to (6.20) and (6.21), has %, =4 for alli v€V. Further, almost all graphs 
(under the Erdés—Rényi model) are 2-bicritical. This was also shown to hold for a 
model of near regular random graphs by Grimmett and Pulleyblank (1985). See 
also Grimmett (1986). 

Sometimes 2-matchings are studied as fractional matchings — vectors xeER® 
Satisfying x > 0 and x(6(v)) = 1 for all v EV. Since x is a fractional matching if and 
only if 2x is a 2-matching, these concepts are equivalent. See Balas (1981) and 
Uhry (1975) for the relationship between fractional matchings and matchings. 

Problems involving 2-matchings are much easier than the corresponding 
problems for 1-matchings. In fact, by node-splitting, 2-matching problems can be 
transformed to 1-matching problems in bipartite graphs. It is also easy to modify 
io of section 5 to this case, see, e.g., Bourjolly and Pulleyblank 

9). 
Recall that G was said to be a K6énig-Egervary graph if p(G) = v(G). Such 
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graphs were characterized by Deming (1979), Sterbou! (1979) and Lovasz (1983). 
See also Korach (1982). It is surprising though that they can be characterized in 
terms of 2-matchings. We say that a (not necessarily perfect) 2-matching x is 
maximum if x(E) is maximum, over all 2-matchings. Let E” be the set of edges J 
of G =(V, E) for which x, >0 in some maximum 2-matching. 


Theorem 6.23 (Lovdsz and Plummer 1986). G is a Kénig—-Egervary graph if and 
only if G =(V, E’) is bipartite. 


For extensions of this, see Bourjolly and Pulleyblank (1989). 

For further results concerning the structure of 2-matchings, see Miihlbacher et 
al. (1983) and Pulleyblank (1987). 

Finally, note that the maximum 2-matching problem can be viewed as the 
problem of packing edges and odd circuits in a graph. Cornuéjols et al. (1982) 
and, independently, Hell and Kirkpatrick (1984) show that the problem of 
packing edges and any family of critical subgraphs can be solved polynomially. 
See also Cornuéjols and Pulleyblank (1983) and Cornuéjols and Hartvigsen 
(1986). Hell and Kirkpatrick (1983, 1984, 1986) show that in many other cases, 
the problem of determining whether G has a packing of edges plus subgraphs 
from a set # is NP-hard. 

F a 

6.7. Some NP-hard extensions 
We close this section by observing that many “natural” extensions of matching are 
NP-hard. One of the first problems to be shown to be NP-complete (Karp 1972) 
was the so-called 3-dimensional matching problem. Here we are given disjoint sets 
1, J, K such that |J| = |J| =|K| =n, and a set ¥ of triples from J x J x K. We wish 
to determine whether there exists ¥’C ¥ such that each element of /UJUK 
appears in exactly one triple. 

This can be modelled as a matching problem, with one additional restriction. 
(See fig. 6.3.) Construct a node for each element of JU JU XK; construct three 
nodes f,, t;, ty for each (¢,, t,, t,) €¥ and join each #, to the node corresponding 
to the member of /, etc. Then there exists a 3-dimensional matching if and only if 


Figure 6.3. 
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the resulting graph has a matching which saturates all nodes corresponding to 

IUJUK, which also uses 0 or 3 edges of 5({£;, t;,tx}) for each (f,,8,,1,) ES. 

This construction shows that the following problems are NP-complete. 

e Given G=(V, £) and a set F, of allowed degrees for each vu EV, does there 
exist F C E such that |F N S(v)| € F,, for all vu EV? (Identify ¢,, t,, ty to form a 
new node ¢ and define F, = {0, 3}, for all (t,,t,,t, EF). (Recall that Cornu- 
éjols 1988 showed that this could be solved polynomially if gaps of length two 
are not allowed.) 

® Given (bipartite) G=(V,E) and a partition of E, does G have a perfect 
matching which, for each set in the partition, uses all or no members? 
Suppose we construct a bipartite graph with nodeset / U J as follows. For each 

(t;,;, tx) EF, we join the corresponding nodes of J and J and ‘‘colour” the edge 

with colour t,. Then a 3-dimensional matching corresponds to a perfect matching 

on this graph, which uses one edge of each colour. Thus the following is also 

NP-complete. ~ 

® Given (bipartite) G = (V, E) and a colouring of E, does G contain a perfect 
matching with one edge of each colour? 


7. Determinants, permanents and Pfaffians 


Let G =(V, UV,, E) be a bipartite graph, with |V,|=|V,|. Construct the matrix 
P(G) =(p,,: #EV,, v EV,) as follows: 


=\5 ifuvEE, 
Pu = VQ ifuvZE. 


The permanent of a matrix A =(a,,; 1<i<n, 1<j <n) is defined by 


per(A) = x @12(1)420(2) °° @nm(n) > 


where IT, is the set of all permutations of {1,2,..., 2}. Now consider per(P(G)). 
Each term in the summation will be 1 if all the factors p,,, in the term are 1, and 
zero otherwise. Therefore, the term is 1 if and only if the corresponding edges 
form a perfect matching in G. Hence per(P(G)) 0 if and only if G has a perfect 
matching. Moreover, per(P(G)) = @(G), where @(G) denotes the number of 
perfect matchings of the graph G. 

However, this fact does not help particularly in determining whether a graph 
has a perfect matching or in computing ®(G), for computing per(A) is #P- 
complete (Valiant 1979), and, indeed, computing the number of perfect match- 
ings of a bipartite graph is #P-complete. (This implies that the problem is at least 
as hard as any NP-complete problem. See chapter 29.) Nevertheless, upper and 
lower bounds obtained on the permanents of 0-1 matrices do yield important 
results bounding @(G) for the case of regular graphs. See Lovasz and Plummer 
(1986, chapter 8) for details. 
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Recall that the determinant of matrix A is defined as 


det(A) = > sign(a)a19(1;420(2)"**Fnw(ny » 

where sign(7) is 1 if w is an even permutation and —1 if a is odd. We can 
compute det(A) efficiently and |det(A)|<|per(A)|. However, if cancellation 
occurs, it will be the case that |det(A)| < |per(A)|. If each nonzero entry in A is a 
distinct algebraic indeterminate, then no cancellation can occur, and det(A) ~ 0 if 
and only if per(A) 40. But in this case, we do not know how to compute det(A) 
polynomially. Nevertheless, this observation provides the basis for so-called 
randomized algorithms, which we discuss later, in a more general setting. 

Now we consider the case of general graphs. We require the following 
definition. Let B = (6,,) be a skew symmetric matrix of dimension p = 2n. (That 
is, 6, =—b, for all i,j.) Let A be the family of all partitions {{i,, j,}, 


{i,, ‘ame ie i} of the set {1,..., 2} into pairs. For each member A= 
(Liss fads Liar fabs «+++ figs jn}} Of Ay define 
b, = sign((i,, j1, 425 jos oo shay dn))D; Dg ck 


iy Diy, 


Note that since B is skew symmetric, the value of b, is independent of the order 
we write down the classes of the partition or the members of each class. The 
Pfaffian of B is defined by 9 


pf(B)= > b,. 
AGA 
We need the following identity from linear algebra. . 
Theorem 7.1. If B is a skew symmetric matrix, then det(B) = (pf(B))’. 


(If B is of odd order, then det(B) =0 and we take pf(B) to be 0.) 

Let G be an arbitrary simple graph. Let G be any orientation of G, that is each 
edge uv of G is replaced with either arc (u,v) or (v,u). The skew adjacency 
matrix of G is defined as A;(G)= (a, &, vu EV) where 


1 if(u,v)EE(G), 
a, =)-1 if (v,u)EE(G), 
0 otherwise . 


Now pf(A,(G)) sums over each possible pairing of the nodes of G, and the 
corresponding term is nonzero if and only if the pairing corresponds to a perfect 
matching of G. Again, however, there is the possibility of cancellation, which can 
be handled in two ways. Let A,(G, x) be obtained from A,(G) by replacing each 
1 or —1 with a distinct real variable x or —x, respectively. Note that pf(A,(G, x)) 
is a polynomial of degree at most |V|/2, in at most |V|’ variables. Therefore we 
obtain the following. 
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Theorem 7.2. Let G be any graph and let G be any orientation of G. Then G has a 
perfect matching if and only if pf(As(G,x)) is not identically zero. 


‘ This criterion provided the basis for Tutte’s Original proof of Theorem 3.2. 

Suppose that we could obtain and orientation G of G such that all terms in the 
expansion of pf(A s(G)) had the same sign. Then no cancellation could occur, and 
|pf(A 3(G))| would equal ®(G). A graph is called Pfaffian if such an orientation, 
called a Pfaffian orientation, exists. 


Kasteleyn’s Theorem 7.3 (Kasteleyn 1963, 1967, see Lovdsz and Plummer 1986, 
chapter 8). Every planar graph has a Pfaffian orientation. Such an orientation can 
be constructed in polynomial time. 


Proof (Sketch). Assume that G is matching covered. Let G be an orientation of 
G, let F be a perfect matching of G, let C be the edge set of an alternating circuit 
with respect to F and let F’ = F AC. Then the terms of pf(A,(G)) corresponding 
to F and F’ have the same sign if and only if C has an odd number of arcs 
oriented in each direction. So a sufficient condition of G to be a Pfaffian 
orientation is that every even circuit of G contain an odd number of arcs in each 
direction. 

If G is planar, we can use an inductive construction to ensure that every face 
boundary has an odd number of arcs oriented in a clockwise direction. This 
implies that the orientation has the desired properties. O 


Little (1974), see also Little (1973), showed that this could be extended to the 
case that G contains no subdivision of K, ,. Barahona (1981) gave a method for 
computing #(G) for a toroidal graph, based on Theorem 7.3. 

It is trivial to show that a given orientation is not Pfaffian — exhibit two perfect 
matchings whose symmetric difference has an even number of arcs in each 
direction. However, in spite of various characterizations known (see Lovasz and 
Plummer 1986), none show that the problem is in NP, i.e., that there is a short 
way of verifying that a given orientation is Pfaffian. Lovasz’s procedure for 
constructing a GF(2)-basis of the perfect matchings does enable us to find an 
orientation G of G such that G is a Pfaffian graph if and only if G is a Pfaffian 
orientation. Recently Vazirani and Yannakakis (1989) showed that determining 
whether a given orientation is not Pfaffian is equivalent to determining whether a 
digraph has a (simple) dicircuit, containing an even number of arcs. This latter 
Problem is a quite well-known problem whose complexity has not been de- 
termined. (See Klee et al. 1984 and Thomassen 1986.) 

Theorem 7.3 also has interesting algorithmic consequences. First, if 
pf(A ,(G, x)) is not identically zero, then there are real vectors x such that 
Pf(A ;(G, x)) #0. Suppose we evaluated it, for a random value of x. If the result 
was nonzero, we would know that G had a perfect matching. If the result was 
zero, then either G had no perfect matching, or we made an unlucky choice of x. 
This is an example of a randomized polynomial algorithm (Lovasz 1979) — an 
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algorithm which runs in polynomial time, but may with “‘small’” probability give a 
“NO” answer when the answer should be “YES”. In this case, for example, if 
each component of x is chosen uniformly from {1,2,... , 20}, and |£| = 100, then 
the probability of accidental cancellation, i.e., an error, is less than 107'°. (See 
Lovadsz and Plummer 1986.) 

Moreover, this approach is well suited to parallel! models of computation. (See 
chapter 29.) We can evaluate the determinants of |V|/2 +1 matrices in parallel. 
(See Mulmuley et al. 1987 for a discussion of these issues.) This procedure only 
determines whether a perfect matching exists, it does not actually construct one. 
This latter problem could be solved by sequentially solving a perfect matching 
existence problem for each edge of G. If G — e has a perfect matching delete e. If 
not, add e to the matching and delete all other edges incident with each end node. 
However, such a serial procedure is not acceptable when operating with a parallel 
model of computation. Mulmuley et al. (1987) show that this construction 
problem can be reduced to the problem of inverting a single matrix. They use a 
lemma which shows that if we independently assign to each edge of a graph a 
random weight between 1 and 2|E|, then the probability is at least 0.5 that the 
minimum cost perfect matching is unique. 

Suppose now we independently randomly orient the edges of a graph G so that 
each edge is equally likely to get either of the two possible orientations. Let G be 
the corresponding randomly oriented graph. If we were then to compute 
|pf(A,(G))], we would expect there to be large amounts of cancellatidh and so 
the value would be far from ®(G). However, somewhat surprisingly, the 
expected value of the determinant of this matrix has magnitude exactly @(G). 


Theorem 7.4 (Lovdsz and Plummer 1986). Let G be a random orientation of a 
graph G, obtained by orienting each edge independently, with equal probability, in 
either direction. Then the expected value of |det(A,(G))| is ®(G). 

It can also be shown that the variance of |det(A ,(G))| is © 2°") where the 
sum is over all pairs M,, M, of distinct perfect matchings of G and a(M,, M,) is 
the number of (alternating) cycles contained in the symmetric difference of M, 
and M,. If the variance divided by ®7(G) is a polynomial in the size of G, then 
we can efficiently estimate ®(G) by repeatedly computing \det(A ,(G))|, for 
random orientations of G. This is true, for example, for complete graphs, but at 
present few more interesting classes are known. 

The last topic of this section is the exact matching problem. We are given a 
graph G=(V,£E), an integer k and a set RCE of “red” edges. We wish to 
determine whether G has a perfect matching with exactly k red edges. For 
general graphs, and even for bipartite graphs, the complexity of this problem is 
not settled. However Barahona and Pulleyblank (1987) show that it can be solved 
polynomially for Pfaffian graphs using the methods of this section. Replace the 
entries of the skew adjacency matrix A,(G) of a Pfaffian orientation which 
correspond to red edges with +x. Then tthe Pfaffian is a polynomial of degree at 
most |V|/2 in x, and the coefficient of x* is just the number of perfect matchings 
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with k red edges. This polynomial can be obtained by evaluating the Pfaffian for 
|V|/2 + 1 distinct values of x, and solving the resulting system of linear equations. 

This relationship also enables the construction of a randomized algorithm for 
‘the exact matching problem. See Mulmuley et al. (1987). 


8. Stable sets and claw free graphs 


The problems of computing a(G), or of constructing a maximum cardinality 
stable set or a minimum cardinality node cover are NP-hard for general graphs 
(Karp 1972), however due to K6nig’s Theorem 2.1 and the Gallai identities (1.1) 
and (1.3), these can be solved efficiently for bipartite graphs. 


Theorem 8.1. Let G=(V,£E) be a bipartite graph with no isolated nodes. Then 
a(G) = p(G),“t.e., the maximum cardinality of a stable set equals the minimum 
cardinality of an edge cover of the nodes. 


Moreover, in the bipartite case, the weighted stable set problem, wherein we 
have a vector (c,: v EV) of node costs and require a stable set S for which c(S) is 
maximized, is the dual of a minimum uncapacitated b-matching problem in a 
bipartite graph. Hence this can also be solved polynomially via matching theory. 

For the rest of this section we focus on nonbipartite graphs. First, we note that 
an analogue of the augmenting path theorem 1.6 does hold for stable sets, but it is 
more complicated. For any stable set § CV, an alternating sequence relative to S 
is a sequence o =(t,,5,,f,,5,,...) of distinct vertices alternately belong to 
S=V\S and S such that 


(i) s; EE S\{s,,8,,...,5;-;} and N(s,)N{t,,--- 4} 40; 
(ii) t,,,E S\{t,,t,...,t} and N(t,,,)O{s,,-.-,5;} 49, 
Nt.) O{t,,.--5t)} =O. 


It is said to be maximal, if no vertices can be added without violating (i) or (ii). 
Notice that it has a tree like structure, rather than being a path. If we let S(o) and 
S(c) denote the nodes of S and § respectively in a, then S(c) is a stable set, and 
if 7 is maximal, then S U S(c)\S(a) is also a stable set. If the length of o is odd, 
then it will be larger than S. 


Theorem 8.2 (Edmonds 1962, see also Berge 1985). A stable set S is maximum if 
and only if there is no maximal alternating sequence having odd length. 


However, whereas alternating trees enable us to find augmenting paths in the 
case of matchings, no general method is known for finding these odd length 
maximal alternating sequences. 

In one case, however, the situation simplifies. We say that G is claw-free if no 
induced subgraph is isomorphic to K,,. If o is an odd maximal alternating 
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@ NODES OF S 


Figure 8.1. An odd, maximal, alternating sequence. 


sequence in a claw-free graph G, then no node of S(a) can be adjacent to more 
than two nodes of S(a), without creating a claw, and no node of S(c) can be 
adjacent to more than two nodes of S. Therefore, every node of a is adjacent to 
at most two other nodes of o, and so a is the node sequence of a path with an odd 
number of nodes or a circuit. But the circuit would have to have an even number 
of nodes, contradictory to the sequence being of odd length. Thus we obtain the 
following. 


Theorem 8.3 (Minty 1980, Sbihi 1980). Let S be a stable set in a claw-free graph 
G=(V, E). Then S is maximum if and only if there is no induced simple path 7 
with node sequence t,, 51, ty, Sx)---st_—1) Sq, t, Such that each s,ES, each t,E S, 


and no node t, is adjacent to any node of S not in 7. a 


Thus the maximum stable set problem for a claw-free graph does appear much 
closer to matching than the general problem. As mentioned in section 1, it is a 
generalization of the maximum matching problem, since line graphs are a special 
class of claw-free graphs and the maximum matching problem in a graph is just 
the maximum stable set problem in its line graph. 

Sbihi (1980) and Minty (1980) obtained quite different polynomially bounded 
algorithms to find a maximum stable set in a claw-free graph. The former directly 
generalized the blossom algorithm described in section 3. The latter reduced the 
problem of finding an alternating sequence to O(|V|”) problems of determining 
whether an augmenting path (with respect to a given matching) occurs in an 
auxiliary graph. 

Recently, Lovasz and Plummer (1986) showed that by means of a sequence of 
local replacements, a claw-free graph G could be transformed to a line graph G’ 
in such a way that a(G) was reconstructible from a(G'). Their method proceeds 
as follows. First, a node uv of G is called regular if N(v) can be partitioned into 
two complete subgraphs, and irregular otherwise. (All nodes in line graphs are 
regular.) We let N,(v) denote the set of all nodes of G at distance two from v, for 
all vEV. 


Lemma 8.4. Let G be a claw-free graph and v an irregular node. Let Y be the set 
of nodes yEN,(v) for which N(v)\N(y) induces a complete subgraph of G. Let 
G’ be the graph obtained from G by deleting {v} U N(v) U Y and then joining each 
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nonadjacent pair of nodes of N,(v)\Y with a new edge. Then G’ is claw-free and 
a(G') = a(G) - 2. 


‘ A clique Q of G is defined to be reducible if a(N(Q)) <2. 


Lemma 8.5. Let Q be any reducible clique in a claw-free graph G. Let G' be the 
graph obtained from G by deleting the nodes of Q and joining any nonadjacent 
nodes u and v of N(Q) by an edge if and only if QC N(u) UN(v). Then G' is 
claw-free and a(G')=a(G) -— 1. 


They next prove that if a graph contains neither irregular nodes nor reducible 
cliques, then it is a line-graph. 


Theorem 8.6. Let G be a graph such that every node vu of G is contained in two 
_ irreducible cliqiies, which cover all members of N(v). Then G is a line graph. 


They can now compute a(G) for a claw-free graph G as follows: Select any 
node v and see if it is regular. (This simply involves checking whether N(v) 
induces a bipartite graph in the complement of G.) If v is irregular, we reduce G 
as described in Lemma 8.4. If v is regular, that is, N(v) can be partitioned into 
complete subgraphs 7, and T, of G, then we extend each 7, to a clique Q, by 
adding node v. If either QO, or Q, is reducible, we apply Lemma 8.5. If not, we 
proceed to a new node v. 

When this procedure terminates, by Theorem 8.6 we left with a graph G which 
is the line graph of a graph H. We use the maximum cardinality matching 
algorithm to compute v(H) = a(G). 

Minty (1980) also solved the problem of finding a maximum weight stable set in 
a claw-free graph. However, his method did not lead to a polyhedral description 
of the convex hull of the incidence vectors of the stable sets. Indeed, the 
determination of such a system remains one of the more vexing open problems of 
polyhedral combinatorics. (To this date, no one has been successful in generaliz- 
ing either the Sbihi or Lovasz—Plummer method to the weighted case.) 
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colouring theory, including critical graphs, graphs on surfaces, sparse graphs, per- 
fect graphs and Ramsey theory. 

Saaty and Kainen (1977) survey the four colour problem in its many variations 
and extensions, and Appel and Haken (1989) prove and discuss the four colour 
theorem. 

Lovasz (1983) and Grotschel et al. (1988) give excellent surveys of perfect graphs 
and stable sets in graphs, including combinatorial optimization aspects. Many types 
of perfect graphs and algorithmic aspects of them are treated by Golumbic (1980), 
and interesting problems by Gyarfas (1987). 

Colouring by only three colours is surveyed by Steinberg (1993), algorithmic 
aspects in general by Kubale (1991), chromatic polynomials by Read and Tutte 
(1988) and edge-colourings up to 1977 by Fiorini and Wilson (1977). Sachs and 
Stiebitz (1989) include a comprehensive bibliography of colour-criticality. Tuza 
(1990) gives an interesting survey of unsolved problems. The monograph by Jensen 
and Toft (1995) contains a description of more than 200 unsolved graph colouring 
problems. 

Finally, Berge and Chvatal (1984), De Werra and Hertz (1989), and Nelson and 
Wilson (1990) have edited interesting collections of research papers and surveys 
on perfect graphs and colouring theory in its many forms. 


1. Basic definitions and motivation a 
Dividing a set of objects into classes according to certain rules is a fundamental 
process in mathematics. A simple set of rules determines for each pair of objects 
whether they are allowed in the same class or not. Graph colouring theory deals 
with exactly this situation. The objects form the vertex set V(G) of a graph G, two 
vertices being joined by an edge in G whenever thcy are not allowed in the same 
class. To distinguish the classes we use a sct of colours C, and the division into 
classes is given by a colouring py: V(G) — C, where xy € E(G) > 9(x) ¥ oy). 
Thus vertices of the same colour form a stable set. The set C of colours can 
be any set; however, in this chapter we shall restrict C to be finite. Since the 
names and nature of the colours are irrelevant, most often we let C be equal to 
N, = {1,2,...,k} for some k EN. 

We shall assume that all graphs are simple, except if explicitly stated otherwise. 
If y is a colouring of G with colours C and |C| = k, then ¢ is a k-colouring and G is 
k-colourable. We do not require all colours to be used, hence a k-colourable graph 
is also (K + 1)-colourable. The smallest value of & for which G is k-colourable is the 
chromatic number x(G) of G. If x(G) = k, but x(G’) < k for any proper subgraph 
of G, then G is k-colour-critical. If y(G — t) < x(G) for 1 € V(G) UE(G), then ¢ 
is a critical element of G. Hence G is critical iff all vertices and edges are critical. 
The colour-critical graphs were first defined, studied and used by Dirac (1952a,b,c, 
1953). The only &-colour-critical graphs for k = 1,2 are the complete graphs K, 
and K>. The following result of K6nig (1916, 1936, Satz X.12) is equivalent to the 
statement that the 3-colour-critical graphs are the odd circuits. 
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Theorem 1.1. A graph G is 2-colourable iff G contains no odd circuit. 


Proof. If G contains an odd circuit C then x(G) > x(C) = 3. Conversely, if G 
contains no odd circuit, then the distance classes of G w.rt. a vertex x € V(G) are 
‘all stable and hence can be coloured alternately by two colours. 


No reasonable characterization of the 4-colour-critical graphs, or equivalently 
of 3-colourability, seems possible, in fact the 4-colour-critical graphs form a very 
unruly class, as we shall see. 

The following theorem by De Bruijn and Erdés (1951) shows that we may restrict 
our attention to finite graphs, in particular all colour-critical graphs are finite: 


Theorem 1.2. /f all finite subgraphs of an infinite graph G are k-colourable then 
G is k-colourable. 


Proof. The £2eorem follows from compactness results in logic. A direct proof can 
be given as follows. Let G be a possible counterexample. By Zorn’s lemma, G may 
be assumed to be maximal, i.e. the addition of any new edge e to G gives a finite 
subgraph G, of G+e which is not k-colourable. Suppose xy ¢ E(G), yz ¢ E(G), 
but xz € E(G). Then (Gy — xy) U(G,, — yz) +xz would be a finite subgraph of 
G which is not k-colourable. Hence non-adjacency is an equivalence relation on 
V(G). Since G is not k-colourable the number of equivalence classes is at least 
k +1, but then K,,, C G. Contradiction. 0 


Not only colouring theory, but much of graph theory, had its beginnings in work 
on the four colour problem of F. Guthrie, asking if any planar graph is 4-colourable. 
Well written accounts are contained in the monographs by Ringel (1959), Ore 
(1967), Saaty and Kainen (1977), Barnette (1983), and Aigner (1984). The four 
colour problem was mentioned in a letter from A. De Morgan to W.R. Hamilton 
in 1852, and seems first mentioned in print in 1860 by De Morgan. A proposed 
solution by Kempe (1879) stood for more than a decade until it was refuted by 
Heawood (1890) in his first paper. Heawood (1890) proved the five colour theorem, 
extended the problem to higher surfaces by giving a sufficient number of colours for 
‘each surface and showing the number 7 of colours to be necessary and sufficient for 
the torus. Moreover, Heawood considered maps with countries consisting of more 
than one connected part. Dirac (1963) gave a survey of Heawood’s achievements. 

After many attempts and partial results on the four colour problem throughout 
this century by many mathematicians, Appel and Haken in 1976 announced a 
complete proof, published in Appel and Haken (1977a) and Appel et al. (1977). 
The proof is based on the same basic idea as Kempe’s proof, which is to find a set 
of unavoidable configurations, all being reducible in the sense that a 4-colouring 
ofa planar graph containing one of the configurations can be obtained from a 
4-colouring of a smaller reduced planar graph. But where Kempe’s unavoidable 
configurations were nodes of degree 3, 4 or 5, Appel and Haken’s initial set had 
1936 configurations (in Appel and Haken 1977a it was announced that a proof with 
fewer configurations is possible). The detailed techniques of Appel and Haken are 
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further developments of methods of Heesch (1969), who was the first to strongly 
emphasize a possible proof of the four colour theorem along these lines. Several 
surveys of the proof exist, for example Appel and Haken (1977b) and Woodall 
and Wilson (1978). Due to its length, its extensive use of computer verification, 
and its omission of details, the proof of Appel and Haken has been surrounded 
by some controversy. In any case it is safe to say that the last word on the four 
colour problem has not been said. Appel and Haken (1986; 1989) have answered 
the criticism raised against their proof and published an amended version of it. 

Recently a new ingeniously improved proof of the four colour theorem has been 
presented by Robertson et al. (1994). The proof is similar to the proof of Appel 
and Haken, but it is simpler and more transparent in several ways. In particular it is 
based on a simpler procedure to obtain unavoidability (avoiding some of the more 
problematic aspects of Appel and Haken’s proof with configurations that wrap 
around and meet themselves), and uses only 633 configurations. The reducibility 
proofs are however still based on computers. 

But even if we accept that there is only little more to add about the four colour 
problem, there are still many attractive and easily formulated unsolved problems 
to consider, as exemplified in the book by Jensen and Toft (1995). Tutte (1978) 
says “The Four Colour Theorem is the tip of the iceberg, the thin end of the wedge 
and the first cuckoo of spring”, and he describes several difficult conjectures gen- 
eralizing the four colour theorem, among them the famous conjecture of Hadwiger 
(1943), which may be formulated as follows. 9 


Conjecture 13. Let ¢ be a class of graphs closed under deletions (of vertices 
and/or edges) and contractions of edges (removing possible loops and parallel 
edges that might arise). Then the maximum chromatic number y of the graphs ia 
$ equals the size k of a largest complete graph in G. 


For k =3 this was proved by Dirac (1952a), and the case k = 4 not only implies, 
but is equivalent to the four colour theorem, as proved by Wagner (1937). That 
Conjecture 1.3 is true when & is the class of all graphs embeddable on a surface 
S (different from the sphere), was proved by Ringel (1959). 

As somewhat different examples of unsolved problems, here are two of the 
favourites of Erdés (1981). 


Problem 1.4. Let G be the union of at most & complete k-graphs of which any 
two have at most one vertex in common. Is G k-colourable? 


Problem 1.5. Let V(G) = R’, ie. the vertices of G are all the points in the plane, 
where xy € E(G) iff x and y have distance 1. What is y(G)? 


Problem 1.4. is due to Erdés, Faber and Lovadsz, and Erdés (1981) offers 500 
dollars for a proof or a disproof. Problem 1.5. is due to Hadwiger and Nelson. 
By Theorem 1.2 it is sufficient to consider finite subgraphs of G. It is known that 
4<x(G) <7. 
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The chromatic number also plays a role outside pure colouring theory. A good 
illustration of this is the theorem of Erdds and Stone (1946) and Erdés and Si- 
monovits (1966). Let G be a fixed graph with at least one edge. For n > |V(G)| 
define f(n,G) to be the maximum number of edges possible in a graph on 7 ver- 
tices not containing G as a subgraph. This extremal function, apparently with no 
connection to colouring, depends on y(G). 


Theorem 1.6. f(n,G)/n? — 3(x(G) — 2)/(x(G) — 1) for n > o. 


Finally, although graph colouring theory has a recreational background, there 
are “real world” applications. For example, time tabling problems are basically 
graph colouring problems, where the activities to be scheduled are represented by 
vertices, two vertices being joined by an edge iff the corresponding activities are 
in conflict. Thus the times to be assigned to the activities correspond to colours. 
Radio frequency assignment is another example. 


2. Constructions and examples 


This section may be considered to be negative in nature. By exhibiting a variety 
of examples, the chromatic number of a graph G is seen not to have very strong 
implications for the structure and properties of G, even when G is colour-critical. 


2.1. Sparse graphs 


Obviously w(G) < x(G), where w(G) denotes the size of a largest complete sub- 
graph of G. Moreover, w(G) = 1 implies y(G) = 1. Zykov (1949) showed by ex- 
amples that apart from these there are no relations between w and y in general. 
In particular he proved the following. 


Theorem 2.1. For all k there exists k-chromatic triangle-free graphs Gy, i.e. x(G) = 
k and G, 2 K3. 


Proof. By induction we assume that G,,G2,...,G,_; have been obtained. Take 
disjoint copies of these. Let V be a set of |V(G,)|-|V(G2)|-+-|V(Gx_1)| new ver- 
tices corresponding to all selections of one vertex from each of Gi, G2,..., Gx—1. 
Then G, is obtained from G,, G2,...,G,_; and V by joining each vertex in V to 
its corresponding set of k — 1 vertices, one vertex from each G;. 

G,, is k-colourable. On the other hand, if G, had a (k — 1)-colouring, then G; 
would have a vertex x), of some colour i;, G2 would have a vertex x2 of a colour 
i, different from i,, G3; would have a vertex x3 of a colour i; different from i, and 
tn, etc. The vertex uv of V joined to x1,x2,...,x,_, would be joined to all colours 
of the (k — 1)-colouring. Contradiction. Hence y(G,) =k. O 


Schauble (1969) proved that the above construction, which is a refinement of 
Zykov’s, produces a k-colour-critical G, from critical G;,G2,...,G,_;. There are 
several other constructions of triangle-free k-chromatic graphs, the earliest ones 
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by Descartes (1948a, 1954), Kelly and Kelly (1954) and Mycielski (1955). A survey 
was given by Sachs (1969). 

Erdés (1957) proved by a geometric construction that the size of a smallest G, 
is bounded by a polynomial in k (of degree 50). In fact, Erdés (1959, 1961) proved 
by probabilistic non-constructive arguments the extensions Theorems 2.2 and 2.3 
of Theorem 2.1. 


Theorem 2.2. There is a constant c such that for all k there exists a k-chromatic 
triangle-free graph on <c-k? - (logk)? vertices. 


Theorem 2.3. For all k and & there exist k-chromatic graphs without circuits of 
length < £. 


It is not known whether the upper bound in Theorem 2.2 is best possible, how- 
ever Erdés and Hajnal (1985) obtained c’ - k? -logk as a lower bound. 

Extending the domain of operations to hypergraphs, Lovasz (1968) gave a con- 
structive proof of Theorem 2.3. This was later done more simply by NeSetfil and 
Rédl (1979). 

There is a very simple way to obtain a k-chromatic graph in which all odd 
cycles are long. Let X = {1,2,...,2n+k} and define the Kneser graph KG,, by 
V(KG, 4) = {x|x CX and x =n} and E(KG,,) = {xy|xN y =@}. The Schrijver 
graph SG,,, is the subgraph of KG,,, induced by X’ = {x’|x’ does not contain two 
consecutive elements in the cyclic order (1,2,...,21+,1)}. If K; denotes shose 
vertices having least element i, then K,, K2,...,Kxsi, V(KG,,«) — Ky — Kz —---- 
Ky41 defines a (k + 2)-colouring of KG,,,. Kneser (1955) thus observed x(KG, x) < 
k+2, and equality was proved by Lovasz (1978a) using Borsuk’s theorem, and 
subsequently more simply by Barany (1978) using a further topological result of 
D. Gale. Schrijver (1978) proved that x(SG,,,) = k +2 and that all vertices of SG,,, 
are critical. 

Let C be an odd circuit of length @ in KG,,. For each xy € E(C) consider 
the k-set X — x —y. Since C is odd these k-sets cover X, hence @ > (2n+k)/k. 
By a well-known combinatorial result (Ryser 1963, Lemma 2.3), the graph SG,,x 
has ((2n+k)/(n+k))- (i) nodes. For each k >2 this implies the existence 
of infinitely many (k +2)-chromatic graphs G in which each odd circuit has 
length > c, -|V(G)|!/*. For k =2 the first such graphs were obtained by Gallai 
(1963a). Kierstead et al. (1984) proved the above order of magnitude to be best 
possible. For circuits in general the corresponding order of magnitude is only 
c-log|V(G)|/log&, as proved by Erdés (1959, 1962). 

Finally let us mention the following remarkable result due to Miller (1979). The 
proof of the second part uses Theorem 2.3. The case r = 1 of Theorem 2.4 implies 
the existence of uniquely k-colourable graphs without short circuits. 


Theorem 2.4. Let A be a set of vertices, and let P,,P2,...,P, denote different 
partitions of A into at most k classes (k > 3). Then there exists a k-chromatic graph 
G with A C V(G) such that G has precisely r different k-colourings ¢), ¢2,...1 Gr 
(where renaming or permuting the colours does not count as different colourings), 
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and for all i the restriction of g to A partitions A into colour classes like P;. 
Moreover, for a given £, there exists such a G in which all circuits have length > € 
and any two vertices of A distance > &. 


22. Hypergraphs 


In a hypergraph H the edges are subsets of the vertex set V(H). The edges may 
be of arbitrary, not necessarily equal, size > 2. If all edges have the same size r 
the hypergraph is r-uniform. A colouring of H is a mapping » : V(H) — C, where 
|p(A)| 22 for all edges A € E(H). This definition is due to Erdés and Hajnal 
(1966). A 2-colour-critical hypergraph H has just one edge, i.e. E(H) = {V(H)}. 
No reasonable characterization of the 3-colour-critical hypergraphs, or equivalently 
the 2-colourable hypergraphs, seems possible. An example of a 3-uniform 3-colour- 
‘critical hypergraph is the finite projective plane of order 2, also called the Fano 
configuration, on seven vertices and seven edges. An example of an r-uniform 
. k-colour-critical hypergraph is a set of (r — 1)(k — 1) +1 vertices together with all 
subsets of size r as edges. Other examples can be found, e.g. in Toft (1975). 

Any k-chromatic or k-colour-critical hypergraph with k 2 4 can be transformed 
into a similar graph. The construction for Theorem 2.1 by Descartes (1948a, 1954) 
is an example of this transformation. From a graph G, a vertex x of G and a set X 
of vertices disjoint from G of size at most deg(x), a new graph G’ can be obtained 
by splitting x into X, i.c. G’ is obtained from X and G — x by joining each vertex 
of X to one or more neighbours of x, such that each neighbour of x is joined to 
at least one vertex of X. If each neighbour of x is joined to exactly one vertex of 
X, then the splitting is proper. The following theorem is not deep, but it can be 
used to construct a variety of colour-critical graphs, because we can operate free 
of the constraint in graph theory that all edges have to be of size 2 (the above 
mentioned constructive proofs of Theorem 2.3 illustrates the same: even if we are 
only interested in graphs, the hypergraphs are indispensable in the constructions). 


Theorem 2.5. Let k > 4 and let H be a k-chromatic hypergraph (or graph) and A 
an edge of H. Let G be a k-chromatic graph disjoint from H with a node x € V(G) 
of degree > |A|. Obtain the hypergraph H' from H and from G by a proper splitting 
of x into A and the removal of A as an edge. Then: 

(a) x(H’) 2k. 

(b) if H and G are k-colour-critical, then H’ is k-colour-critical in at least these 
cases: 

(i) deg(x) < 2k ~—4; 

(ii) |A| = 2 and the graph obtained from G by the proper splitting of x into A 
is (k ~ 1)-colourable; 

(ili) G consists of an odd circuit C completely joined to a K,_3 containing x. 


Proof. If H’ had a (k — 1)-colouring, then, since y(H) = k, the vertices of A would 
all have the same colour, and hence G would be (k — 1)-colourable. Therefore 
x(H") > k. The proof of (b) is more tedious, but not difficult (see Toft 1974a). oO 
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If H is k-critical with no other edge than possibly A having size > 3 and if G 
is k-critical with deg(x) = k —1 2 |A|, then H’ is a k-colour-critical graph with a 
non-trivial cut (ie. both sides of the cut have 22 vertices) of k —1 edges. Any 
such H' can be obtained in this way, as proved by Gallai (unpublished) and Toft 
(1974a). 

If H is a k-colour-critical graph and the condition in (b) (ii) is satisfied, then H’ 
is a k-colour-critical graph containing a cutset of two vertices. Any such H’ can be 
obtained in this way, as proved by Dirac (1953, 1964a) and Gallai (1963a). 


2.3. Colour-critical graphs 


Since the k-colour-critical graphs are minimal k-chromatic one might expect them 
to have relatively few edges. However, for k >4 there exist positive constants 
a, and infinitely many k-colour-critical graphs G with > a,|V(G)|? edges. Dirac 
(1952a) observed the following. 


Theorem 2.6. Let G, and G, denote disjoint graphs, and let G, * Gz denote the 
graph obtained from G, and G, by joining each vertex of G, to each vertex of G, 
by an edge. Then x(G, * G2) = x(G1) + x(G2). Moreover, G; * Go is colour-critical 
iff G, and G2 are colour-critical. 


The proof of Theorem 2.6 is straightforward. Letting G, and G2 be odd circuits 
of the same length n, Dirac (1952a) thus obtained 6-colour-critical graphs on 2n 
vertices and n? +2n edges, thereby getting ag = t and answering a questign of P. 
Erdés. For & = 4 one may similarly let G, and G» be 2-colour-critical hypergraphs 
of the same size x. Then G, + G, is a 4-colour-critical hypergraph with n* edges of 
size 2 and two edges of size n. Reducing G, * G2 to a 4-colour-critical graph G by 
Theorem 2.5 (b) (iii), one may, for m odd, obtain a G on 4n vertices and n? + 4n 
edges. These and other similar examples were obtained by Toft (1970). The best 
possible values of a, are not known, and the above values a4 = 7g and ag = } are 
the best known at present. 

In the above examples most edges are concentrated in 2-colourable subgraphs, 
but V. Raédl, and Stiebitz (1987) proved that this need not be so. 


Theorem 2.7. For ail k > 4 there exists a positive constant by and infinitely many 
k-colour-critical graphs G such that it requires the removal of > b,|V(G)|? edges 
from G to reduce the chromatic number to k —2. 


Proof. We shall carry out the construction for kK = 4 only. Let A;, Az, A3,A4,A5 
be disjoint sets of n vertices each, n > 2. Join A; to A;,,, i = 1,2,3,4, and As to Aj 
by all possible edges. Add the five hyperedges A; UA3, Az U Ag, A3 UAs, Ag UA}, 
As U A) of size 2n. The hypergraph H obtained is 4-colour-critical. Reducing H to 
a 4-colour-critical graph by Theorem 2.5 (b) (iii), one may obtain a G on 15n +5 
vertices such that the removal of at least 2” edges from G is required tc reduce 
the chromatic number to2. 


As a further surprise, the maximum number a(G) of vertices in a stable set 
in a k-colour-critical graph (k 2 4), may exceed any fixed proportion of the total 


Colouring, stable sets and perfect graphs 243 


number of vertices, as proved by Brown and Moon (1969). Simonovits (1972) gave 
an alternative proof, based only on the existence of k-colour-critical graphs with 
many edges. He noticed that splitting a vertex of a k-colour-critical graph into 
new vertices corresponding to all subsets of size k — 1 of the neighbours of x in 
G, and joining each new vertex to its corresponding subset, produces a new graph 
G’ with y(G’) =k. Moreover, a k-colour-critical subgraph G” of G’ is obtained by 
removing a subset of the new vertices. Since each neighbour y of x will still be 
joined to at least one new vertex (because y(G — xy) = k — 1), it follows that at 
least deg(x)/(k — 1) new vertices remain in G”. If G has many edges, then splitting 
all vertices in a colour class incident with many edges produces a k-colour-critical 
graph with a large stable set consisting of new vertices. The order of magnitude is 
as in the construction of Brown and Moon. Lovasz (1973a) proved this order of 
magnitude to be best possible for k = 4 and generalized it to all k > 4. 


Theorem 2.8. Fork > 4, let a,(n) denote the maximum number of vertices possible 
_ ina stable set sf vertices in a k-colour-critical graph on n vertices. Then for infinitely 
many values of n 

(i) a(n) >n—k-ni/k-2) 
and for all values of n 

(ii) a(n) <n — bk n/E-2), 


Proof. Let G’ be obtained from i-colour-critical graphs G;, i=1,2,...,k —1, 
as in the construction for Theorem 2.1 by Zykov and Schauble, where each of 
G3, G4,...,Gy_, has N vertices, except that we shall let G2 be a 2-colour-critical 
hypergraph, also on N vertices. Hence G’ has precisely one hyperedge, and it may 
be reduced to a k-colour-critical graph G by Theorem 2.5 (b) (iii). This G has 
n > a(G) > N*, and hence we obtain (i): 


n—a(n)<n—a(G)<(k-1)N+2<¢k-N ok enV), 


Lovasz’ proof of (ii) is an interesting application of a linear algebra technique. Let 
G be a k-colour-critical graph on n vertices with a stable set S of size s = a,(n). 
By the splitting argument of Simonovits (1972) we may assume that all vertices 
of S have degree k —1 in G (otherwise split the vertices of 5; the theorem for 
the new graph implies the theorem for G). Let T be the tf =n — a,(#) vertices of 
G outside S, and let A; be the set of kK ~1 neighbours in T of vertex i from S 
fori =1,2,...,s. Let By, B,...,Bg be all subsets of T of size k — 2, ite. g = (9): 
Finally, let M = {mj;} be the s x q-matrix with mj =1 if A; 2 Bj and mj =0 
otherwise. 

We shall prove below that the rank of M over GF(2] is s. Hence s <q, and 
therefore 


ft 2. tk-2 et \*? 
= S Se a 
meee (.i2) + am < (aa) 
where we use that i! > 2(i/e)' by Stirling’s formula. Then we obtain (ii): 


n~a(n)=t> (=) niltk-2) > Kg ritk-2) 
~ 6 i 
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To see that the rank of M over GF[2] is s, colour the subgraph G[7] of G induced 
by T with k — 1 colours 1,2,...,k — 1 in such a way that only A; of Aj,Ao2,..., As 
get all k —1 colours. This is possible, since vertex 1 of S is critical in G. The 
total number of pairs (i, /), where B; has colours 1,2,...,k —2, A; > B;, andieé 
{1,2,...,r} for some r, 1 <r <s, is an odd number, since there is exactly 1 pair 
for i = 1, and 0 or 2 pairs for i =2,...,r. Hence at least one B; must be contained 
in an odd number of A), A2,...,A,; but then the rows of M corresponding to 
A, A2,...,A, do not have sum 0. This argument shows in fact that no non-empty 
subset of the rows has sum 0, and hence the s rows are linearly independent over 
GF[2]. a 


Using the splitting process of Simonovits, but splitting into vertices of large 
degree, Simonovits (1972) and Toft (1972a) independently proved the following. 


Theorem 2.9. There exist infinitely many 4-colour-critical graphs G for which the 
minimum degree 5(G) is at least (3 -|V(G)|)'. 


Notice that Dirac’s 6-colour-critical graphs G with many edges have 6(G) > 
c-|V(G)|. It is not known if a similar result holds for 4-colour-critical graphs. 

Theorems 2.7, 2.8 and 2.9 are in sharp contrast to the situation for k = 3, i.e. for 
the odd cycles. A further such result, based on the existence of 4-colour-critical 
graphs with many edges, was obtained by V. RGdl and published by Toft Gers 


Theorem 2.10. There exists a constant c>1 such that the number of non- 
isomorphic 4-colour-critical graphs on n vertices is > cl), 


3. Algorithmic aspects 


Johnson (1978a) remarked that all the known graph colouring algorithms are hor- 
rible, and that all the unknown graph colouring algorithms are probably not much 
better. Much can be said to support these statements, as we shall see, the key- 
words being “worst case behaviour” and “NP-completeness”. A graph colouring 
algorithm A is an algorithm that applied to any graph G produces a colouring of 
the vertices of G. The number of colours that A uses on G is denoted by A(G). 
Ideally we would like A to be efficient and A(G) to be equal to or close to x(G). 
The performance guarantee A(n) of A is the maximum of A(G)/x(G) taken over 
all graphs G on n vertices. 


3.1. Polynomial colouring algorithms 


There are two basic types of polynomial graph colouring algorithms, sequential 
colouring and maximal stable set colouring. In a sequential colouring the vertices of 
the input graph G are ordered and then coloured in this order by colours 1, 2, 3,..-- 
giving each vertex the least colour possible. In a maximal stable set colouring the 
vertices are also considered in some order, giving a vertex the colour 1 wheneve1 
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it is not joined to a vertex already coloured 1. When no more vertices can be 
coloured 1, all the vertices of colour 1 are removed and the colouring continues 
likewise with colour 2 on the remaining graph, etc. The vertices of colour i form a 
maximal stable set in G— V; — V2 —---— Vj_1, where V, are the vertices of colour 
j. The number of colours used will depend on the order in which the vertices are 
considered, in fact there will always be orderings giving a y(G)-colouring of G, but 
no polynomial algorithms to find such orderings are known. A possible polynomial 
ordering for sequential colouring is a srnallest last ordering in which the vertex x; 
in the order x),X2,...,%, is of smallest degree in G — x, — xX,_1 —+*-—Xj41, OF & 
largest first ordering in which the vertices are ordered by non-increasing degrees. 
The philosophy behind these orderings is to have, by each colouring of a vertex, 
as few restrictions as possible. Polynomial graph colouring algorithms of the above 
types with various refinements, are described by Matula et al. (1972). More recent 
surveys were presented by Manvel (1985) and Kubale (1991). 

For each of 13 different versions of such polynomial graph colouring algorithms, 
’ Johnson (1974) gave infinite classes of graphs, where the algorithms perform 
poorly. For example, let G be a bipartite graph with vertices x1,x2,...,x, and 
Yt) ¥21---;¥n in the two sides and with all possible edges x;y; with i # 7. Then a 
sequential colouring with the ordering X,, ¥n,Xn—1;---,X1,¥1 uses colours on the 
2-colourable graph G. This is the largest first ordering, and a similar example for 
a smallest last ordering can be given. Such examples show the following. 


Theorem 3.1. For a largest first or smallest last sequential graph colouring algo- 
rithm A we have A(G) >, - x(G)-|V(G)| for a positive constant c, and infinitely 
many G. Moreover, A(n) 2c, -n for a positive constant c». 


How bad the situation seems to be can best be explained by posing a question 
of Johnson (1978b). 


Problem 3.2. Does there exist a polynomial graph colouring algorithm A for 
which A(G) <c- x(G)-|V(G)|'-* for suitable positive constants ¢ and «? 


The maximal stable set algorithms seem to behave slightly better than the se- 
quential ones. As described by Johnson (1974), P. Erdés suggested the following. 


Theorem 3.3. There is a polynomial maximal stable set graph colouring algorithm 
A for which A(G) <c - (log x(G)) -|V(G)|/(log|V(G)|) for all G with x(G) 22 
and some constant c. 


Proof. Let |V(G)| =” and y(G)=k >2. Then the size of a maximum stable 
set a(G) is >n/k, and therefore the minimum degree 5(G) is <n — (n/k). We 
first colour a vertex x; of minimum degree by the colour 1. Then x, and all its 
neighbours are deleted. At least n/k — 1 vertices remain. Taking a vertex x2 of 
minjmum degree in the remaining graph, colouring x2 by colour 1, and deleting 
x2 and all its neighbours, at least n/k? — 1/k — 1 vertices remain. Continuing like 
this, after ¢ vertices have been coloured 1, and as long as 


n 1 1 
je pe ee 
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at least one more vertex can be coloured 1. It follows that as long as n > k't! at least 
one more vertex can be coloured 1. Therefore |V;| > |logn/logk|, where V; are 
the vertices coloured 1. Continue in the same fashion with G — V, and the colour 
2, etc. Let G— V, — V2 —---— V; have n;,; vertices and let / be the first value of 
i for which n;,, <n. Then the number of vertices of colour i for i= 1,2,...,/ is 
> |logn;/logk| > |} logn/logk|. Therefore J < 2(logk) -n/(logn — 2logk) and 
A(G) <1+4/n. Hence A(G) < c(logk)-n/logn. a 


The performance guarantee of the above algorithm A is < cn/logn. By refine- 
ments, Wigderson (1983) obtained a polynomial algorithm with performance guar- 
antee < cn(loglogn)*/(logn)* and in 1990 this was improved by M.M. Halldérsson 
with an extra factor logn in the denominator. 


3.2. NP-completeness 


As seen above, we are very far from a polynomial graph colouring algorithm A 
with A(G) equal to y(G). Probably no such algorithm exists. Karp (1972) showed 
the problem-type GRAPH k-COLOURABILITY, for a given graph G and num- 
ber k to decide whether G is k-colourable, to be NP-complete. This means that a 
polynomial algorithm to solve it implies polynomial algorithms to solve any prob- 
lem in the large class NP of problem-types, and hence NP would be equal to 
the class P of polynomially solvable problem-types. This is considered very un- 
likely. The monograph by Garey and Johnson (1979) is the authoritative%ext on 
NP-completeness. 

Using the method in the proof of Theorem 1.1 it is easy to see that GRAPH 
2-COLOURABILITY is in P However, already the case & =3 contains all the 
difficulties, even for rather special graphs. 


Theorem 3.4. The following problem-types are equivalent in the sense that if there 
is a polynomial algorithm to solve one of them, then there are polynomial algorithms 
for all of them. 


(a) HYPERGRAPH k-COLOURABILITY. 

(b) GRAPH k-COLOURABILITY. 

(c) GRAPH 3-COLOURABILITY. 

(d) MAX DEGREE 4 PLANAR GRAPH 3-COLOURABILITY. 
(c) HYPERGRAPH 2-COLOURABILITY. 

(f) 3-UNIFORM HYPERGRAPH 2-COLOURABILITY. 


Proofs of the various reductions establishing Theorem 3.4 can be found in Lovasz 
(1973b), Garey et al. (1976) and Toft (1975). 

Theorem 3.4 shows that we cannot expect ever to get a polynomial algorithm to 
x(G)-colour graphs G, and with the same degree of certainty one cannot expect 
even to get anywhere near to y(G). Lund and Yannakakis (1993) proved the 
following. 
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Theorem 3.5. For some constant £9 > 0, if there is a polynomial graph colouring 
algorithm A with A(G) < x(G)-|V(G)|”, then there is a polynomial algorithm for 
GRAPH k-COLOURABILITY, and x(G) can be found by a polynomial algorithm. 
In particular this is the case if there is a polynomial graph colouring algorithm A 
‘with A(G) <r-y(G)+d for some constants r and d. 


Proof in a very special case. We shall only prove the “in particular” case and 
only forr < <. Let G be any graph and let N be an integer such that 4N > r-3N + 
d. Take N disjoint copies of G and join any two completely by all possible edges 
to obtain G*. If G is 3-calourable, then A(G*) <r-y(G*)+d<r-3N+d<4N. If 
G is not 3-colourable, then A(G*) 2 x(G*) > 4N. Hence A applied to G* solves 
(c) of Theorem 3.4. The result then follows from Theorem 3.4. 9 


The proof for ¢ < r < 2 makes use of the Kneser graphs instead of the complete 
graphs Ky, but this is rather more complicated and was carried out by Garey and 
_ Johnson (1978). For r 2 2 the result was obtained as a corollary of the first part 
of Theorem 3.5 by Lund and Yannakakis (1993). 


3.3. Chromatic polynomials 


For a graph G let P(G,k) denote the number of k-colourings of G using colours 
1,2,...,k. Thus P(K,,k) =k-(k —1)-(k —2)---(K—n+1) and P(K,,k) =k". 
For e = xy € E(G), let G- e denote the graph obtained from G by contracting e to 
a single vertex, leaving only one copy of each parallel edge that might arise. Since 
the k-colourings of G — e are of two types, those with x and y coloured differently 
and those with x and y coloured alike, we have P(G — e,k) = P(G,k)+P(G-e,k). 
Both G — e and G-e have fewer edges than G, hence applying the relation again 
to these graphs, and so on, P(G,k) will be expressed as the sum of terms P(G’,k) 
for graphs G’, without edges, the largest of which has |V(G)| nodes. Therefore, 
the following holds. 


Theorem 3.6. P(G,k) is polynomial in k with integral coefficients. If \V(G)| =n 
then P(G,k) is of degree n with leading term k". The coefficient of k° is 0. 


The above proof of Theorem 3.6 is constructive and can be used to compute 
P(G,k), although this of course is not a polynomial algorithm. Theorem 3.6 can 
also be proved using the principle of inclusion and exclusion. One can easily prove 
that the coefficient of k”-! is —|E(G)|, that the coefficients alternate in sign and 
that the smallest number r such that k’ has a nonzero coefficient is the number of 
connected components of G. Thus P(G,k) contains several pieces of information 
about G in addition to the basic property that y(G) is the minimal integer k > 0 
for which P(G,k) 4 0. However, chromatic polynomials are not well understood, 
for example it is not known what makes a polynomial chromatic. 

The study of chromatic polynomials was initiated by Birkhoff (1912) and contin- 
ued by Whitney (1932). An exposition of the early history of chromatic polynomials 
was given by Biggs et al. (1976). A good survey was presented by Read (1968), and 
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more recently by Read and Tutte (1988). Many deep results have been obtained 
by W. T. Tutte in a sequence of papers. For a development of graph colouring 
theory with the chromatic polynomial as a basic concept, see Tutte (1984). One of 
Tutte’s surprising and beautiful results is the golden identity. 


Theorem 3.7. Let M be a plane triangulation on n vertices. Then 
P(M,1+2) =(1+2)-7°"""9. (P(M,7+1))’, 
where 7 is the golden ratio (1 + V5)/2, ie. T+1= 7? and t+2= V5r. 


Theorem 3.7 was obtained by Tutte (1970), who also noted that P(M,7r+1) 40. 
Hence we have the curious consequence that P(M,3.618...) > 0. The four colour 
theorem, of course, says P(M,4) > 0. 


4. Upper and lower bounds for the chromatic number 


As explained in section 3 the chromatic number seems intractable, and it is there- 
fore of importance to relate it to more tractable graph constants. We shall consider 
upper bounds in terms of the degrees in the graph. It seems more difficult to obtain 
reasonable lower bounds. 
4.1. Brooks’ theorem a 

A sequential colouring of the vertices of a graph G in any order, using for each 
vertex a least possible colour from 1,2,3..., produces a colouring with at most 
A(G) +1 colours, i.e., y(G) < 4(G) +1. A deep theorem of Hajnal and Szemerédi 
(1970), see also Bollobds (1978), says that G can in fact be (A(G) + 1)-coloured 
with all colour-classes of equal or almost equal size. Another basic result in graph 
colouring theory, and the first non-trivial result relating x to a tractable graph 
constant for graphs in general, is the theorem of Brooks (1941). It implies easily 
that the problem-type MAXIMUM DEGREE 3 GRAPH 3-COLOURABILITY 
can be solved by a polynomial algorithm (compare Theorem 3.4). 


Theorem 4.1. Let G be any graph. Then x(G) < A(G) +1. If A(G) = 2 then equal- 
ity holds iff G has an odd circuit as a connected component. If A(G) 4 2, then 
equality holds iff G has a complete graph on A(G) +1 vertices as a connected com- 
ponent. 


Proof. For A(G)=1 the theorem is obvious, and for A(G) =2 it follows from 
Theorem 1.1. Hence we only need to prove that A(G) < s and G 2 K,,, for some 
s > 3, imply that G is s-colourable. The proof of this is by induction over |V(G)|. 
For |V(G)| < 4 the statement is obviously true. 

If G is disconnected or has a cutvertex, then by induction each block of G is 
s-colourable, and hence G is. If G is 2-connected with a cutset of two vertices x 
and y, then G = G; U G2, where G; and G2 each have fewer vertices than G and 
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V(G, M G2) = {x,y}. By induction G, and G; are both s-colourable. If xy € E(G), 
then we may assume the two colours of x and y in the two s-colourings to be the 
same, hence the two s-colourings combine to an s-colouring of G. If xy ¢ E(G), 
then by induction G) = G; Uxy is s-colourable or equal to a K;,, for i= 1,2. Then 
again G is s- -colourable, except possibly when at least one of G' and Gj, say Gi, is 
a K,,1- But in this case both x and y have degree 1 in G2, since A(G) < s. Hence 
G> has an s-colouring in which x and y have the same colour, since s > 3. Because 
G, = K,,; —xy this s-colouring of G2 can be extended to an s-colouring of G. 
Thus we may assume G to be 3-connected. 

Colouring the vertices of G in some order, using for each vertex a least possible 
colour from 1,2,3,..., produces an (s+ 1)-colouring of G. If the vertex x gets 
colour s+1, then deg(x) =s and the s neighbours of x have colours 1,2,...,s 
Moreover, in this situation we may recolour x by any colour i and recolour the 
unique neighbour of x of colour i by one of 1,2,...,5+1. In this way we push 
the undesired colour s +1 from x to any neighbour of x or it disappears. Pushing 
along any path x),X2,X3,...,%, Starting in x,, we obtain an (s+ 1)-colouring in 
which at most the vertex x, of the path has colour s+1. Since there are paths 
from any vertex to x,,, we may push the colour s +1 so that it is present only at 
Xn. If deg(x,) < s we may recolour x, to obtain an s-colouring of G. Thus we may 
assume that all vertices of G have degree equal to s. 

Let p and q be vertices of G not joined by an edge. Since there are paths from 
any vertex to g, we may push the colour s +1 so that it is present in G at most at 
q. In such a colouring the vertex p has two neighbours x and y of the same colour. 
Since G is 3-connected there is a path P from q to p avoiding x and y. Pushing 
the colour s +1 from q along P, either the colour s+1 disappears or it ends up 
at p. In the latter case, since x and y are coloured alike and deg(p) = s, there is 
a colour i from 1,2,...,s that may be given to p, thus obtaining in any case an 
s-colouring of G. O 


The above proof, based on sequential colouring and colour interchange, is the 
original argument of Brooks (1941). Melnikov and Vizing (1969) gave a simple 
proof based only on interchange, and Lovasz (1975) gave one based on selecting the 
initial order of the vertices more carefully to obtain the s-colouring immediately. 

For graphs G with A(G) > 3 and not containing a K4g).1, Brooks’ theorem im- 
proves the obvious bound y(G) < A(G) + 1 by 1. Excluding the existence.of smaller 
complete subgraphs, further improvements were obtained independently around 


the same time by Borodin and Kostochka (1977), Catlin (1978) and Lawrence 
(1978). 


Theorem 4.2. Let G be any graph. If G2 K,, where 4<r<A(G)+1, then 
x(G) < A(G) +1 - [(Delta(G) +1)/r], ie. x(G) < (r-1)/r- (A(G) +2) 


Proof. The proof uses a result of Lovdsz (1966) that if dj +d. +...+d, 2 A(G) — 
q+1, then V(G) can be decomposed into classes V,,V2,...,V,, such that the 
subgraph G; induced by V; has A(G,) < d;. The general case follows immediately 
from the case g = 2. For g =2 a partition for which d, - |E(G2)| + d -|E(G))| is 
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as small as possible has the desired property, as can easily be seen by moving a 
possible vertex of too high degree to the other side. 

Now let d) = d) =--- =dg_1 =r —1 and d, >r—1, so that 3d; = A(G)~—q+1 
and g = |(A(G) + 1)/r|. Then by Lovasz’ theorem there is a decomposition of G 
into G;,G2,...,G, with A(G;) < d;, and hence x(G;) < d; by Brooks’ theorem 4.1. 
Therefore x(G) < 3x(G;) < 3d; = A(G)+1-q. O 


For triangle-free graphs the theorem only gives the same as for K4-free graphs; 
however, improvements in the triangle-free case have been obtained in 1982 by 
A. V. Kostochka: y(G) < 2(4(G) +3)/3, and recently by J. H. Kim: y(G) <c- 
A(G)/ log A(G). 

Brooks’ theorem may be formulated and generalized in other directions, in 
particular in terms of colour-critical graphs. If x is a critical vertex of G, then 
deg(x) > x(G) — 1. Using this and 3 deg(x) = 2 - |E(G)| we may formulate Brooks’ 
theorem as follows. 


Theorem 4.3. Let G be k-colour-critical. Then the minimum degree &(G) is > 
k — 1. In particular 2 -|E(G)| = (k - 1) -|V(G)|, and equality holds iff G is complete 
or an odd circuit. 


Extensions of this version of Brooks’ theorem have been given by Dirac (1957a) 
and Gallai (1963a). An elegant proof of Dirac’s theorem was given by Weinstein 
(1975). 1 


Theorem 4.4. Let G be k-colour-critical. If k > 4 and G # Ky, then 2-|E(G)| > 
(k —1)-|V(G)| + (k — 3). 


Theorem 4.5. Let G be k-colour-critical. If k > 4 and G # K,, then 2-|E(G)| > 
(k — 1) -|V(G)| + ((k — 3)/(k? — 3)) - |V(G)L. 


4.2. The colouring number 


Colouring the vertices of a graph G in the order x4,%2,...,Xn, giving each 
vertex a smallest possible colour from 1,2,..., produces a colouring using 
at most max; deg(x;, G[x,x2,...,x;]) +1 colours, since x; is joined to only 
deg(x;, G[x1,x2,...,x;]) previously coloured vertices. G[X] denotes the subgraph 
of G induced by X. Taking the minimum over all possible permutations P of the 
vertices, gives 
xX(G) < min max deg(xpii), G[xp1),Xp()s---XP@)) +1. 

The number on the right hand side is called the colouring number of G, de- 
noted col(G). The name “colouring number” was first used by Erdés and Hajnal 
(1966). Obviously y(G) < col(G) < A(G) +1, with col(G) = 4(G)+1 iff G has a 
A(G)-regular connected component. The colouring number seems like another in- 
tractable graph constant as the minimum is taken over nm! permutations: however. 
it is not, as was first proved independently by Finck and Sachs (1969) and Matula 
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(1968). Before showing this, let us consider a very closely related bound, due to 
Szekeres and Wilf (1968). 


Theorem 4.6. Let f be a real valued function defined on all graphs, such that f 
is non-decreasing (i.e. Gy C Gz => f(Gi) < f(G2)) and f(G) = 8(G) +1 for all G. 
Then x(G) < f(G). Moreover, the unique best such function f is 


SW(G) = max mindeg(x, G[V’]) +1 = max 6(G[V’]) +1, 
xeV’ 


where the maximum is taken over all subsets V’ of V(G). 


Proof. Let G be k-chromatic and G’ be a k-colour-critical subgraph of G with 
- vertex set V’. By Theorem 4.3, 6(G’) > k — 1, hence k < 6(G’) +1 < 8&(G[V])+1< 
A(GIV') < fC). 

For the segond part, it is obvious that SW satisfies the two properties. And 
any G has a subset V’ of V(G) such that SW(G) = 6(G[V’]) +1. Moreover, 
6(G[V'])+1< f(G[V']) < f(G) for any f. Therefore SW(G) < f(G) for all G and 
all f satisfying the two properties. 0 


Again, since there are 2!” possible sets V’, the graph constant SW seems to 
be intractable, but it is not. The following surprising, but easy, theorem of Finck 
and Sachs (1969) shows that col(G) can be obtained from a smallest last ordering 
of the vertices of G. 


Theorem 4.7. For any graph G the numbers col(G) and SW(G) are equal. More- 
over, if X1,X2,...,X, is a smallest last ordering of the vertices of G, then 


col(G) = max deg(x;, G[x1,.x2,..-,x;]) +1 


= max 6(G[x1,x2,...,x;]}+ 1 =SW(G). 


Proof. Let X1,X2,---,;X,_ be a smallest last ordering. It is obvious from the defini- 
tions of col and SW that 


col(G) < max deg(x;, G[x1,x2,...,x;]) +1 
= max 6(G[x), x2,...,.x)]) +1 <SW(G). 


On the other hand, it is a matter of routine to check that col has the two properties 
of Theorem 4.6 and therefore, by that theorem, SW(G) <col(G). O 


Graphs G for which SW(G) < k +1 have also been called k-degenerate by Lick 
and White (1970). Thus the property to be of colouring number k +1 is often 
called to be of degeneracy k. 
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4.3. Lower bounds 


The size w(G) of a largest complete subgraph of G is an obvious lower bound for 
x(G); however, Karp (1972) noted that & is as intractable as y. Moreover, w is 
often not a very good lower bound, as indicated in Theorem 2.1. By taking into 
consideration a more general class of subgraphs than the complete ones, Hajés 
(1961) overcame this second deficiency of w as a lower bound. A graph G is k- 
constructible if either G = K, or G is obtained from two disjoint k-constructible 
graphs G, and G, with edges x;y, € E(G,) and x2y2 € E(G2) by removing x,y, 
and x2y2, identifying x, and x2 to one vertex x, and joining y, and y, by a new edge 
or G is obtained from a k-constructible graph G, by identifying two non-adjacent 
vertices x; and x2, removing possible parallel edges that might arise. The two 
operations above we shall call Hajés’ construction and identification, respectively. 

It is easy to see that if G contains a k-constructible subgraph G’ then x(G) > 
x(G’) > k. Hajés’ theorem states that this lower bound for y(G) is best possible 
for any G, and thus the chromatic number is characterized in constructive terms, 


Theorem 4.8. Let G be any graph. Then x(G) >k iff G has a k-constructible 
subgraph. In particular, if G is k-colour-critical then G is k-constructible. 


Proof. We shall assume that the theorem is false, i.e. there is a G with y(G) 2k 
and without a k-constructible subgraph. G may be assumed to be maxima in the 
sense that the addition of any new edge e to G gives rise to a k-constructible 
subgraph G, of GUe. If non-adjacency is an equivalence relation on V(G) then 
the number of equivalence classes is > k, since x(G) > k. But then G contains the 
k-constructible graph K,. Therefore, there are three vertices x, y and z with xy ¢ 
E(G), yz ¢ E(G) and xz € E(G). Then disjoint copies of the k-constructible graphs 
Gyy and Gy, may be combined by Hajés’ construction, removing the edges xy and 
yz, identifying the two copies of y and adding xz. For each vertex ¢ both in Gyy 
and Gy, we then identify the two copies of t, thereby obtaining a k-constructible 
subgraph of G. Contradiction. oO 


Thus, to prove y(G) 2 k we need only exhibit a k-constructible subgraph of G 
and by Theorem 4.8 there is at least one. The problem is that no reasonably sized 
upper bound on the length of the k-construction in terms of |V(G)| is known, and 
probably no such polynomial bound exists. A detailed exposition of known results 
and unsolved problems related to Theorem 4.8 has been presented by Hanson et 
al. (1986). 

A result reminiscent of Hajés’ theorem 4.8 was obtained by D.J. Kleitman, 
and Lovasz (1982, 1994) as a dual to an important corresponding result for the 
maximum size of a stable set by Li and Li (1981). For a graph G on n vertices 
1,2,..., define the graph polynomial f(G: x1,x2,..-,%n) = U(x; — x;), where the 
product is taken over all pairs (i, 7) with i < j and ij € E(G). The graph polynomial 
was first defined and used by J.J. Sylvester and J. Petersen in 1878 and 1891. 
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Theorem 4.9. Let G be any graph. Then y(G)2k iff there exist graphs 
G, G2,---,Gm on V(G), each containing a K, as a subgraph, such that 


m 


i F(G! x1) X250005%n) = Do Gf (Gi: 1, X25-- An); 


i=] 


where €; is a suitable sign (+1 or —1). 


Thus, to prove x(G) 2k we need only exhibit the graphs G; and the sum of 
their polynomials. A problem again is that no reasonably sized upper bound on m 
in terms of 2 is known; probably no such polynomial bound exists. 

A different type of bound in terms of the eigenvalues A of the adjacency matrix 
was obtained by Hoffman in 1970, see the survey by Hoffman (1975). The upper 
bound is a possible f in Theorem 4.6. 


~ 


Theorem 4.10. 1 — (Amax/Amin) < ¥ < 1+ Amax- 


Finally, let us mention a result of Lovasz (1978a), used to prove X(KG,,,) > A+2 
for the Kneser graphs, but whose graph theoretic meaning is not well understood. 
The neighbourhood complex of a graph G is a simplicial complex whose vertices 
are the vertices of G and whose simplices are those subsets of V(G) which have a 
neighbour in common in G. 


Theorem 4.11. /f the neighbourhood complex of G is a (k —1)-connected topo- 
logical space, then y(G) > k +2. 


5. Colour-critical graphs 


The importance of the notion of a critical graph is that problems for k-chromatic 
graphs in general may often be reduced to k-colour-critical graphs, and these are 
less arbitrary. And even if the k-colour-critical graphs for k > 4 are not very re- 
stricted, as we saw in section 2, they do give rise to interesting results and problems. 


3.1. Subgraphs of colour-critical graphs 


The 4-colour-critical graphs with many edges described in section 2, and general- 
izations of them, show that any (k — 2)-chromatic graph for k > 4 is a subgraph 
of some k-colour-critical graph. A (k — 1)-chromatic graph G’ with an edge xy, 
such that any (k — 1)-colouring of G’ —xy give x and y different colours, is not 
a subgraph of any k-colour-critical graph G. The reason for this is that G’ — xy 
would be a subgraph of G — xy, and in a (k — 1)-colouring of G — xy the vertices 
* and y must be coloured the same. Greenwell and Lovasz (1974) proved that this 
oe what can go wrong. A proof of the “if” part can be based on Theorem 
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Theorem 5.1. For k > 4, a graph G' is a proper subgraph of some k-colour-critical 
graph iff G’ is (k — 1)- “colourable and for all edges xy of G' the graph G' — xy has 
a (k — 1)-colouring ¢,, in which x and y are coloured the same. 


As far as more special subgraphs are concerned, there are of course many 
(k — 1)-colour-critical subgraphs in a k-critical graph. Toft (1974b) proved the fol- 
lowing. 


Theorem 5.2, Let G be k-colour-critical, k > 3, and let x,y, and xzy2 be two differ- 
ent edges of G. Then there is a (k — 1)-colour-critical subgraph G' of G containing 
Xx y1, but not x2y2. 


Proof. Consider a (k — 1)-colouring ¢ of G— x,y; with 9(x,) = ¢(y,) =1. One 
of xy and y2 has a colour different from 1, say g(x2.) = k — 1. Let A be the set of all 
nodes of colour k — 1. Then G — x,y, — A is (kK — 2)-colourable, however, G — A is 
(k — 1)-chromatic, since A is stable. Any (k — 1)-colour-critical subgraph of G—A 
is a possible G’. O 


Corollary 5.3. Let G be k-colour-critical, k > 3. Then G is (k — 1)-edge-connected. 


Proof. This is true for odd circuits, i.e. for & = 3. By induction we may assume 
it to be true for k —1. Let F be a cut in G. Obviously |F| > 2. Let e, and e, 
be edges in F. Then by Theorem 5.2 there is a (k — 1)-colour-critical subgraph 
G’ containing e|, but not e2. Moreover, by induction |F M E(G’)| 2k — 2. Then 
|F| >& —1 because of e2, O 9 


Corollary 5.4. Let G be k-colour-critical, k > 4, with all (k — 1)-colour-critical sub- 
graphs isomorphic to K,_,. Then G = K,. 


Proof. Let x, y and z form a K3 in G. By Theorem 5.2 and the assumption, 
there is a G, = K,_, containing x and y, but not z, and a Gp = K,_, containing y 
and z, but not x. Then (G, — xy) U (G2 — yz) U xz is not (k — 2)-colourable, hence 
contains a (kK — 1)-colour-critical subgraph G’. By assumption G‘ = K,_,. This is 
only possible if G; UG, Uxz = K, © G, and hence G= K,. O 


Corollary 5.3 is a classical result of Dirac (1953) generalizing 6(G) > k — 
Corollary 5.4 and its proof is due to Stiebitz (1987). 

J. NeSetfil and V. Rédl, at the conference in Keszthely in Hungary in 1973, 
asked if a large k-colour-critical graph always contains a large (k — 1)-colour-critical 
subgraph. For k 2 5 this is still open. 

Finally, let us mention a deep result of Gallai (1963b), proved ingeniously by 
factorization theory applied to the complement. By Theorem 2.6, it implies that if 
we want a catalogue of all k-colour-critical graphs on < k+¢ vertices for a fixed 
t, then it is sufficient to consider the cases kK < t +1. This has been done for t < 4. 
Let us in passing note the easy fact that no k-colour-critical graph can have exactly 
k +1 vertices, as first pointed out by Dirac (1952c). 


Theorem 5.5. If G is k-colour-critical and |V(G)| < 2k — 2, then the complement 
of G is disconnected, i.e. G contains a complete bipartite subgraph H with V(H) = 
V(G). 
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5.2. Low and high vertices 


As an important extension of Brooks’ theorem 4.1, Gallai (1963a) characterized 
the subgraph G[L] of a k-colour-critical graph G induced by the set L of low 
vertices, i.e. the vertices of degree k — 1. 


Theorem 5.6. Let G be k-colour-critical, k > 4. Then the blocks of the subgraph 
G[(L] of G are odd circuits and/or complete graphs. 


Proof. The idea of the proof is similar to the proof of Brooks’ theorem 4.1. Let 
C be an even circuit of G[L] with vertices x), x2,...,x2,, and let us assume that 
x, is not incident with any chord of C. Let y be a k-colouring of G with colours 
1,2,...,& such that only x, has the colour k, x2, has colour 1, and x2 has colour 
2. Some x; has a colour different from 1 and 2, say 3, since C is even. The k —3 
neighbours of x outside C have colours 3,4,...,k —1. 

Now, push the. colour k around C, as the undesired colour s +1 was pushed 
in the proof of Theorem 4.1, until the colour k is back at x. This gives a new 
k-colouring of G with only x, of colour &, but with the other colours of C moved: 
x; has the original colour of x;,, for 2 <i < 2n—1 and x2, has the original colour 
of x2. Pushing the colour & around in C a number of times, the vertex x», will 
eventually get the colour 3. But then x; is joined to two vertices of colour 3, and 
hence it may be recoloured by one of the colours 1,2,...,k — 1 to which it is not 
joined. Thus we obtain a (k — 1)-colouring of G, which is impossible. 

The conclusion is that any vertex of any even circuit C of G[L] must be incident 
with a chord of C. But then it is easy to prove by induction that an even circuit in 
G[L] has all its chords. This implies the desired block structure by straightforward 
arguments. 0 


If G is k-colour-critical, k > 4 and G # K,, then by Brooks’ theorem 4.1 G has at 
least one high vertex, i.e. a vertex of degree > k. It is a curious fact, first noted by 
Borodin and Kostochka (1977), that no k-colour-critical graph with high vertices, 
all of degree equal to k, seems to be known for k > 9. 

Gallai (1963a) proved constructively that Theorem 5.6 is best possible. G.A. 
Dirac and Sachs and Stiebitz (1983) proved constructively that any graph that is 
a proper subgraph of a k-colour-critical graph can be made the subgraph G[H] 
induced by the high vertices H of some k-colour-critical G. Dirac (1957a) and 
Gallai (1963a) characterized the situation when G[H] = Ky. Further work in this 
direction by Sachs and Stiebitz (1989) suggests that as long as y(G[H]) < k — 2 the 


Structure of G is manageable. Finally, a proof of Theorem 4.5 can be based on 
Theorem 5.6, 


5.3. Circuits, subdivisions and Kempe chains 


The answer to the Keszthely problem of J. NeSetfil and V. Rédl mentioned above 
's affirmative for k = 4, since Kelly and Kelly (1954) proved the existence of a long 
circuit in a large k-colour-critical graph G, and therefore also a long odd circuit. 
An alternative argument has been presented by Voss (1977, 1991). The growth of 
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the length of a longest circuit as a function of |V(G)| may however be slow. Taking 
a connected graph in which each block is a K,_, or a K2, with all vertices of degree 
k —2 or k—1 and degree k — 2 only in endblocks, together with an extra vertex 
x joined to all vertices of degree & — 2, gives a k-colour-critical graph G in which 
a longest circuit has length less than 2(k — 1) -log|V(G)|/log(k — 2), as proved by 
Gallai (1963a). It seems not to be known if this example is best possible. A result 
from the other side, due to Erddés and Hajnal (1966), says that a k-colour-critical 
G always has an odd circuit of length > k — 1. A generalization of this result was 
proved in a very elegant way by Neumann-Lara (1982), using generalized Kempe 
chains. 

Suppose ¢ is a k-colouring of a graph G with colours 1,2,...,k. A connected 
component in the subgraph induced by the vertices of two of the colours, say i 
and j, is called a Kempe chain. The importance of this concept lies in the fact that 
changing the two colours in a Kempe chain gives a new k-colouring of G. Such 
interchanges, originally used by Kempe (1879) in his attack on the four colour 
problem, is still the main tool for proving configurations in planar graphs to be 
reducible. A well written exposition on Kempe chains and their uses was given by 
Whitney and Tutte (1972). 

More generally, let P be any permutation of the colours 1,2, ...,k of ». Consider 
a vertex x, all neighbours N, of colour P(p(x)) of x, all neighbours N2 of colour 
P(P((x))) of vertices in Nj, all neighbours N3 of colour P?(y(x)) of vertices in 
N>2, etc. Changing the colour ¢(y) of all vertices y in N =x UN, UN, U9: from 
y(y) to P(p(y)) gives a new k-colouring of G. We call N. a generalized Kempe 
chain from x w.rt. P. Letting P be cycle permutations of length j one can easily 
prove the following. 


Theorem 5.7, Let G be k-colour-critical. Then any edge xy of G is contained in 
at least as many circuits of length 1 modulo j for j =2,3,...,k —1 as an edge in 
K,. In particular, for k odd any edge is in at least (k —2)! different odd circuits of 
length > k. 


A circuit may be thought of as a subdivision K3S of K3, i.e. it is obtained by 
inserting new vertices of degree 2 on the edges. G. Hajds suggested the following 
generalization of the four colour theorem. 


Conjecture 5.8. Let G be 5-chromatic. Then G contains a subdivision K;S of Ks. 


Any 4-chromatic graph contains a K,S as proved by Dirac (1952a), but this does 
not depend very strongly on the graph being 4-chromatic, since already 5(G) 2 3 
implies the existence of a K,S. For k > 7 the corresponding statement is not true, as 
proved by Catlin (1979), who exhibited the basic 8-colour-critical counterexamples 
of five complete 3-graphs joined completely to each other as a 5-cycle. Erdés and 
Fajtlowicz (1981) proved the following. 


Theorem 5.9. The statement “G contains a subdivision of a Kya)” is false for 
almost all graphs. 
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Proof. Consider all graphs with vertex set 1,2,...,”. If we consider each edge to 
appear with probability 5, then each of the 2@ possible graphs G has probability 
2-“. The average number of stable sets of size r is (")-29-©/2@, For r = [(2+ 
£) log n/ log 2] this average number tends to 0 for m — oo. We express this by saying 
that almost all graphs have all stable sets of size at most (2+ €)logn/log2 and 
hence chromatic number > (log2/(2 + €)) - (n/logn). 

Let m ~ 3,/n. The average number of edges on any set of size m is. 3(5), and 
it can be shown that almost all graphs have < G + £)- (3) edges on every set of 
m vertices. Therefore the existence of a K,,S implies the existence of at least 
(3 — £)(3) vertices to subdivide the edges of K,,. But G ~ €)(3) > n, so almost all 
graphs contain no K,,S. Since m is less than y for almost all graphs the theorem 
follows. © 


More precise estimates than those in the above proof were given by Bollobds 
_ and Catlin (198%). 


5.4, Contraction-critical graphs 


The existence of many different types of k-colour-critical graphs may tempt us 
to look for a less fine ordering than C in the set of all graphs, to obtain more 
restrictive minimal graphs. Let G, 2 G, denote that Gz can be obtained from G, 
by deletions of vertices and/or edges and contractions of edges, in any order. The 
terminology that G2 is a minor of G, is often used. The k-chromatic graphs that 
are minimal w.r.t. > are called k-contraction-critical. Hadwiger’s conjecture 1.3 then 
becomes as follows. 


Conjecture 5.10. y(G) 2k implies G > K,, ie. K, is the only k-contraction- 
critical graph. 


Dirac (1960, 1964a) obtained the basic properties of k-contraction-critical graphs. 


Theorem 5.11. Let G be k-contraction-critical # K,. Then 5(G) 2 k > 5. For each 
vertex x of G there are at most deg(x) — k +2 stable vertices among the neighbours 
of x. Moreover, G is 5-connected. 


Ke is non-planar and 5(G) > 6 implies non-planarity, hence the first statement 
of Theorem 5.11 for kK =6 implies the five colour theorem for planar graphs. A 
deeper generalization is the following result, due to Dirac (1964a). 


Theorem 5.12. If y(G) > 6 then G > K;, where Kz denotes the complete 6-graph 
with one edge missing. 


Proof. G > G* where G* is 6-contraction-critical. If Gt = Kg we are done, hence 
assume G* # Kg. A best possible extremal result of Dirac (1964b) proved by in- 
duction over |V(H)| says that 2|E(H)| > 7|V(H)|—15 implies H > K,. By this 
and Theorem 5.11, 6(G") = 6. Let deg(x) = 6 and let N be the six neighbours of 
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x in G*. By Theorem 5.11 the graph N* = G*(N] does not have a stable set of size 
3, hence it must contain a triangle. 

Suppose first that N* contains two disjoint triangles on the vertices x), x2, +3 
and x4,x5,%5, respectively. Then by Theorem 5.11 G-— x contains three disjoint 
paths P,, P.,P3, where P; joins x; and x;,3. The removal of x, x1,.x3 and x5 leaves a 
connected graph by Theorem 5.11, hence it contains a path P between P, — x5 and, 
say, P3 — x3. Contracting P into an edge and P,P; —xs5 and P;—x; into single 
vertices, we obtain a K,. 

If N* does not have two disjoint triangles, then since there are no stable sets 
of three vertices, N* contains a K;. With x we get a KZ in G*. Since G* is 5- 
connected, any vertex y outside the K, is joined to it by five paths, any pair of 
which have only y in common. But then againG’ > K,. Q 


Further developments of the above results have been obtained by Mader 
(1968a,b), Jakobsen (1971) and Toft (1972b). 

Finally, we turn our attention to Wagner’s equivalence theorem. Wagner (1937) 
characterized those graphs not contractible to Ks. A simplified proof of Wagner’s 
theorem was given by Halin (1964). If one is only interested in the equivalence 
theorem a shortcut is possible, since only the following corollary of Wagner’s the- 
orem is needed. This was pointed out by Young (1971), but the corollary was also 
explicitly mentioned by Ore (1967). 9 


Theorem 5.13. [f G is 4-connected and contains a K33S, then G > Ks. 


The proof is a rather simple case analysis. A slight extension of Theorem 5.13 is 
the basis of Halin’s proof of the full Wagner theorem. The surprising consequence 
of Theorem 5.13 is the following. 


Theorem 5.14. If G is 5-contraction-critical #4 Ks, then G is planar. In particular 
Hadwiger’s conjecture 5.10 for k =5 is equivalent to the four colour theorem. 


Proof. A 5-contraction-critical G is 4-connected by Theorem 5.11. If G is non- 
planar then by Kuratowski’s theorem G contains a K;S§ or a K33S. By Theorem 5.13 
G 2 Ks, hence G = Ks. 

Since G 2 Ks implies G to be non-planar, Hadwiger's conjecture 5.10 for k = 
5 implies the four colour theorem. Conversely, a counterexample to Hadwiger’s 
conjecture 5.10 for k = 5 implies a counterexample to the four colour theorem by 
the first part of Theorem (5.14). O 


Recently Robertson et al. (1993) have obtained deep structural results on graphs 
not contractible to Ks. As a corollary they obtain that all such graphs are 5- 
colourable, assuming the four colour theorem, thus proving Hadwiger’s conjecture 
5.10 for k = 6. 
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6. Graphs on surfaces 


Most of the early papers on colouring theory deal with maps M on surfaces and 
face colourings of M,i.e., colourings of the connected regions into which M divides 
the surface, such that faces with a common boundary line get different colours. 
By assigning a vertex to each face of M and joining any two of these by an edge 
whenever the corresponding two faces share a common boundary line, we obtain 
the dual graph G of M. The graph G can also be embedded on the surface and a 
colouring of the vertices of G corresponds to a face colouring of M. This translation 
of face colouring to vertex colouring was first pointed out by Kempe (1879). Kempe 
(1879) also mentioned the importance of colouring theory for abstract graphs, 
but until the papers by Whitney (1932) and Brooks (1941) the theory remained 
centered on maps and graphs on surfaces. 

Colouring theory for graphs on surfaces is well developed. The monograph by 

Biggs ct al. (1%76) contains an account of the history until 1936. 


6.1. Planar graphs 


Instead of colouring faces or vertices, Tait (1878-80) considered colourings of the 
edges of a planar graph. A plane triangulation G is a graph embedded in the plane 
for which each face boundary is a K3. Such a G is also called maximal plane; it 
has 3|V(G)| — 6 edges and is at least 3-connected for |V(G)| 2 4. To prove the 
four colour theorem it is, of course, sufficient to prove that plane triangulations 
are 4-colourable. Tait (1878-80) stated the following theorem in a face-colouring 
formulation. 


Theorem 6.1. Let G be a plane triangulation. Then x(G) <4 iff the edges of G 
can be labeled by three labels such that each face boundary has all three labels. 


Proof. Let ¢ be a 4-colouring of G with colours (0,0), (0,1), (1,0) and (1,1). 
These colours may be added coordinatewise modulo 2. For each edge xy define 
&(xy) = g(x) + p(y). Then w is a desired labeling of the edges with labels (1,0), 
(Q,L) and (1,1). 

Conversely, let y be a labeling of the edges with labels (1,0), (0,1) and (1,1) such 
that each face has all three labels. Let C be any circuit of G. The interior of C is 
ie into triangular faces T;. Hence, since we add modulo 2 and (1,0) + (0,1) + 

1,1) = (0,0), 


~ ¥) =>- > ve) = (0). 


ec E(C) - tee, 


Colour an arbitrary vertex z of G by the colour v(z) = (0,0). For any vertex x let 
W be a walk from z to x. Define g(x) = Dececw) ¥(e). Then v(x) is well-defined, 
1.€. p(x) does not depend on the particular W chosen, since y sums to zero around 
any circuit. Moreover, ¢ is a 4-colouring with colours (0,0), (1,0), (0,1) and (1,1). 
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The face-colouring version of Theorem 6.1 states that a plane 2-connected 3- 
regular graph is 4-face-colourable iff it is 3-edge-colourable, where any two adja- 
cent edges in an edge-colouring have different colours. Tait (1878-80) suggested 
that any 2-connected 3-regular graph can be 3-edge-coloured, but the famous graph 
of Petersen (1898) disproves this. 

The following theorem is due to Heawood (1898). It was also originally formu- 
lated for face colourings. 


Theorem 6.2. Let G be a plane triangulation. Then x(G) < 4 iff the faces can be 
labeled by two labels +1 and —1, such that the sum of the labels of the faces around 
any vertex is 0 modulo 3. 


There are, of course, other equivalents of the four colour theorem than those 
early ones implied by Theorems 6.1 and 6.2 above, for example two statements 
obtained from different characterizations of the chromatic number of an arbitrary 
graph in terms of orientations, one due to Minty (1962) and the other indepen- 
dently due to T. Gallai and B. Roy, see chapter 1. 

As far as 2-colourings are concerned it follows from Theorem 1.1 that a 2- 
connected plane graph is 2-colourable iff all circuits are even iff all face boundaries 
are even. For 3-colourings of plane graphs we cannot expect any good characteri- 
zation because of Theorem 3.4; however, for triangulations Heawood (1898) stated 
the following. 


Theorem 6.3. Let G be a plane triangulation. Then G is 3-colourable iff all vertices 
of G have even degree. 


A deep theorem obtained by Grétzsch (1959) says the following. 
Theorem 64. Let G be a plane graph without triangles. Then G is 3-colourable. 


Griinbaum (1963) suggested that Theorem 6.4 remains true, even if there are up 
to three triangles in G. This was proved by Aksionov (1974); see also Aksionov 
and Melnikov (1978) and Steinberg (1993), correcting a mistake discovered by 
T. Gallai in Griinbaum’s original argument. A short proof of Grétzsch’s theorem 
6.4 has recently been given by Thomassen (1993b), who also proved that graphs 
embedded on the torus and having all circuits of length > 5 are 3-colourable. 


6.2. Heawood’s formula 


Curiously, colouring theory for higher surfaces, i.e. connected compact 2- 
dimensional manifolds, is in general simpler than for the plane or sphere. We 
shall see why this is so. The topological prerequisite will be the generalized Euler 
formula 


|F(G)| — |E(G)| + |V(G)| 2 3—A(S), 


where G is a graph embedded on the surface S$, F(G) is the set of faces of G, 
and A(S) is the connectivity of S. The number 3 — A(S) is also called the Euler 
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characteristic of S. The possible S may be classified as follows: The orientable 
surfaces are the sphere Sy with h = 1 and the sphere with g handles S, with h = 
1+ 2g. S, is the torus. The non-orientable surfaces are the projective plane Po with 
h,= 2, the Klein bottle Ky with h = 3, the projective plane with g handles P, with 
h =2+2g, and finally the Klein bottle with g handles K, with h = 3 + 2g. 

For a connected graph G with |V(G)| 23 each face is bounded by at least 3 
edges, hence 3|F(G)| < 2|E(G)|. Then from the generalized Euler formula: 

|E(G)| < 3|V(G)| — 9 + 3k. (6.5) 

The formula of Heawood (1890) is now easy to obtain. 


Theorem 6.6. Let G be embeddable on a surface S of connectivity h > 2. Then 


x(G) < |) = HW. 


Proof. Suppo that G is k-chromatic with k > 7. Let G’ be a k-colour-critical 
subgraph with n vertices and e edges. Since 5(G’) > k — 1 we have by (6.5) 

(k —1)-n<2e < 6n— 18+ 6h. 
Since n 2 k 27 this implies 

(k —7)-k —6(h — 3) <0 


or 
7+V24h -2 
k< ae =H(h). OO 


For the torus S; Heawood’s formula gives y(G) < 7. Heawood (1890) noted 
that the 7-chromatic graph K; can be embedded on the torus, hence the colouring 
problem for the torus has the answer 7. For the Klein bottle the formula also gives 
X(G) <7, but Franklin (1934) proved that the colouring problem for the Klein 
bottle has the answer 6. However, this is the only case for which the Heawood 
number H(h) is not the correct answer to the colouring problem. This is the re- 
markable theorem of Ringel (1954) and Ringel and Youngs (1968). For orientable 
surfaces with 3 < h < 15 this had been established already by Heffter (1891). 


Theorem 6.7. For a surface S of connectivity h > 2, where S is not the Klein bottle, 


the Heawood number H(h) is the maximum chromatic number of graphs embed- 
dable on S. 


The proof of this major result, completed in 1968, was obtained by ingenious 
embeddings of the complete H(h) graphs on the surfaces of connectivity h as 
described also in the excellent monograph by Ringel (1974). This is, of course, 
Sufficient for the proof of Theorem 6.7; however, Dirac (1952c) proved that it is 
also necessary, in fact in a very strong sense. The theorem of Dirac was perhaps 
the first breakthrough in colouring theory for higher surfaces after the paper of 
Heffter (1891). The idea of such a result and a proof in the case of the torus was 
first obtained by P. Ungar. We shall present the simple proof of Dirac (1957b). 
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Theorem 6.8. For h=3 and h 25 any H(h)-chromatic graph G on a surface of 
connectivity h contains a Ky) as @ subgraph. 


Proof. Let k = H(A) and G’ be a k-colour-critical subgraph of G with n vertices 
and e edges. If G’ # K, then by Theorems 4.4 and 6.5 


(k —1)-n+(k —3) <2e < 6n+6(h — 3). (6.9) 


Since there are no k-colour-critical graph on k +1 vertices, as we remarked in 
section 5, we have n > k+2. Thus for A =3 and h 2S this gives a contradiction 
by an easy calculation. O 


Dirac’s Theorem 6.8 also holds in the missing cases # = 2 and 4 as proved by 
Albertson and Hutchinson (1979). The simple type of argument used above also 
shows the following. 


Theorem 6.10. (a) Let G be a k-colour-critical graph # K,, k > 8, on a surface of 
connectivity h > 5. Then |V(G)| < (6h — 15 — k)/(k — 7). 

(b) Let G be a k-contraction-critical graph # K,, k 27, on a surface of connec- 
tivity h > 3. Then |V(G)| < (6h — 18)/(k — 6). 


Dirac (1956, 1957c) obtained a number of results based on the above finiteness. 
For example Hadwiger’s conjecture 5.10 is true for all (H7(k) — 1)-chromatic graphs 
on a surface of connectivity A > 5. Albertson and Hutchinson (1980) proveg the 
corresponding result for 6-chromatic graphs on the torus, using the four colour 
theorem, but a simpler argument has been obtained by Mayer (1989). Thomassen 
(1994a) proved that there are only four 6-critical graphs on the torus, and he 
observed that Gallai’s theorem 5.6 implies that the number of 7-critical graphs on 
any fixed surface is finite. In fact, this follows more directly from Theorem 4.5. 

It is tempting to suggest that any locally planar graph on any surface is 4- 
colourable (with some suitable definition of locally planar). However, Fisk (1978) 
constructed 5-chromatic graphs satisfying any reasonable local planarity definition 
on any surface except the sphere. Albertson and Stromquist (1982) proved that 
locally planar toroidal graphs are 5-colourable, a best possible result by Fisk’s 
examples. Hutchinson (1984) defined “locally planar” to mean that all edges are 
“short” and proved that such locally planar graphs on any surface are 5-colourable. 
This was generalized by Thomassen (1993a), who proved that there exists a number 
n(S) such that any graph embedded on S with any non-contractible circuit of length 
> n(S) is 5-colourable (a circuit is called contractible if its deletion disconnects the 
surface in such a way that one of the resulting components is homeomorphic to a 
plane disc). 


7. Stable sets 


Colouring a graph G by & colours means covering the vertex set V(G) by & stable 
sets. To ask for the existence in G of just one stable set of a given size seems 
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simpler; however, the theory of stable sets in graphs is as complicated as colouring 
theory with many results of a similar nature. For example, the four colour theorem 
implies the existence of a stable set of size at least |V(G)|/4 in a planar graph, 
but no proof of this last result independent of the seemingly much stronger four 
colour theorem is known. 

In this and the next section we look in particular at algorithmic aspects, critical 
graphs and stable set polytopes, although stable sets occur in many other contexts, 
for example in the theorems of Ramsey (1930), Turan (1941), Gallai and Milgram 
(1960) and Hajnal and Szemerédi (1970). 

The monograph by Lovdsz and Plummer (1986) contains in chapter 12 an ex- 
cellent survey of critical graphs and stable set polytopes. 


7.1. Algorithmic aspects 


To determine for a given graph G and a given integer k if G has a stable set 
of size k, is N&complete as proved by Karp (1972). Thus a(G), the size of a 
maximum stable set in G, is as intractable as y(G) is. For special types of graphs, 
however, a(G) is well-characterized and computable by polynomial algorithms. 
The following max-min theorem is due to K6nig (1931) and Gallai (1958). 


Theorem 7.1. Let G be bipartite without isolated vertices. Then «(G) equals the 
size p(G) of a minimum set of edges covering all vertices. 


Proof. a(G) < p(G) is obvious in any graph without isolated vertices. For the 
other direction, delete edges from G as long as a does not change. By Theorem 7.2 
below, the obtained graph H consists of a set A of isolated vertices and a set B 
of disjoint edges. The edges of B together with one edge from each vertex of A 
cover all vertices, hence p(G) < |A|+|B| =a(H)=a(G). O 


The sect B of edges in the proof of Theorem 7.1 is a maximum matching in 
G, since a larger matching M implies the contradiction a(G) < |V(G)| -|M|< 
|V(G)| — |B| = |A|+|B| = a(G). Thus the well-known polynomial algorithms to 
determine a maximum matching in a bipartite graph G immediately also determine 
a{G), and in fact a stable set of size a(G). For line graphs L(G) the polynomial 
matching algorithm of Edmonds (1965) applied to G determines a(L(G)). More 
generally, a polynomial algorithm to determine a(G) for claw-free graphs G was 
obtained by Minty (1980) and Sbihi (1980). 

Not being able to compute a(G) efficiently in general, one might look for upper 
and lower bounds as was done for y(G) in section 4. Lovasz (1982, 1994) gave ex- 
cellent surveys of such bounds, including the theorem of Li and Li (1981) referred 
to in section 4. 


7.2. a-critical graphs 


In his elegant proof of Turan’s theorem, Zykov (1949) introduced the notation of a 
k-saturated graph as a graph not containing a K,,,, but containing a K,,, whenever 
any new edge is added. If x is a vertex joined to all others in a graph G, then G 
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is k-saturated iff G—.x is (kK —1)-saturated. Thus in a study of saturated graphs 
one might exclude such vertices x. Doing this and considering complements the 
class of a-critical graphs is obtained, i.e. a graph G is a-critical if it has no isolated 
vertices and a(G — xy) > a(G) for all edges xy € E(G). 

Because of the NP-completeness of a(G) we cannot hope that @-critical graphs 
have a simple structure; in fact, if there is a polynomial way to demonstrate that a 
graph is a-critical then NP would be equal to co-NP, which is considered unlikely. 
However, various structural properties and a deep classification result have been 
obtained, as we shall see. The following useful result is due to Berge (1972). 


Theorem 7.2. In an a-critical graph G any two adjacent edges uv and vw belong 
to an odd circuit without chords. In particular, a bipartite a-critical graph consists 
of a set of disjoint edges. 


Proof. Let 7,, and T,, denote stable sets of size a(G) +1 in G— uv and G — uw, 
respectively. Let G’ be the subgraph of G induced by the symmetric difference 
T of T, and T,, ie. T = (TyU Tw) — (TN Ty). G' is bipartite with T,\7y and 
Tw\T, forming the two sides. Of course, u € T,\Ty and w € T,,\T,. The vertex v 
is joined by edges to only u and w in G’, so if u and w are in the same connected 
component of G’ then a shortest uw-path in G’ and the edges uv and vw form 
the desired odd circuit. If u and w are in different connected components of G’, 
then the graph induced by T Uv is bipartite. Thus (7, Ty — v) together with at 
least half of the vertices of T Uv is a stable set in G. It has size > a(G¥ This 
contradiction proves the theorem. O 


Obviously, a graph is -critical iff all connected components are a-critical. A con- 
nected a-critical graph different from a single edge is 2-connected by Theorem 7.2. 
Moreover, the a-critical graphs that are 2-connected, but not 3-connected, can be 
characterized constructively. This was done independently by Gallai, Plummer and 
Wessel, see Wessel (1970). The construction corresponds to Theorem 2.5(b) (ii) for 
colour-critical graphs. A special case says that subdividing an edge of a 2-connected 
a-critical graph by an even number of new vertices preserves a-criticality. For later 
use let us note that this also preserves the value of |V(G)|— 2a(G). 

By induction over |/| the following is easy to prove. 


Theorem 7.3. For any stable set I of an a-critical graph G the neighbours N(1) of 
I, ie. the vertices ioe I seed fc to I by edges, satisfy |N(1)| > |I|. In particular, 
|V(G)| — 2a(G) 2 


The property of Theorem 7.3 is important in general, as Tutte (1953) proved 
it equivalent to having a perfect 2-matching, i.e. a system of disjoint circuits and 
edges covering all vertices. 

The following three results were obtained by Hajnal (1965), Erdés et al. (1964) 
and Lovasz (1977). 


Theorem 7.4. Let G be a-critical. Then 
(a) All vertices have degree < |V(G)| — 2a(G) +1. 
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(b) G has at most (a rly edges. 
(c) For any subset T C V(G) covering all edges of G the subgraph induced by T 
has at most (‘T!-S*!) edges. 


(b) follows from (a), whereas the proof of (c) is substantially more involved 
using multilinear algebra. (c) can be used to obtain a classification of all connected 
a-critical graphs G according to the value of |V(G)|—2a(G). The classification 
was conjectured by Gallai and proved by Lovdsz (1978b). 


Theorem 7.5, For any fixed value v 2 1 there exists a finite set of basis graphs such 
that all connected a-critical graphs G with |V(G)| — 2a(G) = vu, and only these, arise 
from the basis graphs by subdividing edges by an even number of vertices. 


In flavour this result is reminiscent of Gallai’s theorem 5.5 for colour-critical 
graphs, implying for these graphs a finite basis for any fixed value 1, where t = 


|V(G)| - x(C). 


8. Perfect graphs 


The opposite extreme of the sparse graphs described in section 2 are the perfect 
graphs. A graph G is perfect if G and each of its induced subgraphs have the prop- 
erty that the chromatic number y equals the size of a largest complete subgraph 
w. The idea of perfect graphs, due to C. Berge, has been one of the most fruitful 
in graph theory, and it also relates to linear programming and to computational 
complexity. For perfect graphs both y, w and the size of a largest stable set a can 
be found by polynomial algorithms. This was proved by Groétschel et al. (1981) 
based on the ellipsoid method for linear programming. However, it is not known 
whether there is a polynomial algorithm to decide if a given graph is perfect or 
not. 


8&1, Examples of perfect graphs 


Berge (1961) considered several classes of perfect graphs, and he conjectured that 
complements of perfect graphs are perfect. For example, any bipartite graph is 
perfect, since y = w =2 if there is at least one edge and y =w =1 if there are 
none. Also the complements of bipartite graphs are perfect. This is equivalent to 
Theorem 7.1, since y(G) is the minimum number of complete subgraphs of G 
covering all vertices of G, and thus equals p(G) for G bipartite without isolated 
vertices. A reformulation of Theorem 7.1 is therefore as follows. 


Theorem 8.1. /f G consists of two disjoint complete graphs and some edges be- 
tween them, then x(G) = (G). 


Theorem 8.1 has been used as a lemma in graph colouring theory by Brown and 
Jung (1969), T. Gallai, and Toft (1972b). For example, the characterization of the 
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k-colour-critical graphs with a cut of & — 1 edges, mentioned in section 2, follows 
from it. 

A graph is called a rigid-circuit graph (or triangulated or chordal) if every circuit 
of length > 4 has a chord, i.e. an edge joining two non-adjacent vertices on the 
circuit. 

Berge (1961) proved that rigid-circuit graphs are perfect, and Hajnal and Surdnyi 
(1958) and Dirac (1961) characterized them constructively and proved that their 
complements are perfect as well. 


Theorem 8.2. G is a rigid-circuit graph iff either G is complete or G can be ob- 
tained from two smaller disjoint rigid-circuit graphs G, and G2 by identifying two 
complete subgraphs of the same size in G, and G». In particular a rigid-circuit graph 
is perfect. 


Proof. If G can be obtained as described then obviously G is a rigid-circuit graph. 
Conversely, let G be a non-complete rigid-circuit graph and let 5 be a minimal set 
of vertices in G such that G — S is disconnected. Let G = G, UG? with G; N G2 = 
G[S] and G,; 4 G[S] # G.. If G[S] contains two vertices x and y not joined by an 
edge, then let P; and P, be shortest paths from x to y in G; and Gy, respectively, 
with only the vertices x and y in S. Then P, U P» is a circuit of length > 4 without 
a chord. This is a contradiction, and therefore G, 1 G2 is complete. Moreover, G, 
and G2 are rigid-circuit graphs, since they are induced subgraphs of the rigid-cizguit 
graph G. So the first part of the theorem follows. Since x(G) = max(y(G;), (on) 
the second part of the theorem follows by induction over |V(G)|. O 


Theorem 8.3. Let G be a rigid-circuit graph. Then for all proper complete sub- 
graphs K of G there is a vertex x outside K for which the neighbours N(x) of x in 
G induce a complete graph. In particular G is perfect. 


Proof. The existence of x is proved by induction over |V(G)|. If G is complete, 
the existence of x is obvious. Otherwise, by Theorem 8.2 G = G, UG), where 
G,M Gy is complete, and G; and G2 are smaller rigid-circuit graphs than G. In 
this case K is a subgraph of either G; or G2, say G1. Then by induction x can be 
chosen in G2 — (G; 9 G2). This proves the existence of x. 

That G is perfect is also obtained by induction over |V(G)|. From the first part 
we have 


w(G) > wo(G —x—N(x))+1=x(G—x—N(x))+1 > x(G), 


where the middle equality holds by induction. Hence w(G) = x(G) and G is per- 
fect. O 


There are several other classes of known perfect graphs, for example compa- 
rability graphs, where V(G) is the set of elements of a partially ordered set and 
xy € E(G) iff x and y are comparable, interval graphs, where V(G) is a set of in- 
tervals on a line, and xy € E(G) iff x and y overlap, and split graphs, where V(G) 
can be partitioned into two classes, one inducing a complete graph and the other 
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being stable. The interval graphs are those rigid-circuit graphs whose complements 
are comparability graphs, and the split graphs are those rigid-circuit graphs whose 
complements are also rigid-circuit graphs. These results are due to Gilmore and 
Hoffman (1964) and Foldes and Hammer (1977). Grétschel et al. (1988) have a 
more comprehensive list of known classes of perfect graphs. 


8.2. The perfect graph theorem 


The conjecture of Berge that the complement of a perfect graph is perfect was 
proved by Fulkerson (1971, 1972) for so-called pluperfect graphs, and by Lovasz 
(1972a) in general. Fulkerson’s proof that the complement of a pluperfect graph 
is pluperfect was based on the duality theory of linear programming. The follow- 
ing replacement theorem, proved by Lovdsz (1972a), implies that the classes of 
pluperfect and perfect graphs are identical. 


Theorem 8.4. © Let G be perfect and let x be a vertex of G. Replace x by a complete 
graph K joined completely to all neighbours N(x) of x in G. Then the obtained 
graph H is perfect. 


Proof. It is sufficient to prove the theorem for K being a complete 2-graph K2. 
Moreover, it is sufficient to prove that y(H) = w(H). 

Obviously w(G) < w(H) < w(G) +1. If w(H) = o(G)+1, then y(H) < x(G)+ 
1=(G)+1=a@(H), and hence y(H) =w(H). Assume therefore that w(H) = 
w(G). Then x is not contained in a complete w(G)-graph of G. Colour G by w(G) 
colours so that x has colour 1. Let A denote the set of those vertices of colour 1 that 
are different from x. Then w(G— A) < w(G), and hence a (w(G) — 1)-colouring 
y of G — A is possible. Colouring the vertices of A and one of the vertices of the 
K» by a new colour, and the other vertex of the K2 by ¢(x), we obtain from ¢ a 
(G)-colouring of H, and w(G) = w(H). Hence x(H)=@(H). O 


More generally, Lovasz (1972a) proved that K in Theorem 8.4 can be any perfect 
graph. Lovasz (1972a,b) also gave two further proofs of the perfect graph theorem, 
ane based on hypergraphs and one characterizing the perfect graphs as follows. 


Theorem 8.5. A graph G is perfect iff every induced subgraph G' of-G satisfies 
a(G') -w(G') > |V(G*)]. 


Lovasz’ ingenious hypergraph proof formulated for graphs gives a simple direct 
Proof of the perfect graph theorem, as pointed out by Fulkerson (1973). 


Theorem 8.6. A graph G is perfect iff the complement G is perfect. 
Proof. Let G be perfect and suppose by induction that the theorem holds for 


all smaller graphs. Since the induced subgraphs of perfect graphs are perfect, it 
is sufficient to prove that y(G) = w(G). Let K,,K2,...,Km be all the maximal 
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complete graphs of G. If there is a K; meeting all maximum stable sets of G, then 
w(G — Kj) < w(G), hence 


x(G) < y(G— K;)+1=w(G — Kj) +1<«(G), 


and we are through. 

Therefore, we may assume that G — K; contains a maximum stable set A; of G 
for all j. For all vertices x; in G, let a; denote the number of these sets A; containing 
x;. Let H be obtained from G by replacing each vertex x; by a complete w;-graph. 
Then 


w(H) = max > wo, < m1, 
xjiEV(K;) 


since A; does not count in the sum, and any other A; counts at most once. More- 
over, 


x(H) > |V(H)|/a(H) = (7 @)/a(G) = a(G)m/a(G) = 
Hence w(H) < x(H), but this contradicts H being perfect by Theorem 8.4. 


8.3. The strong perfect graph conjecture 


Any graph containing an odd circuit of length > 5 or its complement as an induced 
subgraph is not perfect. Berge conjectured that the opposite is true as well. ‘his 
implies Theorem 8.6 and is called the strong perfect graph conjecture. It may be 
formulated in terms of critically imperfect graphs, i.e. non-perfect graphs for which 
any proper induced subgraph is perfect. 


Conjecture 8.7, The only critically imperfect graphs are the odd circuits of length 
2 5 and their complements. 


The following is an equivalent formulation in terms of k-vertex-critical graphs, 
i.e. kK-chromatic graphs for which x(G — x) < k for all vertices x, due to Wessel 
(1977), Comparing with Corollary 5.4 indicates that it is the word “induced” that 
makes the conjecture difficult. 


Conjecture 8.8. Let G be a k-vertex-critical graph, k > 4, for which any k’-vertex- 
critical induced subgraph is a complete &’-graph for all k’ < k. Then G is a complete 
k-graph or the complement of a circuit of length 2k — 1. 


The strong perfect graph conjecture has been proved for special classes of graphs. 
For example, Tucker (1977) proved Conjecture 8.8 for k = 4, and Parthasarathy 
and Ravindra (1976) proved Conjecture 8.7 for graphs having no induced 3-star 
K,3. Thus the strong perfect graph conjecture is equivalent to the statement that 
no critically imperfect graph contains an induced 3-star. 

By Theorem 8.5 a critically imperfect graph G has a(G)-w(G)+1 vertices. 
Several other properties of critically imperfect graphs have been found by among 
others Padberg (1974), Bland et al. (1979) and Chvatal (1985). 
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Theorem 8.9. A critically imperfect graph G on n vertices satisfies 

(a) n= a(G)-w(G)+1. 

(b) G has exactly n complete w(G)-graphs. 

(c) G has exactly n stable sets of size a(G). 

(d) Every vertex of G belongs to exactly w(G) complete w(G)-graphs and to 
exactly a(G) stable sets of size a(G). 

(e) Every complete w(G)-graph is disjoint from exactly one stable set of size a(G), 
and vice versa. 

(f) No set S C V(G) with G—S disconnected contains a vertex joined by edges 
to all other vertices of S, i.e. G does not have a star-cutset. 


84. Sufficient conditions for perfectness 


’ Based on Theorem 8.9 (f), Hayward (1985) proved that weakly triangulated graphs, 
ie. graphs for which neither it nor its complement contains a chordless circuit of 
length > 5, ar€ perfect. There are other similar strengthenings of the condition in 
the strong perfect graph conjecture, sufficient to imply perfectness. Gallai (1962) 
proved that it is sufficient that each odd circuit of length > 5 has two non-crossing 
chords. Olaru (1969) and Sachs (1970) proved that it is sufficient that each odd cir- 
cuit of length > 5 has two crossing chords. A common generalization was obtained 
by Meyniel (1976). The survey of Lovasz (1983) contains an elegant proof. 


Theorem 8.10. Let G be a graph for which any odd circuit of length >5 has at 
least two chords. Then G is perfect. 


From the definition of perfectness it immediately follows that in a perfect graph 
G any induced subgraph H has a stable set meeting all maximum complete sub- 
graphs of H. 

A graph G is called very strong perfect if any vertex of any induced subgraph H 
is contained in a stable set of H mecting all maximal (not just maximum) complete 
subgraphs of H. As proved by Hoang (1987), these very strong perfect graphs are 
exactly the Meyniel-graphs of Theorem 8.10. 

Finally we state the semi-strong perfect graph theorem, conjectured by Chvatal 
(1984) and proved by Reed (1987). Two graphs G and H on the same vertex set 
are called P,-isomorphic if a set of four vertices induces a path in G iff it induces 
a path in H. Note that G and G are always P4-isomorphic. Since, moreover, the 
only two graphs that are P,-isomorphic with a chordless odd circuit of length 2 5 
are the circuit itself and its complement, Theorem 8.11 lies between the Perfect 
graph theorem 8.6 and the Strong perfect graph conjecture 8.7. 


Theorem 8.11. If G and H are P,-isomorphic and G is perfect, then H is perfect. 


8.5. Stable set polytopes 


For any graph G we let the stable set polytope S(G) denote the convex hull in 
RIV of the incidence vectors of the stable sets of G. A basic technique due 
to Grotschel et al. (1981, 1988) is to look for various classes of inequalities valid 
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for S(G) and then develop polynomial algorithms to maximize linear objective 
functions subject to these inequalities. Such algorithms provide upper bounds for 
the maximum weight of a stable set. If the inequalities in addition happen to 
describe the exact facets of S(G) for certain types of graphs, then for such graphs 
a(G) can be determined in polynomial time. This turns out to be the case for 
interesting classes, most notably the perfect graphs. 

Possible classes of inequalities satisfied by the incidence vectors of stable sets 
are 

(a) x 20, 

(b) x; +x; <1 for all edges ij, 

(c) x(K) <1 for all maximal complete subgraphs K, 

(d) x(C) < (JC|—1)/2 for all odd circuits C. 

As an example of the basic technique described above, a weighted version of 
Theorem 7.1 implies that the inequalities (a) and (b) describe 5(G) iff G is bipartite 
without isolated vertices. Similarly, the following holds. 


Theorem 8.12. The inequalities (a) and (c) describe S(G) iff G is perfect. 


Proof. Let P(G) be the polytope described in (a) and (c). Obviously 5(G) C 
P(G). Assume first that G is perfect and let x be a rational vector in P(G). Then 
for a common denominator q of the entries in x the vector qx is integer and it is 
2 0 by (a). For each vertex i of G replace i by a complete gx;-graph. The obtgined 
graph H is perfect by Theorem 8.4, hence y(H) = w(H). 

By (c) w(H) <q. Then H is q-colourable, implying the existence of stable sets 
S|, 5S2,...,Sq in G such that each vertex i of G is contained in exactly qx; of 
them. The sum of the incidence vectors for S;,5S2,...,S, divided by q is thus x, 
and x is therefore a convex combination of vectors from G. Thus x € S(G), and 
5(G) = P(G). 

Assume conversely that S(G) = P(G). By induction over |V(G)| we shall first 
prove that y(G) = w(G). The face {x € S(G) | 5° x; = a(G)} of S(G) is contained 
in a facet of S(G), but since S(G) = P(G) the facet must be of the form {x € 
S(G) | x(K) = 1} for some complete subgraph K. Thus K meets all stable sets of 
size a(G) in G. Then 


x(G) < x(G— K)+1=a(G - K) +1 < a(G), 


where the middle equality is by induction, and we use w(G — K) = a(G — K) < 
a(G) = #(G). 

5(G) = P(G) implies easily that also for any induced subgraph H of G the 
polytope S(H) is described by (a) and (c) Thus as above y(//) = w(H), and hence 
G is perfect. 

We have so far in this proof obtained the implications (G perfect = S(G) = 
P(G) => G perfect). Applying this to G provides the opposite direction. 1 


Note that the above proof includes a proof of the perfect graph theorem based 
on some of the same ideas as the earlier proof. Contrary to what one might expect, 
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optimization over the polytope P(G) described by (a) and (c) is NP-hard, like over 
S(G). This was proved by Grotschel et al. (1981). For the polytope described by 
(a), (b) and (d), however, optimization of linear objective functions can be carried 
out in polynomial time. The graphs G for which this polytope equals S(G) are 
called t-perfect, studied first by Chvatal (1975). 

Finally we mention briefly the subject of orthogonal representations due to 
Lovasz (1979) and developed by Grétschel et al. (1988). For a graph G an or- 
thogonal representation of G is a sequence (u; |i € V(G)) of vectors from R" for 
some integer n, such that ||u;|| = 1 for all i, and wu; and u; are orthogonal for all 
pairs of non-adjacent vertices (for example, all the vectors could be mutually or- 
thogonal). For any c € R" with ||c|] =1 and any stable set S in G, the vectors u; 
for i € S are mutually orthogonal, and so }7,.5(c-uj)* < 1, or 


(e) Se -u;)? +x; <1. for all incidence vectors x of stable sets. 

ieV 

TH(G) is now defined as {x € R'”(l | x > 0 and x satisfies (e) for all possible 
orthonormal representations of G and all possible unit vectors c}. TH(G) is the 
intersection of infinitely many halfspaces, so it is convex, but it is not in general 
a polytope. However, S(G) C TH(G), and by taking {u; |i¢ V\K}U {c} to be 
mutually orthogonal unit vectors and u; =c for j € K, the constraint (e) equals 
(c), and hence TH(G) C P(G), where P(G) is described by (a) and (c). It follows 
by Theorem 8.12 that S(G) = TH(G) for perfect graphs G. 

The most important property of TH(G) is that for arbitrary G, using the el- 
lipsoid algorithm, one can optimize any linear objective function over TH(G) in 
polynomial time, i.e. (G,w) = max{wx | x € TH(G)} can be found in polynomial 
time. In particular, a(G) can be polynomially computed for perfect graphs G, and 
hence also w(G) and x(G). 


9. Edge-colourings 


An edge-colouring of a simple graph or multigraph G is an assignment of colours 
to the edges of G such that no two adjacent edges are assigned the same colour. 
The edge-chromatic number y(G) is the minimum number of colours needed in 
an edge-colouring of G. The edge-chromatic number is called the chromatic index 
by some authors. If L(G) denotes the line graph of G, then y(G) = x(L(G)), and 
one may think of edge-colourings of graphs as vertex colourings of line graphs. 

Compared with vertex- or face-colourings, edge-colourings have received less 
attention until relatively recently, although much studied concepts like matchings 
and latin squares have connections to edge-colourings. The first real breakthrough 
was the theorem of Vizing (1964, 1965a) that y(G) of a simple graph G is either 
A(G) or A(G) +1. Vizing considered multigraphs rather than just simple graphs, as 
also Shannon (1949) had done earlier. In much work on edge-colourings, however, 
authors restrict themselves to simple graphs, and multigraph edge-colouring is a 
topic where further work needs to be done. 
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Criticality plays an important role, as in the case of vertex-colouring. We shall 
call a simple graph or multigraph G k-edge-critical if y(G) = k and y(G’) < k for 
all proper subgraphs G’ of G. Edge-critical graphs were first considered by Vizing 

1965b). 
ane and Wilson (1977, 1978) gave excellent surveys of edge-colourings and 
edge-critical graphs. More recently Hilton and Johnson (1987) described interesting 
conjectures and results on graphs whose vertices are critical with respect to the 
edge-chromatic number. 


9.1. Bipartite graphs 


After Tait’s theorem 6.1 the next important observation on edge-colourings was the 
following theorem of K6nig (1916). In a different formulation it says that the line 
graphs of bipartite graphs are perfect. The proof involves looking at a connected 
component in a subgraph consisting of the edges of two colours @ and B, and 
changing the two colours there. Such a subgraph we shall call an (a, 8)-Kempe 
chain. For edge-colourings a Kempe chain is either a path or a circuit. 


Theorem 9.1. Jf G is a bipartite simple graph or multigraph, then y(G) = A(G). 


Proof. Obviously y(G) > A(G). If y(G) > A(G), then let G’ be a (A(G) + 1)- 
edge-critical subgraph of G. Delete an edge xy from G’ and A(G)-edge-colour the 
graph G’ — xy. Since x is incident with at most 4(G)—1 coloured edges, there 
is at least one colour a of the A(G) colours missing at x, and likewise tifere is 
a colour 8 missing at y. If a and B can be chosen equal, then xy may be given 
that colour, and G’ is A(G)-edge-colourable. Since this is not the case, a and B 
are necessarily different. Therefore, the colour 8 is present at x and there is an 
(a, 8)-Kempe-chain H in G’ starting at x. Since G' is bipartite and B is missing at 
y, the vertex y is not in H. But then a A(G)-edge colouring of G’ can be obtained 
by changing the colours a and GB on the edges of H and giving xy the colour f. 
This is a contradiction. O 


An n-edge-colouring of the complete bipartite graph K,,, corresponds to a latin 
square. Answering a conjecture of T. Evans from 1960 on completing partial latin 
squares, Andersen and Hilton (1983) proved the following result, also character- 
izing the extremal cases. 


Theorem 9.2, A partial edge-colouring ¢ of at most n edges of Kn.» can be extended 
to an n-edge-colouring of Kn, except when one of the following two cases occurs: 
(a) For some uncoloured edge xy, there are n coloured edges of different colours, 
each incident with x or y. 
(b) For some vertex x and some colour i the colour i is not present at x but is 
present at all vertices y for which xy is an uncoloured edge. 


9.2. The bounds of Shannon and Vizing 


For a graph G obviously A(G) < y(G). Vizing (1964, 1965a) proved that if 4(G) 
denotes the maximum multiplicity of G, i.e. the maximum number of edges joining 
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the same pair of vertices, then y(G) < 4(G) + «(G). This same result was obtained 
independently by Gupta (1967). We shall consider the extension and elegant proof 
due to Kierstead (1984). An s-triangle is a graph with three vertices x, y and z and 
s xz-edges, s — 1 xy-edges and one yz-edge. 


Theorem 9.3. Let G be a graph of multiplicity z(G) 21. Then y(G) < A(G)+ 
u(G). Moreover, if u(G) >2 and y(G) = A(G)+ w(G), then G contains a u(G)- 
triangle. 


Proof. Suppose that the theorem is false. Then there is an s > 2 such that 4(G) < 
s, G does not contain an s-triangle and y(G) 2 A(G)+s=A+s. 

Then G contains a (4 + s)-edge-critical subgraph G’. Let e9 = xox; be an edge of 
G' and let ¢ be a (A+ 5 ~— 1)-edge-colouring of G' — e9. Let G(x) denote the set of 
those of the (A +s — 1) colours not present at the vertex x. Then a path P in G with 
distinct vertices x9,x,,X2,...,X, in this order and edges Cy = XX}, €) =XpX2,. 
€n-1 = Xn—1Xn 18 said to be accéntable if the first edge eg is the uncoloured edge of 
-G', and p(e;) € Us @(x;) for all i > 1. 

If P is acceptable in G’ with respect to a (A+ 5 — 1)-edge-colouring ¢ of G' — é, 
and if there are i and j anda colour B such thatO0 <i <j <nandB € (x;) NG(x);), 
then G’ can be (4 +5 — 1)-edge-coloured. We shall prove this by double induction 
over j and over j — i. Since y(G‘) = 4+, the conclusion is that $(x;) N ¢(x;) must 
be empty for any two vertices of an acceptable path. 

If j = 1 then i = 0, and the colour 8 may be given to eg, thus extending ¢ to a 
(A+s —1)-colouring of G’. 

If j = 2 and j —i =1, then B is missing at x; and x;,,. Let g(e;) € @(4m), where 
m <i, We may recolour e; by 8 and obtain the colouring gy’. The subpath P’ of 
P from xo to x; is an acceptable path wrt. gy’. Moreover, $/(xm) 1 G'(x;) contains 
the colour y(e;). Since m <i < j, the desired result follows by the induction over 
J 

So now consider the general case where j —i 22. Let 6 € O(xj41). If B =6, 
then j can be replaced by i +1, and we are done by the induction over j. If 8B #4 
necessarily, then the colour B is present at x;41, and there is a (8,5)-Kempe- 
chain C starting at x;,,. By the induction over j we may assume that the colour 
8 ts present at all vertices xo,x,,...,x; and that B is present at all the vertices 
X0,%1,.-+5%;.1,Xiat- Thus none of the edges e),e2,...,¢; can have colours B or 5 
by the definition of an acceptable path. The chain C may or may not end at x;. In 
any case change colours 8 and 6 on C and obtain ¢’. 

If C does not end at x,, then the subpath P’ of P from xg to x;41 is acceptable 
w.r.t. y' and B € @'(x;)N $F’ (x41). Hence we are done by the induction over j. If 
C ends at x,, then P is acceptable w.rt. y’ and B € 9'(xi41) NG (x;). Hence we are 
done by the induction over j — i. 

As mentioned, the conclusion is that @(x;) @(x;) =@ for any two vertices on 
any acceptable path. But we shall prove that this is not so on a longest acceptable 
path P. This contradiction proves the theorem. 

If P(x, NG(x;) =0 for Aj, then NycjcqlP(&s)| > (At s -1—A)) +2 > 
n(s —1)+2 different colours are missing at the vertices of the path P — x,. (The 
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additional 2 appears since xq and x, are each incident with at most A — 1 coloured 
edges.) Moreover, all these n(s — 1) + 2 missing colours must be present at x,, since 
otherwise @(x;) 1 G(4n) #4 for some j. Since P has maximal length, x, is joined 
by at least n(s — 1) +2 different edges to P — x,. Since 4(G) < s and G does not 
contain an s-triangle, x, is joined to xg and x; by < 2s — 2 edges, to x2 and x3 by < 
2s — 2 edges, etc. This gives < n(s — 1) + 1 edges from x, to P — x,. Contradiction. 

g 


For a simple graph G it was proved by Beineke (1968) that G is a line graph 
L(H) of a simple graph H iff G does not contain any of nine specified graphs 
as an induced subgraph. Moreover, A(H) = @(G) where w(G) is the size of a 
maximum complete subgraph of G, provided A(H) > 3. Therefore, the case x = 1 
of Theorem 9.3 may be formulated as the result that y(G) < w#(G)+1 when G 
is a line graph of a simple graph. Based on Theorem 9.3, Kierstead and Schmerl 
(1983) were able (by a rather lengthy argument) to eliminate all but two of the 
forbidden nine subgraphs. This result is an important extension of Vizing’s theorem 
for simple graphs. 


Theorem 9.4. Let G be a simple graph not containing K, (the complete 5-graph 
with a missing edge) or K,3 as an induced subgraph. Then x(G) is either w(G) or 
w(G) +1. 


For graphs or multigraphs Shannon (1949) proved the following. ¥ 
Theorem 9.5. y(G) < 3A(G). 


Proof. Suppose k = y(G) > 3A(G) and let G' be a k-edge-critical subgraph of G. 
aa k < A(G) + w(G) by Theorem 9.3, there are two vertices x and y joined by 

2 3A(G) edges. Delete an xy-edge e from G’ and let ¢ be a (k — 1)-edge-colouring 
of G' —e.In g the number of the k — 1 colours missing at x is > (k — 1) — (A(G) — 
1) =k — A(G), and likewise at y. No colour is missing at both ; x and y, hence 


2(k ~ A(G)) + 54(G) ~1<¢k-1 
implying that k < 3A(G). 


9.3. Various problems and results 


Although Theorems 9.3 and 9.5 give tight bounds for y, several interesting related 
problems can be posed. We shall briefly explain some of these. 

Theorem 9.3 implies a way of classifying simple graphs into two classes. The 
graph G is said to be of class 1 if ¥(G) = A(G) and of class 2 if y(G) = A(G) +1. 
It is an NP-complete problem for a given 3-regular simple graph G to decide if G is 
of class 1 or 2, as proved by Holyer (1981). A 3-regular 2-connected simple graph 
of class 2 has been conjectured by Tutte (1966, 1969) to contain a subdivision of the 
Petersen graph. Such graphs without any cutset of < 3 edges, except three edges 
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at the same vertex, and without circuits of length < 4 have been called snarks by 
Gardner (1976). The Petersen graph is the smallest snark. Snark-hunting goes back 
to Descartes (1948b) and got a big push forward by Isaacs (1975). Very recently, 
snarks with arbitrarily long shortest circuits have been constructed by M. Kochol 
and M. Skoviera. 

Shannon’s theorem 9.5 is best possible as shown by a multigraph on three ver- 
tices, any two joined by the same number of edges. These graphs are examples of 
ring-graphs, i.e. odd circuits with parallel edges. For multigraphs, Jakobsen (1975) 
conjectured such ring-graphs to be extremal w.r.t. the number of vertices. More 
precisely, Jakobsen conjectured the following. 


Conjecture 9.6. Let m be odd 2 3 and let G be a k-edge-critical multigraph with 
m m—3 


| - A(G) + 


Then |V(G)| <m—2. 


k> 


m-—-1 


For m = 3 this is Shannon’s theorem 9.5. For m = 5,7,9 and 11 the conjecture has 
been proved by Goldberg (1973), Andersen (1977), Goldberg (1984) and Nishizeki 
and Kashiwagi (1985). 

Goldberg (1984) posed an interesting extension of Conjecture 9.6. Goldberg 
thought of this conjecture already around 1970, and equivalent conjectures have 
been posed by Andersen (1977) and Seymour (1979). Let 


E(H) 
ie ann 7 


It is easy to see that w(G) < y(G). Goldberg's conjecture states the following. 
Conjecture 9.7. If y(G) 2 4+2 then y(G) = w(G). 


Conjecture 9.7 implies Conjecture 9.6, which in turn implies that there are only 
finitely many edge-critical multigraphs G with y(G) > A(G) +2 for any fixed A(G). 
Whether this is so is not known. Conjecture 9.7 also implies all edge-critical multi- 
graphs G with y(G) > A(G)+2 to have an odd number of vertices. This critical 
graph conjecture was disproved by Goldberg (1979; 1981) for simple edge-critical 
graphs with y(G) = A(G)+1. 


10. Concluding remarks 


This survey has demonstrated that graph colouring has come a long way since its 
beginning more than a century ago, but also that challenging unsolved problems 
remain. 


We have concentrated on classical vertex- and edge-colouring. Many authors 
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have considered extensions in various directions. We shall finish by briefly describ- 
ing some of these. 

(a) In a usual vertex-colouring the objective may be different from using as few 
colours as possible. For example, the maximum number of colours possible in a 
vertex-colouring, where any two different colours are present at vertices joined 
by an edge, is the achromatic number, introduced by Harary et al. (1967). The 
maximum number of colours from N for which any vertex of colour i is joined 
to vertices of all colours 1,2,...,i—1 is the Grundy number, studied by Christen 
and Selkow (1979). 

(b) A graph homomorphism of G into H is a mapping ¢ : V(G) > V(H) for 
which any edge xy of G is taken into an edge y(x)p(y) of H. For H= kK, a 
homomorphism is just a &-colouring and for general H it has been called an H- 
colouring. Albertson and Collins (1985) include a brief guide to graph homomor- 
phism. Hell and NeSetiil (1990) proved that the existence problem for H-colouring 
is NP-complete iff H is non-bipartite. Also the directed case has been studied, for 
example Bang-Jensen et al. (1988) solved the complexity problem for tournaments. 

(c) For any graph-property P an assignment of colours to the vertices (or edges) 
of G, such that each colour class induces a graph with property P, has been called 
a conditional colouring by Harary (1985). For example, P may be “k-degenerate”, 
i.e. all subgraphs have vertices of degree < k, as studied by Lick and White (1970), 
or P may be “without paths of length k”, as studied by Chartrand et al. (1968). 
The edge-case for the property “circuit-free” has a very satisfactory solution by 
W. T. Tutte and C. St. J. A. Nash-Williams. It is a special case of covering a matroid 
by a minimum number of independent sets, solved by J. Edmonds (for details, see 
chapter 8 of the monograph by Welsh 1976). 

(d) In an acyclic colouring of a graph G any two colour classes together induce a 
subgraph of G without circuits. Borodin (1979) proved a conjecture of Griinbaum 
(1973) that any planar graph has an acyclic 5-colouring. Borodin’s proof is rem- 
iniscent of the four colour theorem proof by K. Appel, W. Haken and J. Kock, 
involving some 450 reducible configurations (but no computers). No case of a sur- 
face S, except the sphere, where the maximum acyclic chromatic number for graphs 
on § is strictly greater than the maximum chromatic number, seems to be known. 
This was pointed out by M. O. Albertson and D. M. Berman, and independently 
by O. V. Borodin. 

(e) Colourings may involve colouring vertices and/or edges and/or faces at the 
same time. For example, the total chromatic number is the smallest number of 
colours in a simultaneous colouring of vertices and edges, where any two incident 
cxadjacent elements get different colours. For simple graphs, Behzad (1965) and 
independently Vizing (1968) conjectured that A(G) + 2 colours always suffice in a 
total colouring. Another example is the best possible 6-colour-theorem of Borodin 
(1984, 1989), conjectured by G. Ringel: The vertices and faces of a planar graph 
may be 6-coloured with adjacent and incident elements getting different colours. 
More generally, Borodin (1984, 1989) proved: A 1-planar graph, i.e. a graph em- 
bedded in the plane such that each edge is crossed over by at most one other edge, 
is 6-colourable. 
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(f) Let for each vertex x [edge x] of a graph G a set list(x) of k colours be given. 
The list chromatic number Xjg(G) list edge chromatic number yj.,(G)| is the 
minimum value of k for which a vertex [edge] colouring exists, where each x gets a 
colour from its list, no matter what the lists look like (but they must all be of size k), 

This concept was first studied by Vizing (1976) and by Erdés et al. (1979) under 
the name choosability. Obviously the total chromatic number is at most yjj.¢(G) + 2. 
Perhaps yist(G) = y(G) for all G; for bipartite G this question, known as Dinitz’ 
problem was recently solved in the affirmative by F Galvin with a surprisingly 
short proof. The obvious bound j;.4(G} < 24 — 1 was improved by Bollobds and 
Harris (1985), replacing the 2 by a smaller constant for large A. 

Recently y4.¢ has received a considerable amount of attention following a new 
type of results by Alon and Tarsi (1992), proved by algebraic techniques applied. 
to the graph polynomial, see also the recent survey by Alon (1993). A basic result 
of Alon and Tarsi is: If G is a directed graph with maximum outdegree d, and if’ 
the number of Eulerian (spanning) subgraphs of G (i.e. for all vertices x € V(G) 
the outdegree of x in the subgraph equals the indegree of x) with an even number: 
of edges differs from the number of Eulerian (spanning) subgraphs of G with an 
odd number of edges, then yj4(G) < d+ 1. 

In particular the above result of Alon and Tarsi (1992) was used by Fleischner: 
and Stiebitz (1992) to prove that any 4-regular graph on 3n vertices having a de- 
composition into a Hamiltonian circuit and n pairwise edge-disjoint triangles, is 
3-colourable. This was conjectured by P. Erdés and is equivalent to the statement: 
that for any partition of the integers into triples there is another partition into 
three classes such that each class contains a member from each triple, but does 
not contain a consecutive pair of integers. 

Recently, Thomassen (1994b) proved that the list chromatic number of a planar’ 
graph is < 5. In particular, Thomassen’s proof provides a new proof of the century 
old five colour theorem, avoiding both the use of Euler’s formula and the recolour- 
ing technique of Kempe, thus making it conceptually simpler than any previous 
proof. 

(g) Set-colourings of graphs assign to each vertex a set of r colours, such that for 
any edge xy the vertices x and y are assigned disjoint sets. Under the name set- 
chromatic number the infimum over r of (the minimum number of colours divided. 
by r) was studied by Bollobds and Thomason (1979). 

(h) Finally, a k-flow is an assignment of a direction and an integer from the 
tange 1,2,...,k —1 to each edge of a graph so that at each vertex the flow out 
equals the flow in. For k small, the solutions to k-colouring and k-flow problems. 
are similar. A good survey was provided by Younger (1983), and Seymour (1981a) 
explored connections to colourings. The concept is due to W. T. Tutte. Tutte (1950, 
1954, 1966) posed these attractive conjectures generalizing the five- and four-colour 
theorems. Seymour (1981b) proved the first conjecture with 5 replaced by 6. For 
further information, see the excellent appendix to this chapter by Seymour. 


Conjecture 10.1. Let G be without a cut consisting of one edge. Then G has a 
5-flow. If G contains no subdivision of the Petersen graph, then G has a 4-flow. 
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1. Introduction 


The theory of nowhere-zero flows was introduced by Tutte (1954), and provides 
an interesting way to generalize theorems about region-colouring planar graphs to 
general graphs. In this “mini-chapter” we shall survey the main ideas of the 
subject. 

Let G be a digraph, and let I’ be an abelian group. A I-circulation in G is a 
function @ : E(G)— TF such that for every vertex v, 


> ($e): e €8 *(v)) = VP): e€ 5 (v)), 


where 5 *(v), 8 (v) are the sets of edges with tail v and head v respectively, and 
the summation is in ©. An R-circulation (R is the additive group of reals) is 
usually just called a circulation. If k > 1 is an integer, a nowhere-zero k-flow in G 
is a circulation @ such that for every edge e, |(e)| is one of 1,2,...,k—1. We 
see that if an undirected graph G has a nowhere-zero k-flow for some assignment 
of directions to its edges, then it has one for every assignment (replace (e) by 
—(e) if the direction of e is changed). Thus, the problem raised by Tutte (1954) 
of determining for which values of k a graph has a nowhere-zero k-flow is a 
problem about graphs, not digraphs. Tutte’s problem is motivated by the 
following observation. 


Theorem 1.1, Take a planar drawing of a planar graph G. The regions of this 
drawing may be coloured with k colours, such that for each edge the two regions 
bordering it have different colours, if and only if G has a nowhere-zero k-flow. 


Proof. Suppose for each region r, a(r)€ {1,...,k} is a “colour”, such that for 
each edge the regions bordering it have different colours. Assign a direction to 
each edge, and an orientation to the plane, and for each edge e let #(e) = a(r,) — 
a(r,), where r,,r, are the regions bordering ¢ to its left and right respectively. 
Then @ is a nowhere-zero k-flow. For the converse, suppose that ¢ is a 
nowhere-zero k-flow. Since every integer-valued circulation in a planar graph is 
an integral combination of the “unit” circulations each of which runs around the 
perimeter of a region, it follows that for each region r there is an integer B(r), 
such that for each edge e, (e) = B(r,) — Br), where again r,,r, are the regions 
bordering ¢ to its left and right respectively. Let a(r) be the residue of B(r) 
modulo &; then @ is a colouring of the regions. O 


Theorem 1.1 allows us to state that a planar graph is k-region-colourable 
without mentioning regions, and hence in terms that make sense for general 
(non-planar) graphs. That suggests that we investigate how far the standard 
results about region-colouring planar graphs can be generalized to results about 
nowhere-zero flows in general graphs; and this is the main topic of the theory of 
nowhere-zero flows. 

First, what about the four-colour theorem? By Theorem 1.1, it is equivalent to 
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the assertion that every planar graph with no isthmus has a nowhere-zero 4-flow. 
(An isthmus is an edge not in any circuit. The isthmus condition is natural, for no 
graph with an isthmus has a nowhere-zero k-flow for any value of k.) Is this true 
for non-planar graphs as well? Unfortunately not, for the Petersen graph has no 
nowhere-zero 4-flow. Nevertheless, in contrast with vertex-colouring general 
graphs, there is a universal upper bound. In other words, the following holds. 


Theorem 1.2. There is a constant k such that every graph with no isthmus has a 
nowhere-zero k-flow. 


This was conjectured by Tutte (1954); indeed, Tutte conjectured that k=5 
satisfied Theorem 1.2. This remains open, but Jaeger (1975) proved that & =8 
satisfies Theorem 1.2, and later Seymour (1981a) showed it for k =6. We shall 
discuss these results later. 


2. Group-valued flows 


All the main results and problems about nowhere-zero flows concern nowhere- 
zero k-flows, that is, R-circulations. But Tutte noticed that using J’-circulations 
(where I" is a finite abelian group) is often helpful in proving results about 
R-circulations. 

Thus, let F be a finite abelian group. A nowhere-zero I-flow in a digraph G is a 
F-circulation ¢ in G such that o(e) #0 for all edges e. One can ask, given a 
digraph G and a finite abelian group F, does G have a nowhere-zero I’-flow? This 
is relevant to the k-flow problem because Tutte noticed that G has a nowhere- 
zero I’-flow if and only if G has a nowhere-zero |I"|-flow. That is the main result 
of this section, and we shall prove this in two steps, as follows. 


Theorem 2.1. Let G be a digraph, and let I, I’ be finite abelian groups with 
[|= |0'|. Then the number of distinct nowhere-zero I-flows in G equals the 
number of distinct nowhere-zero T''-flows in G. 


Proof. Let n(G) be the number of distinct nowhere-zero I’-flows in G, and define 
n'(G) similarly. We prove that n(G)=n’(G) by induction on |E(G)|. If every 
edge of G is a loop, then 


” 


n(G)=(k- 1)" O'=n'G), 


where k = |I"| =|I"’|, and the result holds. We assume then that some edge f is not 
a loop. Let A be the set of all f-circulations ¢ in G with d(e) #0 (e € E(G), 
e#f) and $(f) =0; and let B be the set of all nowhere-zero [-circulations in G. 
Let G, be obtained from G by contracting f. For every ¢ € A U B, the restriction 
of ¢ to E(G,) is a nowhere-zero I’-flow in G,, and conversely every nowhere-zero 
I-flow in G, is the restriction of a unique member of AU B (since f is not a 
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loop). Thus n(G,) =|A U B| =|A| + |B. But 2(G) =|B|, and n(G,) =|Al|, where 
G, is obtained from G deleting f. Hence n(G)=n(G,)~—n(G,), and similarly 
n'(G)=n'(G,) —n'(G,). But from the inductive hypothesis, n(G,) =n'(G,) and 
n(G,) =n'(G,); and so n(G)=n'(G), as required. O 


Theorem 2.2. Let G be a digraph, and let k=1 be an integer. Then G has a 
nowhere-zero Z,-flow if and only if G has a nowhere-zero k-flow. 
(Z, is the additive group of integers modulo &). 


Proof. To prove “if”, let ¢ be a nowhere-zero k-flow in G, and let #’(e) be the 
residue of ¢(e) modulo k. Since ¢(e) 40, +k, it follows that ¢’ (s Z, is 
non-zero, and hence ¢' is a nowhere-zero Z,-flow. 

The converse is less easy. Suppose that there j is a nowhere-zero Z, flow. Hence 
there is a function ¢ : E(G)— R such that 

(1) for every edge e, f(e) is one of +1, +2,...,+(k-1); 
and, denoting } (o(e): e€ 5 *(v)), Ls (d(e): e€ 5 (v)) by H(v"), H( ) respec- 
tively, 

(2) for every vertex v, d(v)=o(v_) modulo k. 

For any such @, let D(d) = Lev) |6(@) — ov DI, and choose ¢ satisfying 
(1) and (2) with D(@) minimum. We shall show that ¢ is a nowhere-zero k-flow. 
If we reverse the direction of an edge e and replace ¢(e) by —¢(e), then the new 
function ¢ still satisfies (1) and (2) in the new digraph, and D(@) is unchanged, 
and so we may assume, to simplify notation, that 

(3) for every edge e, d(e) 20, and so (e) is one of 1,2,...,k—1. 

Let S= {v EV(G): 6") > dv _)}, and T= {vu EV(G): dv") <o )}. 

(4) There is no directed path P from S to T. 

For if there is such a path, define #’(e) = o(e) — k if e E E(P), and ¢'(e) = (e) 
otherwise; then @’ satisfies (1) and (2) (because of (3)) and D(p’) = D() — 2k, 
contrary to our choice of @. 

For X CV(G), we denote by 6 ~(X) the set of edges with tail in X and head in 
V(G) — X, by 5°(X) the set of edges with both ends in X, and by 6 (X) the set of 
edges with head in X and tail in V(G) — X. 

(S) S=T=9. 

For by (4) there is a partition (A,B) of V(G) with SCA, TCB and 
8*(A)=@. Then 


2 $e") - dv)= + ¢@)+ Dd ¢€)- Dy 40) 


e€6 (A) e€5 '(A) ed %A) 
- » #@=- 2d ¢@)<0, 
e@& (A) e€5 (A) 


and since #(v") — @(v-) =0 for all vE A it follows that S =. Similarly T=9. 
From (5), ¢ is a nowhere-zero k-flow, as required. O 


From Theorems 2.1 and 2.2 we deduce the following result of Tutte mentioned 
previously. 
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Theorem 2.3. If I is a finite abelian group, then a digraph G has a nowhere-zero 
I-flow if and only if it has a nowhere-zero |I|-flow. 


Proof. Let k = |I"|. By Theorem 2.1, G has a nowhere-zero I’-flow if and only if it 
has a nowhere-zero Z,-flow. By Theorem 2.2, G has a nowhere-zero Z,-flow if 
and only if it has a nowhere-zero k-flow. O 


3. Applications of Theorem 2.3 


Theorem 2.3 is frequently useful, because I-flows are often easier to handle than 
k-flows. Let us see some of its applications. 

There is a well-known result that any Eulerian planar triangulation is 3-vertex- 
colourable; let us cast this in terms of region-colourings and thence in terms of 
nowhere-zero flows, and thereby try to obtain an extension to non-planar graphs. 
In terms of region-colourings it asserts, via planar duality, that any bipartite cubic 
planar graph is 3-region-colourable, that is (by Theorem 1.1), has a nowhere-zero 
3-flow. This can indeed be extended to general graphs, as follows. 


Theorem 3.1. Every bipartite cubic graph has a nowhere-zero 3-flow. 


Proof. Let (A, B) be a 2-colouring of a bipartite cubic graph G. Direct edge e 
from A to B and define ¢(e) = 1; then @ is a nowhere-zero Z,-flow. By Theorem 
2.3, G has a nowhere-zero 3-flow, as required. UO 


Actually, a converse holds as well: a cubic graph has a nowhere-zero 3-flow if 
and only if it is bipartite. We omit the proof. 
A second application is the following. 


Theorem 3.2. A graph G has a nowhere-zero 4-flow if and only if E(G) is the 
union of two cycles of G. 
(A cycle of G is a set of edges which may be partitioned into circuits.) 


Proof. By Theorem 2.3, G has a nowhere-zero 4-flow if and only if it has a 
nowhere-zero Z, x Z,-flow. Let C,,C,C E(G), and for e& E(G) let d(e) = 
(¢,(e), ¢,(e)), where 


1 ifeEc, 
0 otherwise , 


$,(e) = { 
for i= 1,2. Then ¢ is a nowhere-zero Z, x Z,-flow if and only if C,, C, are cycles 
with union E(G). Since any function ¢ : E(G)— Z, x Z, is describable this way, 
the result follows. O 


Similarly, we have the following. 
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Theorem 3.3. If G is cubic, then G has a nowhere-zero 4-flow if and only if G is 
3-edge-colourable. 


Proof. By Theorem 2.3, G has a nowhere-zero 4-flow if and only if it has a 
nowhere-zero Z, x Z,-flow. Let the non-zero elements of Z, x Z, be a, b, c. Then 
since a+b+c=0, it follows easily that a function ¢: E(G)— {a,b,c} is a 
nowhere-zero Z, X Z,-flow if and only if it is a 3-edge-colouring (with “‘colours” 
a,b,c). O 


4. Nowhere-zero 4-flows 

tT 
In view of Theorem 3.3, investigating which graphs have nowhere-zero 4-flows 
extends investigating which cubic graphs are 3-edge-colourable. Tutte (1966) 
proposed the following. 


Conjecture 4.1. Every graph with no isthmus and no Petersen graph minor has a 
nowhere-zero 4-flow. 


There has been a good deal of work done on Conjecture 4.1. For instance, in 
view of Theorem 3.2, Conjecture 4.1 asserts that if G has no isthmus and no 
Petersen graph minor then E(G) is the union of two cycles. Seymour (1981b) 
showed that this conjecture is equivalent to a conjecture of Tutte (1966) about 
matroids, the “tangential 2-block” conjecture. 

By Theorem 1.1, Conjecture 4.1 would extend the four-colour theorem, for 
planar graphs have no Petersen graph minor. A somewhat richer source of such 
graphs is those which can be obtained from a planar graph by adding one vertex, 
joined arbitrarily. Conjecture 4.1 implies that any such graph with no isthmus has 
a nowhere-zero 4-flow. This last conjecture is equivalent to the following 
conjecture of Grétzsch (unpublished). 


Conjecture 4.2. Let G be a planar graph. Then G is 3-edge colourable if and only 
if 

(i) every vertex of G has valency <3, and 

(ii) no subgraph of G has one vertex of valency 2 and all others of valency 3. 


To see the equivalence, given G in Conjecture 4.2, add a new vertex adjacent 
to vertices of G such that every vertex of G has valency 3 in the enlarged graph. 
If the enlarged graph has a nowhere-zero 4-flow then (by the argument of 
Theorem 3.3) G is 3-edge colourable. The converse implication is a little more 
complicated, using “vertex-splitting”, and we omit it. 

Let us mention another extension of Conjecture 4.2 (Seymour 1979b). 


Conjecture 4.3. Let G be a planar graph, and let &k = 0 be an integer. Then G is 
k-edge-colourable if and only if 
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(i) every vertex of G has valency <k, and 
(ii) for every X C V(G) with |X| odd, there are at most 44(|X|— 1) edges with 
both ends in X. 


Conjecture 4.1 may be regarded as saying that in a sense, graphs with no 
Petersen graph minor are not much worse than planar graphs; and indeed there 
are now a couple of results of this kind. For instance, Alspach et al. (1994) have 
shown the following, generalizing a theorem of Seymour (1979a) for planar 
graphs. 


Theorem 4.4. Let G be a graph with no Petersen graph minor, and let 
p:E(G)—Z* be a function. Then there is a list of circuits of G such that every 
edge e occurs in precisely p(e) of these circuits, if and only if for every X CV(G) 

(i) E (ple): e € &(X)) is even, and 

(ii) p(f) <4 (ple): e € 8(X) for every f €8(X). 

(6&(X) is the set of edges with one end in X and the other in V(G) — X.) 


Theorem 4.4 is related to the double cover conjecture (take p = 2), see chapter 
5. It is also related to the “sums of circuits” property, discussed in chapter 10. 

The Petersen graph is the only exception in an even stronger sense, in a 
theorem of Lovasz (1987). 


Theorem 4.5. Let G be a 3-connected simple graph such that G\{u,v} has a 
perfect matching for every two vertices u, v. Let w:E(G)—~Z be such that 
DI (wle): e € &(v)) és the same for every vertex v. If G is not the Petersen graph, 
then w is expressible as an integral combination of the (0, 1)-characteristic functions 
of the perfect matchings of G. 


In the same paper Lovasz obtained a characterization of the lattice of integral 
combinations of the 1-factors of a general graph G. 
5. The 3-flow conjecture 
The following famous theorem is due to Grétzsch (1958). 


Theorem 5.1. If G is planar and has no circuit of length <3 then G is 3-vertex- 
colourable. 


Via planar duality and Theorem 1.1, this asserts that every 4-edge-connected 
planar graph has a nowhere-zero 3-flow. Tutte (1966) asked whether in this 


formulation planarity is needed, and conjectured not. 


Conjecture 5.2. Every 4-edge-connected graph has a nowhere-zero 3-flow. 
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This remains open; indeed, it is not known if there is any & such that every 
k-edge-connected graph has a nowhere-zero 3-flow. Jaeger (1975) proved the 
following weakening of Conjecture 5.2. 


Theorem 5.3. Every 4-edge-connected graph has a nowhere-zero 4-flow. 


The proof of Theorem 5.3 uses the following lemma, a corollary of a theorem 
of Nash-Williams (1961). 


Lemma 5.4. For any k =1, every 2k-edge-connected graph has k spanning trees, 
mutually edge-disjoint. ‘ 
Proof of Theorem 5.3. Let G be 4-edge-connected. By Lemma 5.4, G has two 
spanning trees T,, T, with E(T,) E(T,) =@. For each e € E(G) — E(T,) let C, 
be the unique set of edges in E(7,) U {e} forming a circuit. Let F be the set of all 
f © E(T,) belonging to C, for an odd number of edges e € E(G) — E(T;), and let 
C, =F U(E(G) — E(T,)). Then C, is a cycle with E(G) — E(T,) CC,. Similarly, 
there is a cycle C, with E(G) — E(T,) C C,. Then C, UC, = E(G), and so G has a 
nowhere-zero 4-flow by Theorem 3.2. 0 


6. The 5-flow conjecture 


As we stated earlier, Tutte (1954) proposed the following strengthening of 
Theorem 1.2, which would generalize the five-colour theorem for planar graphs. 


Conjecture 6.1. Every graph with no isthmus has a nowhere-zero 5-flow. 


Jaeger (1975) proved the following, which is now superceded but which had a 
very pretty proof that we would like to include here. 


Theorem 6.2. Every graph with no isthmus has a nowhere-zero 8-flow. 


Proof. It is easy to see that it suffices to prove that every 3-edge-connected graph 
has a nowhere-zero 8-flow. Thus, let G be 3-edge-connected. By applying Lemma 
5.4 to the 6-edge-connected graph obtained from G by replacing each edge by two 
parallel edges, we deduce that there are three spanning trees T,, T,, T, of G with 
E(T,) N E(T,)O E(T;) = 6. As in Theorem 5.3, let C, 2 E(G) — E(T;) be a cycle 
(i= 1, 2,3). Then C, UC, UC, = E(G), and so (by an analogue of Theorem 3.2 
for 8-flows) G has a nowhere-zero 8-flow. O 


Theorem 6.2 was improved by the following result of Seymour (1981a). 


Theorem 6.3. Every graph with no isthmus has a nowhere-zero 6-flow. 
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Proof. We proceed by induction on |E(G)|. We may assume that 

(1) G is simple, 3-connected, and cubic. 

For evidently, we may assume that G is connected and loopless, and hence 
2-edge-connected since it has no isthmus. If |6(X)| =2 for some X CV(G), the 
result follows from the inductive hypothesis by contracting an edge in 6(X). Thus 
we may assume that G is 3-edge-connected. If some vertex v has valency =4, then 
there are edges ¢,,e, with ends u,,v and u,v say, such that if we delete e,, e, 
and add a new edge e, joining u,,u,, the resulting graph G’ has no isthmus. By 
the inductive hypothesis G’ has a nowhere-zero 6-flow @’, say; and hence so does 
G (define |#(e, | = |(e,)| = |#(e)|, and otherwise @ = $’). Thus we may assume 
that there is no such vu. Hence G is cubic, and so it is 3-connected. 

For X C E(G) we define X’ to be the smallest set Y C E(G) with X C Y such 
that there is no circuit C with 0< |E(C) — Y| <2. We say that X is good if there is 
a Z,-circulation @ with (e) #0 for all e€ E(G) — X. 

(2) If XC E(G) and X* = E(G) then X is good. 

For we proceed by induction on |E(G) — X|. We may assume that X ¥ E(G), 
and so there is a circuit C with 0<|E(C)- X|<2. Let X'= X UE(C). Then 
X’? = E(G), and so from the inductive hypothesis X’ is good; let @’ be the 
corresponding Z,-circulation. Let @ be a Z,-circulation with ¢)(e)= +1 (e€ 
E(C)), and ¢,(e) =0 otherwise. Then one of 6’, 6’ + do, ' — dp is non-zero on 
all the (at most two) edges of E(C) — X, and hence is the required Z,-circulation. 

(3) There is a cycle C of G with C? = E(G). 

For choose a cycle C with C’ forming the edge-set of a connected subgraph H, 
and with C? maximal. Certainly H is induced, and no vertex of G not in V(H) has 
22 neighbours in V(H) (because H is connected). If H # G, there is a maximal 
2-connected subgraph B of G\V(H) such that at most one vertex of B has a 
neighbour not in V(B) UV(#7). Now B must contain two vertices with neighbours 
in V(H) since G is 3-connected, and hence there is a circuit D of B, two vertices 
of which have neighbours in V(H). But (CU D)’ is connected, contrary to the 
maximality of C. Thus H = G, as required. 

From (2), there is a Z,-circulation ¢, say such that ¢,(e) #0 for all e € E(G) — 
C. Let ¢, be a Z,-circulation such that ¢,(e)#0 for all e€C. Let P(e)= 
(¢,(€), ¢,(e)) for all e; then ¢ is a nowhere-zero Z, x Z,-flow in G. By Theorem 
2.3, G has a nowhere-zero 6-flow, as required. O 
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1. Introduction 


Various types of tilings of the Euclidean plane have been studied since ancient 
time. The problem of existence and classification of such tilings can be formulated 
as problems concerning infinite planar graphs with certain symmetry properties. 
In a series of recent papers Griinbaum and Shepard have investigated these 
problems extensively and written a beautiful book (Griinbaum and Shepard 1987) 
on the subject. The drawings in Griinbaum and Shepard (1987) show that these 
problems are not only interesting mathematically but also for aesthetic reasons. 
Another well-known classification problem of aesthetic nature involving planar 
graphs is that of classifying the Platonic solids, a problem which was elved by 
Euler’s formula for planar graphs. 

Planar graphs (or more generally graphs drawn in the plane with certain 
restrictions on how edges cross) are also of interest for practical reasons. The 
design of printed circuits has led to several problems on such graphs. Among such 
problems we may mention the problem of drawing a graph such that the total 
number of crossings is least possible and the problem of decomposing the graph 
into as few planar graphs as possible. The graph invariants thus arising are called 
the crossing number and thickness, respectively. These have been studied for their 
mathematical interest and we mention some of the results in this chapter. 

Another practical reason for studying graph representation problems is that 
graph algorithms (when carried out in practice) may be very sensitive to the way 
in which the graphs are represented. One way is to represent the graphs as 
intersection graphs of set systems, where the sets may be some easily specified 
objects in a Euclidean space, for example d-intervals. 

We say that a graph G can be embedded in a topological space X if the nodes of 
G can be represented by distinct elements in X and each edge of G can be 
represented by an arc in X, i.e., the image of a 1-1 continuous function from the 
unit interval [0, 1] into X. Moreover, we require that two edges have at most one 
end in common. In this context many topological spaces X are uninteresting as 
the following observations show. 


Proposition 1.1. Every simple graph can be embedded into R® such that all edges 
are straight line segments. 


Proof. For wae the nodes can all be represented on the curve consisting of 
all points (¢,¢°,¢) where tER. O 


We define a k-book as the topological space obtained from k disjoint unit 
squares by selecting one side of each and identifying all these sides into one line 
(called the spine of the k-book). Below is a short proof of a result of Atneosen 
(1968). 


Proposition 1.2. Every graph G can be embedded in a 3-book. 
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Proof. Represent all nodes of G on page 1 of the 3-book and connect every 
vertex uv by deg(v) straight line segments on page 1 to the spine. Now it is an easy 
exercise to extend the line segments to an embedding of G using only polygonal 
arcs on pages 2 and 3 (which may be thought of as a rectangle). O 


Propositions 1.1 and 1.2 explain why almost all results on graph embeddings 
concern the plane R* or compact 2-manifolds. The theory of knots in R° is not 
considered as graph theory. One result by Conway and Gordon (1983) should, 
however, be mentioned in this context. 


Theorem 1.3. Let G be a graph embedded in the 3-dimensional space such that 
every edge is a polygonal arc. If G contains a K,, then G contains a pair of disjoint 
cycles which are homologically linked. If G contains a K,, then G contains a 
knotted cycle. 


Recently, Robertson et al. (1993b) have completely characterized the graphs 
which have linkless embeddings in the 3-dimensional space. 

If we delete an edge e of a graph G embedded in X where X denotes the plane 
or a compact 2-manifold, then the resulting graph G-e is also embedded in X. 
Also, the contraction of the edge e results in a graph G/e embeddable in X, and 
hence the same holds for every minor of G, i.e., every graph obtained from G by 
successively deleting and/or contracting edges. The correlation between minors 
and embeddings is demonstrated by Kuratowski’s (1930) classical planarity 
criterion that a graph is planar if and only if it contains no subdivision of any of 
the Kuratowski graphs K, and K, , (see fig. 1.1). It is an easy exercise to see that 
a graph contains a subdivision of K,, if and only if it has K,, as a minor. 
Moreover, a graph G has K, as a minor, if and only if K either contains a 
subdivision of K, or has K,, plus two independent edges as a minor. Hence 
Kuratowski’s theorem is equivalent to the following result by Wagner (1937) (see 
also Harary and Tutte 1965). 


Theorem 1.4. A graph is planar if and only if it has none of K, or K,, a8 @ minor. 


3,3 K, 


Figure 1.1. The Kuratowski graphs. 
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Kuratowski’s theorem is not only the most fundemental planarity criterion as 
we point out in section 2.1. It also is a prototype of characterization in terms of 
forbidden minors, a type of characterization which is not only important in 
connection with graph embeddings, but also in other parts of graph theory and 
matroid theory. The study of minors has led to some of the deepest results in 
graph theory, some of which will be mentioned in this chapter. 

In this chapter we first treat planarity and related invariants such as thickness 
and crossing numbers. Also, we consider more special representations of planar 
graphs and graphs in the plane. Then we proceed to graphs on higher surfaces. 
The investigations in this area were for several years centered around the 
Heawood conjecture on the minimum genus of a 2-manifold into which the 
complete graph K,, can be embedded. Ringel solved the problem in specihl cases 
and the solution was completed by Ringel and Youngs (1968), (see the survey of 
White 1978b). In section 3.3 we reproduce a complete solution when n=7 
(mod 12). The method is based on the rotation embedding scheme, a simple idea 
which turns many embedding problems into combinatorial problems. 

One of the most important unsolved problems in graph theory is Hadwiger’s 
conjecture (see also the chapter by Toft) that every graph of chromatic number at 
least k has K, as a minor. In section 4 we present the classical result of Wagner 
(1937) which gives a complete description of the graphs having no K, as a minor. 
As a corollary (called Wagner’s equivalence theorem), Hadwiger’s conjecture for 
k = 5 is equivalent to the 4-colour-theorem. We also mention methods and results 
related to the theorem of Wagner. 

A well-known theorem of Kruskal (1960) asserts that, given an infinite 
sequence of trees there is always one that contains a subdivision of another. This 
does not hold for graphs in general, but if “subdivision” is replaced by “minor” 
then the statement above becomes true with “trees” replaced by “graphs”. This 
has been proved by Robertson and Seymour in a series of “graph minor’ papers 
beginning with Robertson and Seymour (1983). This remarkable results has far 
reaching consequences. Consider graph property p such that any minor of a graph 
with property p also has property p. For example, p might be the property of 
being embeddable in a given surface. Then the Robertson-Seymour theorem 
implies the existence of a finite set of graphs G,,..., G,, such that a graph G has 
property p iff it has no minor isomorphic to any of G,,..., G,,. Combined with 
the solution of the k-path problem (also due to Robertson and Seymour) it gives a 
polynomial time algorithm for testing whether a graph has property p. Section 4 
is devoted to a discussion of the Robertson—-Seymour theory. 

The literature on graph embeddings is immense and therefore many important 
results and methods have not been included in this chapter. Also, there are 
already excellent books or survey articles on the subject. White’s (1973) book 
covers embeddings on higher surfaces and emphasizes also the algebraic aspects. 
White and Beineke (1978) is a more recent survey and the most recent book in 
the same spirit is the one by Gross and Tucker (1987). The Heawood conjecture 
is surveyed in White (1978b) and is the subject of Ringel’s (1974) book. Crossing 
numbers are treated in surveys of Guy, for example Guy (1971). Furthermore, 
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many text books on graph theory contain information on these subjects. 
However, we hope that the present chapter may be of use to those who wish to 
become acquainted with some of the important results and methods in a relatively 
fast way. Thus we demonstrate the power of the rotation scheme by including a 
short proof of one of the cases in the Heawood conjecture. 

We treat various embeddings in the plane in some detail and describe the basic 
methods in connection with minors exemplified by a short proof of Wagner’s 
characterization of the graphs that do not have K, as a minor. Finally, we 
emphasize the interaction between minors and embeddings as best demonstrated 
by the Robertson—Seymour theory. 


2. Graphs in the plane 


2.1. Planarity criteria 


We distinguish between a plane graph (a graph embedded in the plane) and a 
planar graph (i.e., an abstract graph isomorphic to a plane graph). It is easy to see 
that a plane graph can be redrawn such that all edges are polygonal arcs and from 
now on we assume that plane graphs are embedded in this way. A face of a plane 
graph G is a (polygonal) arcwise connected point set of R?\G and the boundary 
of a face is defined in the obvious way. The topological prerequisites needed for 
handling plane graphs are summarized in the next proposition. 


Proposition 2.1. (a) A cycle embedded in the plane has precisely two faces (called 
the interior and exterior of the cycle, respectively) each of which has the cycle as 
boundary. 

(b) A plane isomorphic to K,, has precisely three faces. The boundaries of 
these faces are the three cycles of K, 3. 

(c) Every face of a 2-connected plane graph G has a cycle of G as boundary. 


Proof. (a) is a variant of the Jordan curve theorem; a short proof of this together 
with (b) is given in Thomassen (1981). (c) follows easily from (b) with the aid of 
the following observation of H. Whitney: Every 2-connected graph can be 
obtained from a cycle by successively selecting two distinct nodes and adding a 
path between them. © 


We also point out that a simple graph operation reduces many problems on 
graphs (in particular planar graphs) to the 3-connected case. We consider a 
2-connected graph G which has a separating set {x, y} of two nodes and form the 
two graphs G, and G, indicated in fig. 2.1. In these graphs x and y are adjacent 
while x and y may or may not be adjacent in G. It is an easy exercise to show that 
G is planar if and only if both G, and G, are planar. Also, it is easy to see that a 
graph is planar if and only if each block is planar and from these observations we 
may conclude that Kuratowski’s theorem (and several other results on planar 
graphs) is only of interest for 3-connected graphs. If one (or both) of G, and G, in 
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G G G 


Figure 2.1. Splitting a graph of connectivity 2. 


fig. 2.1 has a separating set of two nodes we may repeat the operation of fig. 2.1 
until we obtain a collection of 3-connected graphs. These graphs turn out to be 
independent of the order in which we perform the operation of fig. 2.1. Hopcroft 
and Tarjan (1974) call them the 3-connected components of the graph. They 
showed that the 3-connected components can be found in linear time and used 
that to describe a linear planarity test algorithm (see section 3.5). 

Here we describe another planarity test algorithm which also proves Kuratow- 
ski’s theorem. It is based on the following contraction lemma which is a 3- 
connected counterpart to the observation of Whitney mentioned in the proof of 
Proposition 2.1. 


Lemma 2.2. Every simple 3-connected graph other than K, has an edge whose 
contraction results in a 3-connected graph. 


Lemma 2.2 can be obtained from Tutte’s characterization of the 3-connected 
graphs, but Lemma 2.2 also has a short independent proof (see Thomassen 1980). 
Based on Lemma 2.2 our planarity algorithm is as follows: Let G be any simple 
3-connected graph. By Lemma 2.2, we form a sequence of graphs H,, 
A,,...,H,, where H, =G, H,,=K, and H,,, is a 3-connected graph obtained 
from H, by contracting an edge (and replacing any multiple edge that might occur 
by a single edge) for i=1,2,...,m-— 1. Now draw H,, in the plane and, if H,,, 
is plane, then we modify it, if possible, to a plane drawing of H, by splitting a 
vertex z up into two adjacent vertices x and y keeping all edges not incident with 
z unchanged. In particular, the modification of H,,, only affects the edges eee 
(or outside) the cycle C of H,,, — z which is the boundary of the face of H;,, — 
containing z. So, if H, cannot be drawn as a plane graph, then the reason is that C 
together with x, y and the edges incident with x and y is nonplanar. Then it is 
easy to find a subdivision of K, or K,, in this subgraph of H, (for details, see 
Thomassen 1981). In that case He (and hence also G) contains K, or K,, as a 
minor. 

An edge xy satisfies the conclusion of Lemma 2.2 iff the graph obtained by 
deleting x and y is 2-connected. This can be tested in O(q) time where q is the 
number of edges of the graph. Thus an edge satisfying the conclusion of Lemma 
2.2 can be found in O(q*) time. Hence the above sequence H,, H>,...,H,, can 
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be found in O(q’) time. The reverse operating (going from H,, to H,) can be 
done in fewer steps. Thus the algorithm is an O(q’) time algorithm. 

Other short proofs and applications of Kuratowski’s theorem are discussed in 
Thomassen (1981), and recently Tverberg (1987) obtained yet another short 
proof. The history of the theorem is the subject of Kennedy et al. (1985), see also 
the chapter by Biggs et al. in this volume. Here we just mention a few 
implications. 

Tutte (1963) (see also Thomassen 1980) pointed out that Kuratowski’s theorem 
implies MacLane’s (1937) planarity criterion Theorem 2.3 below. For the 
definition of the cycle space the reader is referred to chapter 1 by Bondy. 


Theorem 2.3. A graph is planar if and only if its cycle space has a basis of cycles 
such that every edge is in at most two of these cycles. 


We also mention the characterization due to Chartrand and Harary (1967) of 
the outplanar graphs, i.e., the graphs that can be embedded in the plane such that 
all nodes are on the outer face boundary. We express the result in terms of minors 
(see also the chapter by Seymour). 


Theorem 2.4. A graph G ts outerplanar if and only if it contains no minor 
isomorphic to K, or K, ,. ; 


Proof. G is outerplanar if and only if the graph G’ obtained from G by adding a 
new node joined to all other nodes is planar. Hence Theorem 2.4 follows from 
Theorem 1.4. O 


2.2. The dual graph 


If G and H are connected graphs we say that H is a dual graph of G, and write 
H = G*, if there exists a function g: E(G)— E(H) which is 1-1 and onto such 
that a subset E C E(G) forms a cycle in G iff g(£) is a minimal edge cut in H. 
One can prove a number of fundamental and interesting facts concerning 
(abstract) dual graphs (for example that G is a dual of G*). Here we mention 
only one such example which emphasizes the “‘duality” of the two operations used 
in the definition of minors. 


Proposition 2.5. If G* is a dual of G, and e is an edge of G, and e* is the 
corresponding edge in G*, then G* — e* is a dual of G/e and G*/e* is a dual of 
Ge. 


In particular, if G has a dual graph, then every minor of G has a dual graph. 
One can show that none of K, and K,, has a dual graph. (This is a finite 
problem.) Hence Kuratowski’s theorem implies the nontrivial part of Whitney’s 
(1932) planarity criterion. 
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Theorem 2.6. A graph G has a dual graph if and only if G is planar. 


The easy part of Theorem 2.6 is established by the geometric dual graph of a 
plane graph G. In every face of G we insert a node and for every edge e of G, we 
draw an edge e* such that e* crosses e (and no other edge of G or the geometric 
dual graph) and joins the two nodes in the faces adjacent to e. If e is an isthmus of 
G, then only one face is adjacent to e and in that case e* becomes a loop. 

The importance of duality becomes perhaps more clear when going from graphs 
to matroids (see the chapters by Recski and Welsh, respectively). For planar 
graphs the geometric dual graph is often a useful too! (primarily because of 
Proposition 2.7 below) and some results which hold for planar graphs and not for 
graphs in general can sometimes by “dualized” such that they give cise to 
theorems or problems for general graphs. The dual graph (or matroid) is of 
interest in electrical networks as indicated by the following simple example: Let N 
be an electrical network whose underlying graph is plane and whose edges are 
resistances, voltage generators and current generators. Let N* be the dual 
network, i.e., the underlying graph of N* is the geometric dual of that of N, and 
the resistances of N* are the reciprocals of those of N, and voltage (resp. current) 
generators in N* correspond to current (resp. voltage) generators of N. Then the 
solution of N (i.e., the current and voltage vectors) becomes the solution of N* 
simply by interchanging between the current and voltage in every edge. 

Theorem 2.6 can be extended as follows: If G, H is a pair of (abstract) dual 
graphs then they have a plane embedding as geometric dual graphs. Suppose now 
that G is 2-connected. (It is then easy to see that every dual graph of G is also 
2-connected.) If G, and G, are plane graphs isomorphic to G, and the facial 
cycles (i.e., the face boundaries) of G, and G, are the same cycles in G and, 
furthermore, the outer cycle in G, corresponds to that in G,, then it is not 
difficult to prove that the two plane representations G, and G, are equivalent in 
the sense that there exists a homeomorphism of the plane taking G, onto G,. If 
we regard G, and G, as embeddings of G on the sphere, then the same 
conclusion holds without the assumption on the outer cycles. Hence we get the 
following. 


‘n 


nema 


Figure 2.2. A plane graph and its dual graph. 
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Proposition 2.7. Let G be a 2-connected plane graph. There is a 1-1 corre- 
spondence between the nonequivalent embeddings of G into the sphere and the 
nonisomorphic dual graphs of G. 


Let H be a 2-connected graph and let G be a dual graph as the leftmost graph 
in fig. 2.1. A 2-switching of G is obtained by deleting every edge zx (where z is in 
G,) and replacing it by zy. The resulting graph G’ clearly has the same cycles as 
G (in the sense that an edge set forms a cycle in G iff the corresponding edge set 
in G' also forms a cycle). Hence H is a dual of G’ and so G' is a dual of H. 
Whitney (1933) proved that every dual graph of H can be obtained from G by a 
sequence of 2-switchings. Combined with Proposition 2.7 this gives a com- 
binatorial description of the nonequivalent embeddings of H into the sphere. In 
particular, if H is 3-connected, then H has only one such embedding (up to 
equivalence). Another combinatorial explanation of the last fact was also given by 
Tutte (1963). 


Proposition 2.8. if H is a simple 3-connected planar graph, then the facial cycles in 
any plane embedding of H are those cycles in H which are chordless and 
nonseparating (in the sense that the deletion of the nodes of the cycle leaves a 
connected graph). 


The nonequivalent embeddings into the plane (or sphere) can also be described 
algebraically based on extensions of Theorem 2.3. 


2.3. Convex representations 


The graph G(P) of d-polytope P (i.e., the convex hull of a finite point set in R*) 
is the graph whose nodes are the vertices of P and whose edges are the edges of 
P. It can be shown (see Griinbaum 1967) that G(P) is d-connected. Moreover, if 
d = 3, then an appropriate projection of P onto R* shows that G(P) has a convex 
representation, i.¢., a plane representation such that all facial cycles are convex 
polygons. In particular, G(P) is planar and 3-connected. Steinitz’ theorem (see 
Grinbaum 1967) below shows that the converse holds. 


Theorem 2.9. A graph G is the graph of a 3-polytope if and only if G is simple 
planar and 3-connected. 


In particular, every simple 3-connected planar graph has a convex representa- 
tion. Tutte (1963) considered planar straight line representations such that every 
node not on the outer cycle is the mass center of its neighbours. This is a special 
type of convex representations and finding such a representation amounts to 
solving certain system of linear equations. Tutte proved that this can always be 
solved for 3-connected planar graphs. 

Tutte (1960) also proved that any convex polygon can play the role of the oute: 
cycle (provided, of course, that it has the right number of corners). Tutte’s result 
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was in fact more general and we state here the generalization of Thomassen 
(1980) of Tutte’s result. 


Theorem 2.10. Let G be a 2-connected plane graph with outer cycle §. Redraw S 
as a convex polygon 2. Then J can be extended to a convex representation of G 
(inside 3) if and only if G satisfies the following conditions: 

(a) If H is a connected component of G — V(S), then the nodes of S joined to H 
are not ail on the same straight line segment of 2%. 

(b) ff e is an edge of G— E(S), then the ends of e are not on the same straight 
line segment of S. 

(c) if G has a separating node set {x, y} as in fig. 2.1, and S is in G,, then G, is 
just a path from x to y. 


The afore-mentioned result of Tutte (1960) is the special case of Theorem 2.10 
when % is strictly convex. In this case (b) can be omitted, and (a) reduces to the 
statement that no two consecutive nodes on S form a separating set. The 
extension Theorem 2.10 of Tutte’s result is convenient when going from finite to 
infinite graphs as pointed out in Thomassen (1984). For example, every infinite, 
locally finite 3-connected simple planar graph with no node accumulation points 
has a convex representation, even such that every bounded face is a convex 
polygon with at most 8 corners. (For finite graphs we can replace 8 by 6 which is 
best possible.) Moreover, if the graph is doubly-periodic (i.e., its automorphism 
group includes two translations in different directions), as is the case for many of 
the tilings mentioned in the introduction, then the graph has a doubly-periodic 
convex representation. This last result was proved first by complicated methods 
by Mani-Levitska et al. (1979) and a short proof based on Theorem 2.10 appears 
in Griinbaum and Shepard (1981) and Thomassen (1984). 

The following application of Theorem 2.10 appears in Thomassen (1989b). 


Theorem 2.11. Let G, S and 3 be as in Theorem 2.10 and suppose (a), (b), (c) are 
satisfied. Suppose further that & has no horizontal edge. Now orient each edge of G 


Figure 2.3. Examples of the forbidden configuration in Theorem 2.10. 
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such that the top (respectively bottom) vertex of X is the only sink (respectively 
source). Then & can be extended to a convex representation of G such that all edges 
are directed upwards. 


It is easy to see that a maximal plane graph is 3-connected and has therefore by 
Theorem 2.10 a convex representation. In particular, every planar graph has a 
straight line representation. This is usually called Fary’s theorem although it was 
first discovered by Wagner (1936). Similarly, Theorem 2.11 implies the following 
result which extends a result of Platt (1976). (Here, an oriented graph is obtained 
from a graph by assigning an orientation to every edge.) 


Corollary 2.12, Let G be an oriented simple graph with only one source x and one 
sink y and assume further that G U {xy} is planar. Then G has a plane straight line 
representation such that all edges are directed upwards. 


It is an interesting unsolved problem in connection with Hasse diagrams to 
characterize those oriented graphs which have plane straight line representations 
with all edges directed upwards. For oriented graphs with a node dominating all 
other nodes (a directed analogue of outerplanar graphs) there is a Kuratowski 
type theorem by Thomassen (1989b) with more than 20 types of forbidden 
subgraphs, so the general problem is probably difficult. N. Alon and L. Lovasz 
(and possibly others) have generalized the above-mentioned result of Wagner and 
Fary to general surfaces: If a graph embeds in the surface it embeds with 
geodesics. 

A plane graph is rectangular if all faces are bounded by rectangles whose sides 
are either vertical or horizontal straight line segments. Ungar (1953) proved the 
following result on rectangular representations. 


Theorem 2.13. Every planar, cubic, 3-connected, cyclically 4-edge-connected 
graph has a rectangular representation. 


2.4. Other special representations of planar graphs 


A d-interval Braph G is the intersection graph of a collection of d-intervals 
(boxes) in R*. If the intervals can be chosen such that no two have an interior 
point in common and any two intersecting intervals have a (d —1)-interval in 
common, then we say that G is a strict d-interval graph. The d-interval graphs 
have been characterized for d=1 only. Thomassen (1986) extended Theorem 
2.13 to give a necessary and sufficient condition for rectangular representations in 
terms of forbidden configurations in the same spirit as Theorem 2.10 and that was 
used to prove the following. 


Theorem 2.14. A graph G has a Strict 2-interval representation if and only if G is u 
proper subgraph of a 4-connected planar graph. 
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Moreover, Thomassen (1986) obtained a characterization of the graphs in 
Theorem 2.14 in terms of forbidden subgraphs and proved also the following. 


Theorem 2.15. Every planar graph has a strict 3-interval representation. 


Theorem 2.15 was conjectured by Scheinerman and West inspired by their 
counterpart of Theorem 2.15 proved in (Scheinerman and West 1983). 


Theorem 2.16. Every planar graph is the intersection graph of a set system where 
each set is the union of three real intervals. 


Melnikov (1981) introduced an analogous type of representation (which we 
here call a Melnikov representation: The nodes of the graph are disjoint horizontal 
straight line segments in R? and the edges are all possible vertical straight line 
segments which connect two nodes and intersect no other node or edge. Duchet 
et al. (1983) proved that every maximal planar graph has a Melnikov representa- 
tions. Thomassen (1984) extended this to all 3-connected graphs. Tamassia and 
Tollis (1986) proved the following. 


Theorem 2.17. A graph G has a Melnikov representation if and only if it has a 
plane representation such that all cutnodes are on the boundary of the outer face. 


We can test, in linear time, if a graph G satisfies the conclusion of Theorem 
2.17: Just add a new node v and join v to all cutnodes of G. Then apply a 
planarity test algorithm to the resulting graph G'. Applying Kuratowski’s theorem 
to G’ yields the following. 


Corollary 2.18. A pianar graph G has no Meinikov representation iff either 

(i) G contains a subdivision of K, such that two “opposite” paths contain 
cutnodes of G, or 

(ii) G contains a subdivision of K,., such that each of the three paths between 
the nodes of degree 3 contains a cutnode of G, or 

(iii) G contains a subdivision of K, minus an edge such that the two nodes of 
degree 3 are cutnodes in G, or 

(iv) G contains a subdivision of K, such that the nodes of degree 3 are cutnodes 
in G, 


The following result is due to Koebe (1936) and Andreev (1970). 


Theorem 2.19. Every planar graph can be represented as the intersection graph of 
closed discs in the plane such that no two discs have an interior point in common. 


Sinden raised the problem of characterizing those graphs which are intersection 
graphs of open curves in the plane. Such graphs are investigated carefully by 
Kratochvil et al. (1986). Bouchet (1987) obtained an excluded minor characteri- 
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zation of those string graphs where the curves are all chords of a fixed circle. 
Kratochvil (private communication) has proved that recognizing string graphs in 
general is NP-hard. 


2.5. Plane drawings with crossings 


Every graph can be drawn in the plane such that any two edges have at most one 
point in common (and such that any such point is an end or a crossing point) and, 
furthermore, such that no three edges have a crossing point in common. Such a 
drawing will be called normal. 

The crossing number cr(G) and the rectilinear crossing number cr(G) is the 
smallest number of crossings in a normal drawing (respectively normal drawing 
with straight edges) of G. cr(G) has been investigated for several classes of graphs 
(see, e.g., Guy 1971), but only few exact results are known. Here we consider 
complete graphs and complete bipartite graphs only. 

The crossing number cr(K,) was considered first by P. Erdés in 1940, as 
mentioned by Guy (1969b) who was the first to obtain the following upper bound. 


Theorem 2.20. 
v(K,,) <1 [n/2] [(a — 1)/2] Ln —2)/2] L@ —3)/2]. 


Proof. Guy (1969b) gave the following elegant proof: Let the nodes of K,, be the 
vertices of a regular n-gon. Joint two nodes by straight line segments if these are 
in directions within a given quadrand. Otherwise join them outside. O 


Another simple proof is described by White and Beineke (1978). The big 
unsolved problem on crossing numbers is whether the inequality of Theorem 2.20 
is in fact an equality. This has been verified for n < 10 (see Erdés and Guy 1973). 

Guy (private communication) has shown that cr(K,) = cr(K,) = 36 but cr(K,) > 
18 = cr(K,) and cr(K,,) > cr(K,,). It is believed that 


cr(K,) >cr(K,) forallnz 210. 


The best known upper bound for cr(n) has been obtained by Jensen (1971). 


Theorem 2.21. 
er(K,,) < |(7n* — 56n° + 128n? + 48n |(n — 7)/3| + 108)/432]. 


It was conjectured by Erdés and Guy (1973) that Theorem 2.21 is an equality. 
However, Jensen (private communication) has found several better drawings. 
Jensen can prove that cr(K,4 <hr for nm large (which is better than Theorem 
2.21) and believes that cr(K,)/cr(K,)—> 1 as n>. 

If K,,, is drawn with cr(K,,,) crossings, then the average number of crossing. 
in the n+1 K,’s in this K,,, is ( —3)cr(K,,,)/(a +1) and hence this is an 
upper bound for cr(K,,). In other words, cr(K,)/(7) Scr(K,4,)/("7!). Hence 
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lim,.. Ct(K,)/n* exists. Similarly, lim,_,..¢r(K,)/n* exists. Since cr(K,) = 
cr(K,) = 1 we conclude that 


or(K,) > or(K,) = 4(4) . 
Kleitman (1970) obtained the better inequality 
or(K,) = cr(K,) = 3 (4) 


for n sufficiently large. 

The crossing number problem for complete bipartite graphs is also known as 
Turan’s brick-factory problem and it was believed to be solved by Zarankiewicz 
(1954). This is discussed by Guy (1969a) who notes that Zarankiewicz proved 
only the upper bound. 


Theorem 2.22. 
er(K,,,,) = |m/2] [@m — 1)/2] |n/2] La — 19/2]. 


It is believed that Theorem 2.22 is in fact an equality. (This has been verified by 
Kleitman (1970) for min(@m,n) <6). If true, then cr(X,,,,) =cr(K,,,,). It was 
conjectured by Eggleton (1986) and proved by Thomassen (1988a), that if G is a 
normal drawing such that every edge crosses at most one other edge, then there 
exists a homeomorphism of R’ onto R’ taking G into a graph where every edge is 
a straight line segment unless G contains a so-called B- or W-configuration. (Such 
a configuration has at most 4 edges and is therefore easy to detect.) As a 
consequence, every 2-connected 3-edge-connected planar graph G and its dual 
can be drawn as geometric dual graphs such that all edges (except one) are 
straight line segments. As another easy consequence, cr(G) <1 implies cr(G) < 1. 
Bienstock and Dean (1993) proved that cr(G) <3 implies cr(G) = cr(G) and that 
there are infinitely many graphs H such that 


cr(G) >4=cr(G). 


Garey and Johnson (1982) proved that it is NP-complete to settle the following 
question: Given a graph G and a natural number &, is cr(G) =k? 

There are other problems on plane representations of graphs with certain 
crossings allowed. For example, Woodall (1971) gave a complete characterization 
of the graphs which have a normal drawing in the plane such that every two 
nonadjacent edges cross under the assumption that no such graph can have more 
edges than nodes. This assumption, which is the so-called thrackle conjecture by 
Conway, has not been verified, though. 


2.6. Graph thickness 


The thickness (G) of a graph G is the smallest number k such that G can be 
decomposed into & planar graphs. A lower bound for ¢(G) is easily obtained from 
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Euler’s formula 
n~-~et+f=2, (2.23) 


where n, e, f are the number of nodes, edges, and faces (respectively) of a 
connected plane graph G. Euler’s formula follows easily, by induction on e, from 
Proposition 2.1 (a) and (b). If G if simple, then every face has at least three edges 
on its boundary, and every edge is on the boundary on at most two faces. Hence, 
for G simple, 


ex3n-6. (2.24) 
Now, (2.24) implies 
t(G) 2 e/(3n — 6) (2.25) 
and hence 
t(K,,) = [(1/2n(n — 1))/(3n — 6)] 
= [(1/2n(n — 1) + 3n — 7)/3n — 6] = [(n + 7)/6]. (2.26) 


Beineke and Harary (1969) showed that the inequality of (2.26) is an equality for 
almost all n. They first showed that the inequality of (2.25) is an equality for the 
graph obtained from K,, by deleting a perfect matching. So, this graph has 
thickness k. Then they showed that the inequality for t(K,,) is an equality for all x, 
n#4 (mod6). Others modified the construction of Beineke and Harary (1969) 
(see White and Beineke 1978) to dispose of the cases n =4 (mod 6) except the ; 
case n = 16 which was finally settled by Mayer (1972) as shown in fig. 2.4. 

Battle et al. (1962b) showed that «(K,)=3. Combining the above results we 
get. 


Theorem 2.27. ¢(K,) =1(K,,) =3, and 
(K,,) = [(n + 7)/6] 
for all n#9, 10. 


The book-thickness of a graph G is the smallest number k such that G can be 
embedded into the k-book (defined in the introduction) such that all nodes are or 
the spine and no edge intersects more than one page (except at the spine). It is 
easy to see that a graph G has book-thickness 1 iff it is outerplanar. If G has 
book-thickness <2, then clearly G is planar. Furthermore, we can add edges to G 
(in the 2-book) so that we obtain a Hamiltonian cycle of the resulting graph G 
such that all edges of E(G*)\E(G) except one are on the spine. Conversely, : 
Hamiltonian planar graph clearly has book-thickness <2. So, a graph G ha: 
book-thickness <2 iff G is a subgraph of a Hamiltonian planar graph. Chvat2 
(private communication) has shown that it is NP-complete to decide if a maxinu 
planar graph is Hamiltonian. Hence it is NP-complete to decide if a graph hi 
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Figure 2.4. (K,,) = 3. 


book-thickness <2. Mansfield (1983) proved that it is NP-complete to decide if a 
graph has thickness <2. 

Yannakakis (1989) proved that every planar graph has book-thickness <4. 
Every triangle free planar graph is a subgraph of a 4-connected (and hence 
Hamiltonian) planar graph (see, e.g., Thomassen 1986). So, every triangle free 
planar graph has book-thickness 1 if it is outerplanar and book-thickness 2 
otherwise. Malitz (1988) proved that graphs of genus g (defined in section 3.2 
below) have book-thickness at most O(./g) which is essentially best possible. 


3. Graphs on higher surfaces 


3.1. The classification of surfaces 


As noted earlier, a graph embedded in the plane may also be regarded as a graph 
on the sphere. In this section we consider graphs on other surfaces. The 
topological prerequisites are the basic definitions such as compact and arcwise 
connected Hausdorff topological space, homeomorphism, open and closed sets 
and continuous deformation of a curve (which will not be repeated here). 
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In what follows a surface S is a compact arcwise connected Hausdorff 
topological space which is locally homeomorphic to a disc (which means that, for 
every point p on S, there is an open set in S containing p which is homeomorphic 
to an open disc in the plane). A surface can be obtained in the following way: 
Take a collection of pairwise disjoint convex polygons (and their interior) of side 
length 1 in the plane. Identify each side with precisely one other side (possibly in 
the same polygon). This results in a topological space S and a simple graph G 
(whose nodes are the corners of the polygons and whose edges are the sides). If G 
is connected, then also S is arcwise connected. If, in addition, S is locally 
homeomorphic to a disc at every node of G, then S is a surface. We say that G is 
a 2-cell embedding of S. If all the polygons are triangles (and no side of a triangle 
is identified with a side of the same triangle) then G is a triangulation of S and S is 
a triangulated surface. The simplest triangulated surface is the (boundary of the) 
tetrahedron. A face of a surface with a 2-cell embedding is the interior of one of 
the polygons. 

We shall now define the (triangulated) surfaces S, and N,. 

S, is the sphere. If we cut out two disjoint open discs in S, and identify their 
boundaries such that the clockwise orientations of these boundaries disagree, then 
we have added a handle to S,. If we add g handles we obtain S,. If instead we cut 
out an open disc in S, and identify any two opposite points on the boundary, then 
we have added a crosscap to S,. If we add A crosscaps (h = 1), then we obtain N,,. 
S,, N,, N, are the torus, the projective plane and the Klein bottle, respectively. It 
is easy to see that all surfaces S, and N, are triangulated. (They can be obtained 
from the boundary of the tetrahedron by cutting out triangles instead of discs). 
We shall use the following version of the classification theorem. 


Theorem 3.1. Every triangulated surface is homeomorphic to S, (g #0) or N, 
(k= 1). 


A short graph theoretic proof of Theorem 3.1 is given by Thomassen (1992). 
Theorem 3.1 also holds when “triangulated” is omitted, since one can prove that 
every surface can be triangulated (see, e.g., Kerékjart6 1923). Thomassen (1992) 
gave a simple graph theoretic proof of the Jordan-Schénfliess theorem and used 
that to give a very short proof of the fact that every surface can be triangulated. 

The proof of Theorem 3.1 in Thomassen (1992) also includes Euler’s formula. 


Theorem 3.2. Let G be a 2-cell embedding of a surface S with n nodes and e edges 
and f faces. If S is homeomorphic to S,, then 


n-e+f=2-2¢.° 
If S is homeomorphic to N,,, then 


n-et+f=2—-h. 
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So, for fixed S$, the number n—e +f does not depend on G. It is called the 
Euler characteristic of S. Euler’s formula implies the following. 


Theorem 3.3. if G is a simple graph with n nodes and e edges embedded in Ss, 
respectively N,, then 


e=x<3n—6+ 6g, 
respectively 


e<s3n—6+3h. : 

Proof. We consider only S, (the proof for N, is similar). Let H be a triangulation 
of S,. Now we can redraw G, if necessary, such that all edges are polygonal arcs 
(when we think of S, as a union of triangles in the plane). We may think of GUH 
as a graph M embedded in S, with say m nodes and q edges. We can assume that 
G is connected since otherwise we can add edges to G and still have an 
embedding in S,. So, M is a 2-cell embedding of S, and by Theorem 3.2 M has 
2—2g—m-+q faces. We now delete successively from M edges and nodes of 
degree 1 such that, at each stage, the current graph is connected and such that we 
end up with G. Whenever we delete an edge the number of faces is either 
unchanged or decreased by 1. Thus G has at most 2—2g—n+e faces. Since 
every edge is on the boundary of at most two faces and every face has at least 
three faces on its boundary, an easy count shows that 


e=3n—6+ 6g. O 


An easy count also shows that Theorem 3.3 is an equality for each triangulation of 
S, or N,. Thus a triangulation of S, has too many edges to be embedded in S,. for 
g’<g. Hence all of S,,5,,... are pairwise nonhomeomorphic. Also N,, N,,..- 
are pairwise nonhomeomorphic. Clearly, N, (and hence each N,, 4 2 1) contains 
a MObius strip. S, does not, see, e.g., Thomassen (1992). Hence we get the 
following. 


Theorem 3.4. All the surfaces S,,5S,,... and N,,N,,.-. are pairwise 
nonhomeomorphic. 


The surfaces S, are called orientable while the surfaces N, are called nonorient- 
able. 

Rather than starting with a surface S, or N, and drawing a graph G on it, we 
are going to start with G and then “extend” G to a surface S in which G is a 2-cell 
embedding and in which the number f of faces is easy to determine. We then 
decide which surface we get by computing the Euler characteristic and then 
decide whether or not the surface contains a Mébius strip. 
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3.2. The rotation scheme 


The basic combinatorial tool for graph embeddings in orientable surfaces is a 
simple but powerful observation attributed to Heffter (1891) and Edmonds 
(1960). Consider a graph G in S, and let p be a node. The embedding defines a 
cyclic permutation 7, of the edges incident with p where 7,(a) is the successor of 
a in the clockwise orientation around p. Let us now consider an edge a = pg and 
the following sequence: pagbuc--- where b=77,(a), u is the end of b distinct 
from q, c= 7,,(b) etc. Since G is finite we will traverse some edge twice in the 
same direction. This will happen first for the edge a. The set of edges (considered 
now as ordered pairs of vertices) in this sequence is called the orbit containing a 
(and b,c,...). Note that every edge a = pq belongs to two orbits (which may be 
identical), one for each direction of a. The orbits depend only on the permuta- 
tions 7,. Returning to the embedding it is clear that 


fr, (3.5) 
where f is the number of faces and r the number of orbits. So, by Euler’s formula 
n-etr>2—2g, (3.6) 


where n and e are the number of nodes and edges, respectively, and if equality 
holds, then the embedding is a 2-cell embedding. We now formulate the rotation 
scheme. 


Theorem 3.7. Let G be a connected graph such that, for each node p, 7, is a cyclic 
permutation of the edges incident with G. Then there exists a 2-cell embedding of G 
in some S,, g 20, such that 7, is the clockwise ordering of the edges incident with 
p. For any such embedding, n-e+r=n—e+f=2-—2g, where r is the number 
of orbits and f is the number of faces. In particular, g is unique. 


Proof. Consider any orbit pagbuc--- of G. We draw a convex polygon in the 
plane such that the sides are pq, qu,... and such that the walk pagbuc:-: is 
anticlockwise around the polygon. We do this for each orbit such that the 7 
polygons (and their interior) are disjoint. We now form a topological space as 
follows. The edge a occurs in two polygons (which may be identical) namely as pc 
and gp. We identify these sides. Doing this for each edge results clearly in < 
surface S. It only remains to be proved that S contains no Mobius strip. We shal 
only indicate the proof. First one can see that, if a surface with a 2-cell embedding 
contains a Mobius strip, then it contains a closed polygonal arc, such that left anc 
right interchange as we walk along it (see Thomassen 1992). Then we can show 
that S has no such closed polygonal arc. The last part of Theorem 3.7 follow: 
from Theorem 3.2. O * 


We now define the genus y(G) and the maximum genus 7,(G) of a graph Ga 
the minimum (respectively maximum) g such that G has an embedding (respec 
tively 2-cell embedding) in S,. By Theorem 3.7, it is a finite problem to determin 


. 


Embeddings and minors 321 


y(G) and y,(G) when G is fixed. We might also consider the minimal genus for 
2-cell embeddings but this leads to y(G) by the following observation of ‘Youngs 
(1963). 


Theorem 3.8. Every embedding of a connected graph G into Sic) i @ 2-cell 
embedding. 


Proof. If Theorem 3.8 were false, then by Theorem 3.2 there is an embedding of 


G into S,;¢) such that 


n-etf>2—2y(G). 
This embedding defines a permutation m, Of the edges incident with p, for every 
node p. If r is the number of orbits, then r =f, i.e., 


n-et+r>2-2y(G). 
By Theorem 3.7, there exists an embedding of G into S, where 
n—e+tr=2-2g'. 


This implies g’<y(G), a contradiction. O 
We also mention Duke’s (1966) interpolation theorem. 


Theorem 3.9, if y(G)<g <+,(G), then there exists a 2-cell embedding of G into 
S,. 


& 


Proof. Every cyclic permutation a, can be obtained from every other cyclic 
permutation 7, by successively shifting two consecutive edges. Such a shift affects 
at most three orbits (in the sense that three orbits may be transformed into one 
orbit and vice versa). Since the genus and the number of orbits are related as 
described in Theorem 3.7, the genus is changed by at most one by a shift of two 
consecutive edges. hence Theorem 3.9 is a consequence of Theorem 3.7. UO 

Theorem 3.7 can also be used to give a purely combinatorial proof of the 
following result of Battle et al. (1962a) (see Gross and Tucker 1987). 


Theorem 3.10. The genus of a graph is the sum of the genera of its blocks. 


An analogue of Theorem 3.10 for nonorientable surfaces was obtained by Stahl 
and Beineke [1977]. Their formula is a slight modification of Theorem 3.10. The 
different behaviour of orientable and nonorientable surfaces in connection with 
genus additivity becomes more striking when we consider more complicated graph 
amalgamations as demonstrated by Archdeacon (1986). 

We conclude this section with an application of the rotation scheme to the 
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maximum genus. By Theorem 3.7, 
Yu(G) = ((e —n + 1)/2] 


and equality holds if and only if there exist cyclic permutations z, such that G has 
at most two orbits. The graphs with this property were described elegantly by 
Xuong (1979) and, independently, by Jungerman (see Behzad et al. 1979). 


Theorem 3.11. if G is a connected graph with e edges and n nodes, then 
m(G) < le-n + 1)/2| 


with equality holding if and only if G has a spanning tree T such that at most one 
component of G— E(T) has an odd number of edges. 


Sketch of proof. Assume first that G has a spanning tree 7 as described in 
Theorem 3.11. We prove, by induction on e, that G has a cyclic permutation 7, 
(for each node p) such that G has at most two orbits (and such that at least one of 
these meets all nodes). This is obvious if e=n—1 or n (because in that case G 
can be drawn in the plane with at most two faces). So assume that e=n+ 1. Let 
H be a connected component of G — E(T) with at least two edges. Since H has a 
node v such that H—v is connected, it is easy to find two edges a,,a, of H 
incident with v (if v has degree =2) or the neighbour of v (if v has degree 1) such 
that at most one component of H — {a,,a,} has an odd number of edges. Hence 
at most one component of G — {a,,a,} — E(T) has an odd number of edges. By 
induction, G — {@,,a@,} has cyclic permutations a, such that G — {a,,a,} has at 
most two orbits. Let a; = qv, for i= 1,2. We now modify 7,, 7, , 7, into cyclic 
permutations in G as indicated in fig. 3.1 where we have also indicated (in dotted 
lines) the orbit of G — {a,,a,} that meets all nodes. 

Suppose conversely that G has cyclic permutations 7, such that G has at most 
two orbits. We shall find a spanning tree as described in Theorem 3.11. If e<n, 
there is nothing to prove, so assume that e =n + 1. If G has two orbits, then some 
edge a=uv is contained in both orbits. (Since G is connected, some vertex v 


Figure 3.1. Adding a path of length 2 to a graph with at most two orbits. 
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must be in distinct orbits, and then some edge incident with v must be in distinct 
orbits.) If we delete edge a and modify a, and 7, just by ignoring a (i.e., the 
successor Of @ around u in G is the successor of the predecessor of a in the new 
cyclic permutation), then G — a has only one orbit and the proof is completed by 
induction. On the other hand, if G has only one orbit, then we select an edge a 
such that the walk from a to a in the orbit is smallest possible. If a is incident with 
a vertex u of degree 1, then we complete the proof by applying the induction 
hypothesis to G — u. If a is succeeded by 6 in the above walk, where b ¥a, then 
the minimality property of a implies that G — {a, b} is connected and it is easy to 
see that it has only one orbit (when we ignore a and b as above). The proof is now 
completed by induction. O 
Xuong (1979) extended the ideas of Theorem 3.11 to show that 
a(G) = max|(e-—n + 1-—¢,(G — T))/2), (3.12) 


where ¢,(H) is the number of components of H with an odd number of edges and 
the maximum is taken over all spanning trees T of G. 

By a result of Tutte (1961), every 2k-edge-connected graph has k edge-disjoint 
spanning trees. Combining this with Theorem 3.11 we obtain the following. 


Corollary 3.13. If G is 4-edge-connected, and has n nodes and e edges, then 
Ww(G) = l(e-n+1)/2). 


Furst et al. (1988) showed that the problem of determining the maximum in 
(3.12) can be reduced (in polynomial time) to the following problem which was 
solved by Giles (1982): Given a graph whose edge set is partitioned into pairs, 
find a largest forest whose edge set is a union of some of the above pairs. 


3.3. The Heawood conjecture 
Theorem 3.14. For each n= 13, y(K,) = [(n —3)(2 - 4)/12]. 


Theorem 3.14 was announced in 1890 by Heawood (see White 1978b). The 
lower bound »(K,) = (1 —3)(n — 4)/12 follows immediately from Theorem 3.3. 
But the upper bound is far from trivial. 

In a series of papers, particularly by G. Ringel, special cases of Theorem 3.14, 
which became known as the Heawaod conjecture, were settled, and in 1968 Ringel 
and Youngs (1968) announced the final solution. 

In this section we describe a proof of Theorem 3.14 in the case n =7 (mod 12). 
In other words, K,,,,> can be embedded into S,,,2,5,4,- Since Theorem 3.3 is an 
equality for such an embedding every face is bounded by a K,. In other words, 
5124247441 18 triangulated by the embedding of K,,,,,. The number of faces is 
48k° + 52k + 14 by Theorem 3.2. So, by Theorem 3.7, this problem is equivalent 
with that of describing a cyclic permutation zm, for each node of K,2,,7 such that 
the number of orbits is 48k”? +52k + 14, or “equivalently, each orbit has three 
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3hkod 3kel 3kee aked ahe2 


Figure 3.2. Currents from Z,,,,7. 


edges. Before proving this, we stress the nontrivial character of the problem by 
pointing out that in general, it is not known when a graph G with n nodes and e 
edges can be embedded into the surface S, such that every face is bounded by a 
triangle. A necessary condition is that Theorem 3.3 is an equality which implies 
that g is uniquely determined by G. Another necessary condition is that G is 
locally a wheel in the sense that the subgraph of G induced by the neighbours of 
each node has a Hamiltonian cycle. These two conditions are sufficient when 
g=0 as shown by Skupien (1966). But, they are not sufficient in general as 
demonstrated by Ringel (1978). 

Consider now the cubic graph G, of fig. 3.2. G, has 4k +2 nodes which are 
partitioned into black and white vertices. Now we define a cyclic permutation of 
the edges incident with a fixed node by the clockwise (or anti-clockwise) ordering 
of its incident edges when the vertex is black (respectively white). It is easy to see 
that the graph gets exactly one orbit in this way. (In particular, this determines a 
2-cell embedding of G, into an orientable surface of maximum possible genus, by 
Theorem 3.7. But this will not be used.) This orbit is a cyclic sequence of edges 
such that every edge appears twice. Replacing each edge by the element of 2,5, ,5 
as indicated in fig. 3.2 gives a cyclic permutation of Z,,, ,,;\{0}. Here, Z,,,,7 are 
the integers reduced modulo 12k + 7 and, when an edge labelled j is traversed in 
the negative direction, the corresponding element in the sequence is —j (modulo 
12k +7). If we start with the edge labelled 3k + 4 we get the cyclic permutation of 
© y2447\{0}: 

m1): 3k + 4, 3k +1,3k +3, 3k +2, 5k +3, 12k + 6,9k +4, 12k +5, 
5k+2,.... 
Now we regard Z,,,,, as the node set of K,.,,7. For each node we consider the 


cyclic orientation 7, of the edges incident with j (or, equivalently, of the 
neighbouring nodes) which is obtained from 7, by adding j to each term. That is, 


m2 3k+ 4+), 3K +14t),.... 


The proof is now completed when we show that each orbit in K,,,,7 consists of 
three edges. But this is straightforward to verify using the following property of 
G,: For each node v the sum of labels on arcs entering v equals the sum of labels 
on arcs leaving v. 
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The graph G, of fig. 3.2 is called a current graph. Instead of using Z,5,,,5 as 
labels one may use any abelian group. Current graphs, which were introduced by 
Gustin (1963), played an important role for the solution of the Heawood 
conjecture. For more information on current graphs and the dual concept of 
voltage graphs, and their use in embedding theory (in particular the Heawood 
conjecture), the reader is referred to White (1973, 1978b). Voltage graphs were 
introduced by Gross and are discussed in detail by Gross and Tucker (1987). 

Ringel (1965a) also used the rotation technique to establish the following. 


Theorem 3.15. For all natural numbers k,m =2, 
WKe.m) = [(& — 2)(m — 2)/4]. ; 


The genus has been determined or estimated for various other classes of 
graphs. For example, White (1970) and Pisanski (1980) have calculated the genus 
of several cartesian products of graphs. For details we refer again to White (1973, 
1978b) and Gross and Tucker (1987). 


3.4. Graphs on nonorientable surfaces 


We define the nonorientable genus y(G) of G as the smallest hk such that G can be 
embedded into N,. Clearly, 


y(G) $2y(G) +1 


since N,,,, can be obtained from S, by adding a crosscap. As observed by 
Auslander et al. (1963) there is no upper bound for y(G) in terms of 7(G). This 
is illustrated in fig. 3.3. 

The graph H,, in fig. 3.3 has n = m?’ vertices e = 2m? — 2 edges and f= m—1 


Wi nas 
| UL, VU 
LY L0G 
7 aes) outs 


Figure 3.3. Graphs of large genus in the projective plane. 


Bas 


326 C. Thomassen 


faces each of which is bounded by a 4-cycle. Suppose an embedding of H_, into 
S,¢u,,) has f’ faces. The shaded regions in fig. 3.3 form a Mobius strip which is not 
contained in S,,,,_). Hence one of the 4-cycles in the Mobius strip is not a facial 
cycle in the embedding of H,, in S\., ). So, there are many edges in the 
Syc1,,-embedding which are on faces whose boundaries have more than four 
edges. This implies that f—f’ is large (if m is large). Since 


2y(H,,)=2-nt+e-f'=1+f-f', 


it follows that y(H,,) is large when m is large. Hence, it seems, in some sense, 
easier to embed graphs in a surface N, than in S,. 
Ringel (1954, 1965b) obtained the following results. 


Theorem 3.16. For all natural numbers n, s, t where n>3,s2t22, 
WK,) = [( — 3)(n — 4)/6] and WK, ,) = [(s — 2)(¢- 2)/2]. 


Bouchet (1978) has obtained general results on regular complete k-partite 
graphs. Ringel (1977) and Stahl (1978) showed that the maximum / for which a 
connected graph with nodes and e edges has a 2-cell embedding into N, is 
simply e —n +1 which is simpler than (3.12). Stahl (1975) also established the 
nonorientable version of Duke’s interpolation theorem 3.9. As pointed out by 
White and Beineke (1978) the analogue of Theorem 3.8 does not hold for 
nonorientable surfaces. Also, it is not known if every 2-connected graph G can be 
embedded into S, or N, in such a way that every face is bounded by a cycle of G. 
This would imply the double cover conjecture formulated independently by W.T. 
Tutte and P.D. Seymour. 


Conjecture 3.17. Every 2-connected graph has a collection of cycles such that 
every edge is in precisely two of these cycles. 


One can develop a rotation scheme for nonorientable surfaces. As we are not 
going to use it here we refer the reader to Ringel (1974). 


3.5. Algorithms for embeddings in orientable surfaces 


We have already mentioned that many embedding problems are difficult: It is 
NP-hard to find the thickness, the book-thickness, and the crossing number of a 
graph, respectively, and it is NP-hard to recognize string graphs. The following 
unsolved problem in Garey and Johnson (1979) was solved by Thomassen 
(1989a). 

Theorem 3.18. The following problem is NP-complete: Given a graph G and a 
natural number k, is y(G) <k? 


The proof of Theorem 3.18 can be extended to the nonorientable genus as well. 
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It also shows that it is NP-complete to decide if a given embedding is of minimum 
genus. For fixed genus g, however, the situation is different. The fastest algorithm 
for g=0 is due to Hopcroft and Tarjan (1974). 


Theorem 3.19. There exists a linear time algorithm for deciding if a graph is 
planar. 


The first polynomially bounded algorithm for fixed genus was found by Filotti 
et al. (1979). 


Theorem 3.20. The genus of a graph with n nodes can be determined ip O(n®®) 
steps. 


The genus has been determined for some graphs of a very special type including 
the complete graphs, as we have seen. Also, Theorem 3.20 shows that the genus 
can be found for graphs of bounded genus. We shall describe a method for finding 
the genus for a large general class of graph with unbounded genus. Consider an 
embedding (which we think of as a rotation scheme) of a connected graph G. Let 
C be a cycle in G. If we choose a positive orientation of G we can speak of edges 
incident with C which go to the left (respectively right) side of C. We define the 
right side R (respectively left side L) of C as C together with all paths that start 
with an edge on the right (respectively left) side and which has no intermediate 
vertex in common with C. Clearly R and L can be found in polynomial time. Now 
C is contractible if RON L=C and one of R or L is embedded in the sphere (by 
the induced embedding). To verify this is just a matter of using Euler’s formula. 
If we think of G as drawn on a surface, then C is contractible iff C can be 
continuously deformed into a point. We say that an embedding is an LEW- 
embedding (large-edge-width-embedding) if all noncontractible cycles have length 
greater than all orbits (that is, face boundaries). Thomassen (1990) proved the 
following. 


Theorem 3.21. There exists a polynomially bounded algorithm for finding a 
shortest noncontractible cycle in an embedded graph. In particular, there exists a 
polynomially bounded algorithm for deciding if an embedding is an LEW-embed- 
ding. Moreover, there exists a polynomially bounded algorithm that finds an 
LEW-embedding for any 2-connected graph G that has such an embedding. If G is 
3-connected and has an LEW-embedding, then that is the unique minimum genus 
embedding of G. 


The genus of another large class of graphs can be computed fast by the 
following result of Fiedler et al. (1994). 


Theorem 3.22. There exists a polynomially bounded algorithm that computes the 
genus y(G) of every graph G satisfying y(G) <1. 
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Once a graph has been embedded in S,, where g is fixed, many problems 
concerning G can be solved fast (or nearly solved) simply by using the fact that G 
is either small or has a vertex of small degree. We shall illustrate this by 
considering the chromatic number x(G). Although it is difficult (in fact NP- 
complete) to find y(G) even if g =0 (see Toft’s chapter) we have the following: 
For fixed g, there exists a polynomially bounded algorithm for colouring the 
nodes of a graph G of genus <g in max{7, x(G)} colours such that no two 
adjacent nodes receive the same colour. The algorithm is easy: If G has a node v, 
of degree <6, delete it. If G — v, has a node of degree <6, delete it. This results 
in a sequence v,,U,,--.,U,, Of nodes such that H=G-— {v,,v,,...,0,,} is 
either empty or has minimum degree =7. By Theorem 3.3, H has at most 
12(g —1) nodes. In constant time we can colour the nodes of H with colours 
1,2,...,x(H) such that no two neighbours receive the same colour. Now we put 
back v,,,U,,-;,--->U; giving each of them one of the colours 1,2,...,7. 

Other embedding algorithms following from the Robertson—Seymour theory 
are discussed in sections 5.3 and 5.4. 


4. Graph minors 


In the previous sections we concentrated on the rotation scheme as a tool for 
embedding a given graph into a given orientable surface. It is equally natural to 
seek an explanation of why a given graph cannot be embedded into a given 
surface. Euler’s formula 3.2 can sometimes be used for that. But, as the graphs 
H,, Of fig. 3.3 show, it is possible that a graph satisfies Euler’s formula for S, 
without being embeddable into S,. 

Kuratowski’s classical Theorem 1.4 shows that nonplanarity can be explained in 
terms of excluded minors and, the work of Robertson and Seymour (which we 
review in section 5) shows that the same holds for embedding (and other) 
problems in general. In this section we describe some basic results on graph 
minors. 


4.1. Simplicial decompositions and tree width 


Suppose G is a connected graph and A a minimal separating node set of G. Then 
we can write G = G; UG, where G; and G, are connected and G; NG, = G(A). 
Now suppose further, that G(A) is a complete graph. If G; or G, (say G,) has a 
separating node set which induces a complete graph, then we can write G) = 
GU Gj such that G, and G, are connected and G,M G,, is a complete subgraph 
of G. We proceed like this until none of the resulting subgraphs G,, G,,..., G, 
has a complete separating subgraph. With this notation we have the following. 
Proposition 4.1. The subgraphs G,,G,,...,G, are independent of the order in 
which the decomposition is carried out. 


Proof (by induction on the order of G). If the above graphs G,, G1 are chosen 
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such that G, has a few vertices as possible, then any separating complete 
subgraph of G is contained in G;. Hence G; is in the sequence G,,G,,..., G, 
regardless of the order in which the decomposition is performed. The proof is 
now completed by applying the induction hypothesis to G;. OU 


The graphs G,,G,,...,G, are called the simplicial summands of G. Tarjan 
(1985) has described a polynomially bounded algorithm for finding a separating 
complete graph, and hence the simplicial summands can be found in polynomial 
time. If all the separating node sets in the above decomposition have only one 
vertex, then the simplicial summands are simply the blocks of the graph. The 
blocks of a graph form a tree-like structure. More generally, any graph can be 
built up from its simplicial summands in a tree-like fashion by successively 
identifying complete subgraphs. 

One of the first results on simplicial decompositions is the following of Dirac 
(1961). 


Theorem 4.2. A graph G has the property that every cycle of length at least 4 has a 
chord if and only if every simplicial summand of G is a complete graph. 


The graphs in Theorem 4.2 are called chordal or rigid circuit graphs. A subclass 
of these are the K,-cockade, i.e., the connected graphs in which every simplicial 
summand is a complete graph K,,. If every minimal separating complete graph in 
a K,,-cockade has m nodes, then we say that the K,-cockade has strength m. 

We can also now introduce the tree width of a graph. We say that G has tree 
width <w, and write tw(G) < w, if G is a subgraph of a K,,,, ,-cockade. With this 
notation we have the following. 


Proposition 4.3. tw(G) <1 iff G does not contain K, as a minor (i.e., Gis a 
forest). tw(G) <2 iff G does not contain K, as a minor. 


The first of these statements is trivial. The second is equivalent to Dirac’s 
(1960) characterization of the graphs which contain no subdivision of K, 
combined with the following (which is an easy exercise). 


Figure 4.1. A K,-cockade of strength 2. 
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Proposition 4.4. Let G) be a graph of maximum degree at most 3. Then G 
contains a subdivision of Go iff G has Gy as a minor. 


The graphs of tree width <2 are known as series—parallel graphs. Proposition 
4.3 leads to the problems of characterizing the graphs of tree width <k and the 
graphs containing no K, (or, more generally, an arbitrary graph G,) as a minor. 
For k fixed, polynomial algorithms for these problems follow from the Robertson- 
Seymour theory as we shall see. But, there are still important unsolved problems 
in this area, the most important being Hadwiger’s conjecture (see section 4.3). 


4.2. Wagner's characterization of graphs with no K,-minor 


The following observation shows how simplicial decompositions are useful for 
characterizing the graphs which contain no (fixed) graph Gy as a minor. 


Lemma 4.5. Let G, be a graph which is either (k + 1)-connected or a K,,,,. Let G 
be a graph such that G = G, UG; where G and G, are connected, G, 1G, = K,, 
and V(G{)NV(G;) is a minimal separating set of nodes in G. Then G has a minor 
isomorphic to G, if and only if one of G,, G, has a minor isomorphic to G,. 
Suppose, furthermore, that G,; 1G) is not contained in a K,,, in G. Then G is 
edge-maximal with respect to not having G, as a minor if and only if both of G,, 
G, are edge-maximal with respect to not having G, as a minor. 


Proof. Suppose G has G, as a minor. That is, it is possible to contract edges in G 
such that we get a graph containing G, as a subgraph. The connectivity condition 
on G, ensures that this subgraph cannot have vertices separated by G, NG). 
Hence this subgraph is in (a minor of) G; or G,. The same argument shows that, 
if G is edge-maximal with respect to not having G, as a minor then so are G, and 
G,. Finally, if both G; and G, are edge-maximal with respect to not having Gy as 
a minor, then we show that G U {uv} has G, as a minor for any two nonadjacent 
nodes u and v. If u,v are in the same G,, this is obvious. If u € V(G,)\V(G,) and 
v EV(G)\V(G;), then there is a u' in V(G,) NV(G;) which is not adjacent to u. 
Now G U {uv} has G| U {uv'} and hence G, as a minor. 0 


Figure 4.2. The graph L. 
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Lemma 4.5 shows that in order to characterize the edge-maximal graphs not 
having G, as a minor it is sufficient to characterize their simplicial summands 
(perhaps with some additional information of how they are “pasted together” 
along complete subgraphs). Thus Wagner’s (1937) fundamental characterization 
of the graphs having no K, as a minor can be formulated as follows. 


Theorem 4.6. Let G be an edge-maximal graph having no K, as a minor and 
having only one simplicial summand (i.e., G has no separating complete graph). 
Then G is either a maximal planar graph or isomorphic to the graph L of fig. 4.2. 


Proof. Since G has no separating complete graph, G is 2-connected. We claim 
that G is 3-connected. For suppose that G — {u,v} is disconnected, i.e., G = 
G; UG, where G;, G are connected and G, NG; = {u, v}. Since GU {uv} has 
K, as a minor, Lemma 4.5 implies that G; U {uv}, say, has K, as a minor. But G 
has G; U {uv} as a minor. This contradiction shows that G is 3-connected. 

G need not be 4-connected (because L is not 4-connected) but G is “almost 
4-connected”’ in the following sense: If G — {x, y, z} is disconnected, then it has 
precisely two components one of which has only one node. To prove this we 
assume that G=G,UG, where V(G,NV(G,) = {x, y,z} and |V(G,)|25 for 
i=1,2. For any node u € V(G,)\{x, y, z}, G has three paths P,, P,, P, from u to 
{x, y,z} such that V(P,) VV(P,) = {u} whenever i #j. Since |V(G,)| > 5 and G is 
3-connected, it is easy to see that G, — u has a path P, connecting two of P, — u, 
P,—u, P;—u. Using P,, P,, P;, P, we see that G has a minor G, which is 
obtained from G, by adding all edges between two of {x, y,z}. Since G has no 
K,-minor, G, has no K,-minor. Similarly, G; = G, U {xy, xz, yz} has no K.- 
minor. By Lemma 4.5, GU {xy, xz, yz} has no K,-minor. By the maximality of 
G, G=GU {xy, xz, yz}. But, this contradicts the assumption that G has no 
separating complete graph. Hence G is “almost 4-connected”’. 

If G is planar, then the maximality of G implies that G is a maximal planar 
graph. So assume that G is nonplanar. By Kuratowski’s theorem 1.4, G contains 
a subgraph H which is a subdivision of K,,. Let A, B be the two node sets of H 
corresponding to the partite sets of K, ,. Since G is almost 4-connected, the three 
components of H — A are not in distinct components of G. Hence G has a path P, 
connecting two of the distinct components of H — A. If the nodes of A belong to 
three distinct components of (H U P,) — B, then G has a path P, connecting two 
of these components. 

Now it is easy to see that H U(P, U P,) contains either a path P; connecting 
two disjoint A — B paths of H or else H U(P, UP,) has K, as a minor. Since G 
has no Ks as a minor we can assume the former. Then H'=HUP,; is a 
subdivision of L. 

We claim that G = H’ = L. Note that L has no K, and is maximal with respect 
to not having K, as a minor. Now if G#H'=L, then we pick a vertex 
u€V(G)\V(A') and three paths from u to H’ which are disjoint (except at u). 
This gives a subgraph having K, as a minor (because LZ is K,-free). On the other 
hand, if H’ # L, then H’ has a node of v of degree 3 such that the three nodes vu,, 
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vy, v, which correspond to the neighbours of v in L are not all neighbours of v in 
H'. Hence G has a path connecting the distinct components of H’ — {v,, v2, U3} 
and this gives a subgraph of G having K, as a minor. This contradiction proves 
thatG=L. OU 


Wagner’s theorem 4.6 combined with Proposition 4.1 and the discussion 
preceding Proposition 4.1 shows that the edge-maximal graphs having no K,- 
minor are obtained by successively pasting maximal planar graphs and copies of L 
together along complete subgraphs. Conversely, if we form a graph of this type by 
always choosing the complete subgraphs such that they are maximal in one of the 
graphs we paste together, then the resulting graph can easily be shown to be 
edge-maximal with respect to not having K, as a minor. 

Theorem 4.6 is sometimes called Wagner's equivalence theorem because it 
shows that the 4-colour-theorem is equivalent to Hadwiger’s conjectures for 
k =5. This follows because the chromatic number of a graph is the maximum 
chromatic number of its simplicial summands. 

It is easy to see that if a graph has K, as a minor then either it contains a 
K,-subdivision or else it has K,, as a minor. Moreover, every 3-connected graph 
with at least six nodes which contains a K,-subdivision has K,, as a minor. Hence 
Theorem 4.6 together with its method of proof implies the following. 


Corollary 4.7. if G is an edge-maximal graph having no K, as a@ minor, then the 
simplicial summands of G are either maximal planar graphs or copies of Ks. 


Note that the proof method of Theorem 4.6 also quickly gives the part of 
Proposition 4.3 concerning K,. 

Theorem 4.6 and Corollary 4.7 describe the “building stones’’ for the graphs 
having no minor isomorphic to K,, respectively K,,. These building stones have 
also been described for other small graphs (see Bollobas 1978 and Wagner 1970). 
Recently, Robertson (private communication) has solved the problem for the 
graph L of fig. 4.2. 


4.3. Hadwiger’s conjecture 


Hadwiger’s conjecture (which is also discussed in Toft’s chapter) states the 
following. 


Conjecture 4.8. If the graph G has chromatic number at least &, then it has K, as 
a minor. 


& 


Wagner (1964) proved the following weakening of Hadwiger’s conjecture. 


Theorem 4.9. There exists a function f(k) such that every f(k)-chromatic graph has 
K, as a@ minor. 
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Since every k-chromatic graph contains a subgraph of minimum degree at lest 
k — 1 (see Toft’s chapter), Theorem 4.9 also follows from the following result of 
Mader (1967). 


Theorem 4,10. For each k there exists a smallest natural number g(k) such that 
every graph of minimum degree at least g(k) contains K, as a minor. 


Proof. We show by induction on k, that g(k) <2". For k = 1 this is trivial. So 
assume that G is a graph of minimum degree >2*. Then G has at least 2*-1v(G)| 
edges. Let H be a maximal connected subgraph of G such that contracting H to a 
single vertex results in a graph H’ with at least 2“~'|V(H ’)| edges. Let H” denote 
the subgraph of G ~ V(H) induced by the nodes joined to H. Let u be any vertex 
of H” and let m be the degree of u in H". Let H” be the graph obtained from G by 
contracting V(H) U {u} into a single vertex. The maximality of H implies that 


|E(H")| <2*""|V(H")| = 2°""(\V(H")| - 1). 
Clearly, 
|E(H”")| = |E(H’) 


—(m+1)22*"'|V(H')|-(m4+1). 


Hence m >= 2*~'. By the induction hypothesis, H” has K,_, as a minor and, since 
each node of H” is joined to H which is connected, G has K, as a minor. O 


The function g(k) was determined for small complete (or nearly complete) 
graphs up to eight vertices by Dirac (1964) and Jakobsen (1972). These 
investigations indicated that g(k) is linear (see Wagner 1970, p. 156 and Bollobas 
1978, p. 378). But the following holds. 


Theorem 4.11. With the terminology of Theorem 4.10, there exist positive con- 
stants C,, C such that 


c,kVlog k < g(k)<c,kVlogk. 


The upper bound was found by Mader (1967) and the lower bound was derived 
from random graphs by Kostochka (1984) and De la Vega (1983). 
One of Dirac’s (1964) results states the following. 


Theorem 4.12. If G is a graph with n nodes and at least (7n — 15)/2 edges, then G 
has K, (less an edge) as a minor unless G is a K,-cockade of strength 3. 


Using Theorem 4.12 Dirac then proved the following generalization of the 
5-colour-theorem. 


Theorem 4.13. If G has chromatic number at least 6, then G has K, (less an edge) 
as a minor. 
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Recently, Hadwiger’s conjecture for k =6 was proved by Robertson et al. 
(1993a). 


5. Embeddings and well-quasi-orderings of graphs 


Konig (1936) asked if there exists a Kuratowski type characterization of the 
graphs not embeddable in a given surface S. Glover et al. (1979) and Archdeacon 
(1980) obtained such a characterization for the projective plane. For no surface 
other than the sphere and the projective plane a complete characterization is 
known. A penetrating result of Archdeacon and Huncke (1989) asserts that, for 
any nonorientable surface N,, there exists a finite family of graphs %, such that a 
graph G is embeddable in N, if and only if G has no minor (isomorphic to a 
graph) in ¥,. This general result is a special case of the following powerful result 
of N. Robertson and P. Seymour which is central in the theory of minors and 
embeddings. It provides in particular an affirmative answer to K6nig’s question. 


Theorem 5.1. Let p be any graph property such that, for any graph G with 
property p, every minor of G has property p. Then there exists a finite family ¥, of 
graphs such that a graph G has property p if and only if G has no minor in F,. 


This theorem is the culmination of the series of graph minor papers of 
Robertson and Seymour. The rest of this chapter is devoted to a discussion of 
graph structure results and algorithms related to Theorem 5.1. 


5.1. Well-quasi-orderings 


A partially ordered set (M, <) is well-quasi-ordered (abbreviated wqo) if for each 
infinite sequence a), a@,,a,,... of elements from M, there are natural numbers i, 
j such that i<j and a,;<a;. Clearly, M has no infinite strictly decreasing 
sequence. Ramsey’s theorem for infinite graphs can be formulated as follows: 
Every infinite oriented graph contains an infinite set of nodes which induce a 
subgraph with no edges or a subgraph isomorphic to the positive (respectively 
negative) numbers where there is a directed edge from m to p when m <p. So, if 


(M, <) is wqo, and a,,a,,... is any infinite sequence, then there exists an infinite 
subsequence G,,4),-.. such that a, <a,<---. 
If a@=a,,a,,...,a, and b=b,,b,,...,b,, are finite sequences from M we 


write a <b if there exists a subsequence b, ,b, ,...,0, (1Si,<i,< +--+ <i,< 
m) such that a,<b, for j=1,2,...,n. With this notation Higman’s (1952) 
theorem states the following. 


© 


Theorem 5.2. If M is wqo, then the set of finite sequences of M is w@qo. 


Proof (by contradiction). Assume @,,@,,...is an infinite sequence of finite 
sequences such that a, a, whenever i<j. Assume that the sequence has been 
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chosen such that a, has minimum length, subject to this condition a, has 
minimum length; subject to these conditions a, has minimum length, etc. Let a, 
be the first element of a, and let a; be obtained from a; by deleting a,. Now, there 


is an infinite sequence a, ,a,,,...such that i,<i,<--- and a, Sa; for 
p=i,2,.... The minimality ‘of a,,a,,...implies_ that the sequence 
@,,@),... 44, ri q; a;,. .Is not a counterexample to Theorem 5.2. So either 


a, <a, for some i, j where aie aa ee or else a; <a; for some i, j where 
1<i<i, —1 and j=1, or else a; <a) for some p, q where 1<p <q. In each 
case we obtain a contradiction to the ssumption that a,,a,,...is a counterex- 
ample to Theorem 5.2. O 


Now consider two finite trees T,, T, such that each node in 7, and T, is 
labelled by an element of M and such that a node vu, in T, is distinguished as a 
root, We write T, = T, if T, contains a tree T; such that T} is a subdivision of T,, 
the vertex in J; corresponding to v, is v,, and if a is the label of a vertex v in T, 
and 5b is the label of the corresponding vertex in 7,, then a<b. With this 


terminology we have the following general version of Kruskal’s (1960) theorem. 


Theorem 5.3. The collection of finite rooted trees labelled by elements of a wqo set 
M is w@qo. 


If T is a tree with root v and 7,, T,,..., T,, are the trees of T — v each rooted 
at the node adjacent to v, then we may think of T as the sequence T7,, 
T,,...,T,. This observation enables us to prove Theorem 5.3 in almost the 
same way as Theorem 5.2. 

Kruskal’s theorem (without reference to M) is sometimes formulated as 
follows: The collection of finite trees is wqo under topological containment. 
Wagner (1970) observed that this does not hold for graphs in general: just 
consider the sequence of double cycles, i.e., the graphs obtained from a cycle of 
length m by replacing each edge by a double edge. Also, the collection of trees is 
not wqo under inclusion. However, Theorems 5.2 and 5.3 imply the following 
results observed by Robertson and Seymour, and Mader (1972), respectively. 


Theorem 5.4, Let k be a natural number. The collection of graphs with no path of 
length k is wqo under inclusion. 


Proof. We prove by induction on k the following stronger statement: If 
H,,H,,...is an infinite sequence of graphs with no path of length k and each 
node of each H, is labelled by an element of a prescribed finite set M, then there 
exist i, j (i< i) such that H, contains an induced subgraph which is isomorphic 
(considered as a labelled graph) to H,. For k =1 the statement is an easy exercise, 
so assume that k =2. By Theorem 5.2 it is sufficient to prove rae statement for 
connected graphs. We can assume that each H, has a path x\x,---x, of length 
k — 1 since otherwise we can consider an appropriate sibeesice nes: and apply the 
induction hypothesis. Without loss of generality we can assume that the subgraphs 
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H,({x\,*5,--->%,}) are label-isomorphic for i=1,2,.... Now H; =H,- 
{x\,%5,...,,} has no path of length k — 1 (because no connected graph has two 
disjoint longest paths). Every node v of H; has a label from M. We now give it a 
label from the cartesian product of M and the set of subsets of {1,2,...,k}; the 
second coordinate is the set of those j such that v is adjacent to x}. By the 
induction hypothesis, some H; has an induced subgraph which is label-isomorphic 
to some H; such that i<j. Then H, contains an induced subgraph which is 
label-isomorphic to H;. O 


For the next result we need the following result of Erdés and Pdsa (1965). 


Theorem 5.5. There exists a function f(k) such that every graph G with no k 
disjoint cycles contains a set S of at most f(k) nodes such that G — S is a forest. 


Theorem 5.6. Let k be a natural number. The collection of graphs having no k 
disjoint cycles are wqo under topological containment. 


Proof. Let H,,H,,... be an infinite sequence of graph with no k disjoint cycles. 
By Theorem 5.5, let S,;CV(H,) such that |S,|<f(k) and H,—S, is a forest for 
i=1,2,.... Without loss of generality we can assume that all the graphs H,(S,) 
are isomorphic. Let us assume that S$, = S,= --- and let us label each node of 
H, — S; by the subset of S; = S$, to which it is joined. By Theorem 5.3, there exist 
i, j such that i<j and H,— S, contains a forest which is a subdivision of H, — S; 
with ‘greater labels”. This completes the proof. O 


5.2. Tree width and minors 


Recall that tree width is defined in section 4.1. The graphs of tree width <w, 
where w is a fixed natural number, behave in many respects like trees although 
they are not wqo under topological containment as shown by (subdivisions of) the 
double cycles. By Theorem 5.1 they are wqo by minors as shown first by 
Robertson and Seymour (1990a). This is not surprising in view of Theorem 5.3, 
but the proof involves many technical details. 

A further partial result towards Theorem 5.1 is that, if an infinite sequence of 
graphs contains just one planar graph, then one of the graphs in the sequence has 
another as a minor. This follows from the above-mentioned result of wqo of 
graphs of bounded tree width combined with the fundamental tree width theorem 
of Robertson and Seymour which we shall now describe. 

Consider the grid G, consisting of k disjoint paths P;: x, ,x,,°+>x,,, and the 
path Pi: x, x, 2°--x,, and the path Pj: x, )X,2°''X,4 and all edges x, x;., ; 
where i +j is odd. (See fig. 5.i*) 

If a graph G contains a minor of G, (and hence a subdivision of G, by 
Proposition 4.4) for & large, then G has large tree width. This follows because any 
subgraph of a graph of tree width <w has tree width <w. And, G, does not have 
small tree width when k is large. The easiest way of seeing this is probably to 
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Figure 5.1. The grid G,. 


verify that G, does not have the separation property of Proposition 5.11 when k is 
large and w is small. The tree width theorem 5.7 shows therefore that a necessary 
and sufficient condition for a graph to have large tree width is that it contains a 
G,-subdivision for some large k. 


Theorem 5.7. There exists a function f(k) such that every graph of tree width >f(k) 
contains a subdivision of G,. 


For any fixed planar graph H, there exists a natural number & such that G, has 
H, as a minor. To see this we first observe that there exists a cubic planar graph 
H, having H, as a minor. Now H, can be drawn in the plane such that all edges 
are straight line segments and every such edge can be modified such that it 
becomes a polygonal arc whose segments are all vertical or horizontal. Then it is 
easy to see that, for some large k, G, contains a subdivision of H,. So, Theorem 
5.7 implies that every graph with sufficiently large tree width has H, as a minor. 
This statement is best possible in the sense that it becomes false if Hy is 
nonplanar. This follows from the fact that the grids G, are planar and of large 
tree width when <x is large. 

Besides its afore-mentioned applications to wqo by minors, Theorem 5.7 has 
other applications. Let us define a K,,-path-cockade as a graph which is the union 
of graphs H,, H,,... Hi, each of which is a K,, such that, for each i= 
2,3,...,4, (H, UH,U --- UH,_,)H, is a (complete) subgraph of H,_,. We 
say that a graph G has path width <w if G is contained in a K,,, ,-path-cockade. 
Also this concept was introduced by Robertson and Seymour (1983) who proved, 
roughly speaking, that a graph has large path width if and only if it has any large 
tree as a minor. More precisely, let 7, be the tree containing a vertex x, such that 
X, has degree 2, all other vertices have degree 1 or 3, and all maximal paths 
starting at x, have length &. It is not difficult to show that graphs of bounded path 
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width cannot contain subdivisions of 7, for arbitrarily large k. This shows that, in 
a sense, the following analogue of Theorem 5.7 is best possible. 


Theorem 5.8. There exists a function f'(k) such that every graph of path width 
>f'(k) contains a subdivision of T,. 


Theorem 5.8 was proved before Theorem 5.7. But Theorem 5.8 is not difficult 
to derive from Theorem 5.7 because a G, with & large contains a subdivision of 
T,. for k' large. Hence it is sufficient to prove Theorem 5.8 for graphs of bounded 
treewidth which is not difficult (although some technical considerations are 
needed). 

Theorem 5.7 can also be applied to prove a considerable extension of Theorem 
5.5 of Erdés and Pésa. Motivated by Theorem 5.5 let us say that a family ¥ of 
graphs has the E—P-property if there exists, for every natural number k, a natural 
number f(k, #) such that the following holds: [If G is any graph, then either G 
contains k disjoint subgraphs each isomorphic to a member of ¥ or else G 
contains a set S of at most f(k,#) nodes such that G—-S has no subgraph 
isomorphic to a member of ¥. Theorem 5.5 says that the collection of cycles has 
the E—P-property. Graphs in the projective plane (of the type indicated in fig. 
3.3) show that the collection of odd cycles does not have the E—P-property. 
Thomassen (1988b) showed that every family ¥ of connected graphs has the 
E-P-property if we only consider graphs of bounded tree width. It was also shown 
that, for every natural number d, &, there exists a natural number A(d, k) such 
that every subdivision of the grid G,.,,, contains a subdivision of G, such that 
every “edge” of G, is a path of length divisible by d. Then these results combined 
with Theorem 5.7 give the following general result. 


Theorem 5.9, Let G, be a planar graph of maximum degree at most 3 and let d be 
any fixed natural number. If ¥ is a family of connected graphs containing all those 
subdivisions of G, in which every ‘‘edge” is a path of length divisible by d, then ¥ 
has the E-P-property. 


In particular, if * contains all cycles of length divisible by a fixed number d, 
then ¥ has the E-P-property by Theorem 5.9. 

We say that F is closed under subdivision if every subdivision of a graph in F is 
also in ¥. 


Theorem 5.10. Let ¥ be a family of connected graphs closed under subdivision. A 
sufficient condition for ¥ to have the E-P-property is that ¥ contains a planar 
graph of maximum degree 3. A necessary condition is that ¥ contains a planar 
graph which can be drawn in ihe plane (without crossings of edges) such that all 
nodes of degree =4 are on the boundary of the outer face. 


The first part of Theorem 5.10 follows from Theorem 5.9. The necessity of 
Theorem 5.10 was proved by Thomassen (1988b). Robertson and Seymour 
(1990a) proved previously that ¥ must contain a planar graph. For, if all graphs 
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Figure 5.2. Modifying a graph to a “dense” graph. 


~ 


in ¥ are nonplanar, then we choose a graph H, in ¥ of minimum genus g. Now 
let Hy be embedded in S,. We replace every edge of H, by a large multiple edge 
and modify the embedding locally as indicated in fig. 5.2. 

The resulting graph will contain a subdivision of H, even after the removal of a 
large node set. On the other hand, it contains no two disjoint graphs of ¥ since 
the disjoint union of such graphs has genus >2g>g by the genus additivity 
theorem 3.10 and the definition of g. 


5.3. Tree width and algorithms 


From an algorithmic point of view the graphs whose tree width are bounded 
above by a fixed number are convenient since they have a tree-like structure 
which we formalize below. Many problems which are intractible for graphs in 
general are solvable in polynomial time for graphs of bounded tree width, a fact 
which has been discovered independently in applied mathematics, for example in 
so-called expert systems and other areas in statistics (S. Lauritzen, private 
communication). 

With any graph G of tree width <w we can associate a tree as follows: Assume 
G' is a connected K,,,,-cockade containing G as a spanning subgraph. Then G’ 
can be described as a union of its simplicial summands H,, H,,..., H, such that, 
for each i, H, UH, U --- UH, is connected and (H, UH, U --> UH;)NH;,, is a 
complete subgraph contained in some H, , 1 <j; <i. Let T be the tree with node 
set x,,X,,...,x, such that, with the above notation, x,,, is joined to x, and no 
other node in {x,,x,,...,x,}. We shall call T a w-tree associated with G. Note 
that T is not unique. If G= K, ,, then there are two 1-trees associated with G, 
namely K, , and the path of length 3. If T is a tree with g nodes, then it is easy to 
find a node v such that no component of T — v has more than g/2 nodes. Hence 
we can partition the components of 7 — v into two parts none of which has more 
than 2(q — 1)/3 nodes. This leads to the following partition property of graphs of 
tree width <w as noted by Robertson and Seymour (1986a). 


Proposition 5.11. Every graph G of tree width <w has a node set S and two 
subgraphs G,, G, such that G=G,UG,, V(G,)NV(G,) =S, |S|<w+1, and 
|V(G,)| < w + 1+ 2(|V(G)| — w — 1)/3 for i=1, 2. 
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Proposition 5.11 implies the existence of polynomially bounded algorithms for 
several NP-complete graph problems when these are restricted to graphs of 
bounded tree width. To illustrate this, we describe polynomially bounded 
algorithms for k-colouring a graph G or proving that it has tree width >w (where 
k and w are fixed natural numbers). We first consider all (w + 1)-element subsets 
S of V(G) in order to find the decomposition in Proposition 5.11. This can be 
done in O(n”*’) steps. If such a decomposition does not exist, then G has tree 
width >w. If it does exist we consider all possible k-colourings of G(S). (There 
are at most k”*’ such colourings). For each colouring of G(S) and for each of G,, 
G, we contract each colour class into a single vertex (and joint any two of these 
vertices by an edge), and repeat the algorithm for the resulting graphs. An easy 
count shows that this algorithm is polynomially bounded. Note that this algorithm 
does not involve a polynomially bounded algorithm for deciding if G has tree 
width <w. Such an algorithm follows from the solution of the k-path problem 
which we discuss in the next section. The k-path problem for the graphs of 
bounded tree width was solved in Robertson and Seymour (1986a) and that 
method has interesting consequences, too. Thomassen (1988b) pointed out that it 
yields the following. 


Proposition 5.12. For fixed natural numbers w, k, d, d,, d,,...,d,, there exists a 
polynomially bounded algorithm for deciding whether a graph G has tree width 
>w or contains, for any prescribed nodes x,, Yy, Xz, Yo>---+>Xg, Yur & disjoint 
paths P,, P,,...,P,, such that P, connects x; and y; and has length d, (mod a) for 
i=1,2,...,k. 


The algorithm of Proposition 5.12 is similar to the afore-mentioned colouring 
algorithm. We split the graph into G, and G, as in Proposition 5.11 and then we 
translate the k-path problem in G into a (fixed) number of k’-path problems in 
G, and G,. We can assume that k = w and then k’ <2k. If necessary, we use (a 
modification of) Proposition 5.11 to split G, and G, such that we get k'<k for 
each of the graphs into which G has been split. Then we repeat the algorithm for 
each of these graphs. 


Corollary 5.13. For each fixed natural number d there exists a polynomially 
bounded algorithm for deciding whether a graph G has a cycle of length divisible 
by d. 


Proof. If G has sufficiently large tree width then it has such a cycle by Theorem 
5.7 and the remark preceding Theorem 5.9. If G has bounded tree width, then we 
apply Proposition 5.12 to each pair of adjacent nodes x,, y, letting k= 1 and 
d,=d-1. O . 


Arnborg and Proskurowski (1989) have shown that many difficult graph 
problems (for example the k-colouring problem and that of finding a Hamiltonian 
cycle) can be solved even in linear time for graphs of bounded tree width. 
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3.4. The k-path problem with applications to embeddings: Excluded minor 
theorems and algorithms 


Robertson and Seymour (1994b) proved the following. 


Theorem 5.14. For each fixed natural number k, there exists a polynomially 
bounded algorithm for deciding whether a graph G with prescribed nodes x,, y,, 
Xa, Yos+++sXqs Y_ COntains k disjoint paths P,, P,,...,P, such that P, joins x, 
and y, fori=1,2,...,k. 


Corollary 5.15. For each fixed graph Gy, there exists a polynomially bounded 
algorithm for deciding whether a graph G contains a subdivision of Gy 


Proof. Assume that G, has p nodes and k edges. Now consider a graph G with n 
nodes. If G' is a subdivision of G, in G, then we define the skeleton of G' as the 
subgraph of G’ obtained by deleting all those nodes of G’ which have distance (in 
G’) at least 2 to those nodes of G' that correspond to V(G,). The skeleton of G’ 
has at most p + 2(3) nodes. Thus the number of subgraphs of G which can play 
the role of a Go-skeleton is polynomially bounded, and, for each such subgraph, 
the problem of deciding whether it really is a skeleton of a G,-subdivision is the 
k-path problem. O 


Proposition 5.16. For each graph Gy, there exists a finite collection G,, 
Gy nse es G, of graphs such that an arbitrary graph G has G, as a minor if and 
only if G contains a subdivision of one of G,, G,,..-.,G,. 
Proof. Let Gj be the graph obtained from G, by subdividing each edge of Gy 
twice. Consider any node v in G, corresponding to a node in G, and let N(v) 
denote the neighbours of v in Gj. Now delete v, partition N(v) into classes and 
identify each class into a single vertex. Then add new nodes and edges such that 
these together with (the classes of) N(v) form a tree where none of the new nodes 
have degree 1. Thus the number of new vertices we add is less than |N(v)| < 
|V(G,)|. We perform this construction for every node v of G, and let G,, 
G,,...,G, denote all the graphs which can arise in this way. Clearly each G, has 
G, as a minor. Conversely, if G has G, as a minor, then, for every vertex v of Gy, 
the vertex set in G which has been contracted into v induces a connected 
subgraph G(v) of G. We consider all the connected subgraphs G(v) and for any 
pair G(v), Gu) we select an edge e(u, v) in G between them provided v and u are 
adjacent in G,. Then we select in each G(v) a minimal connected subgraph G’(v) 
containing all ends of the edges e(u, v), u © ¥(G,). The union of the edges e(u, v) 


and the trees G'(v) form a subdivision of one of G,, G,,...,G,. O 


Theorem 5.1, Corollary 5.15 and Proposition 5.16 imply the following which is 
probably the most general result on graph algorithms. 
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Theorem 5.17. Let p be any graph property such that, for every graph G with 
property p, every minor of G has property p. Then there exists a polynomially 
bounded algorithm for deciding whether an arbitrary graph has property p. 


Robertson and Seymour proved that the algorithm in Theorem 5.17 can be 
chosen such that it is bounded by C, n° where n is the number of nodes of G and 
C, is a constant depending on p. 

"An important consequence of Theorems 5.1 and 5.17 is the following. 


Theorem 5.18. Let S be any 2-dimensional surface. There exists a finite collection 
G,, G,,...,G, of graphs such that an arbitrary graph G can be embedded into S 
if and only if G has none of G,, G,,...,G, as a minor. Furthermore, there exists 
a polynomially bounded algorithm for deciding this. 


In particular, there exists a polynomially bounded algorithm for deciding if G 
can be embedded into S, or N, (when g and h are fixed). 

If a graph G has tree width <w, then every minor of G has tree width =w. 
Thus Theorem 5.17 implies the existence of a polynomially bounded algorithm for 
deciding if a graph has tree width <w. A corresponding excluded minor theorem 
follows from Theorem 5.1. 

The excluded minor characterizations that follow from Theorem 5.1 and hence 
also the algorithms of Theorem 5.17 are implicit in the sense that, in general, we 
do not know the family ¥, in Theorem 5.1. As mentioned earlier, the sphere and 
the projective plane are the only compact 2-dimensional surfaces for which we 
know a quantitative version of Theorem 5.18. The afore-mentioned result of 
Archdeacon (1980) and Glover et al. (1979) states the following. 


Theorem 5.19. There exists a collection of 35 graphs G,, G,,..., G3, such that 
an arbitrary graph G can be embedded in the projective plane if and only if it has 
none of the graphs G,, G,,..., G35 a8 a minor. 


3.5. Concluding remarks 


As we have seen, the well-quasi-ordering result Theorem 5.1 and the connectivity 
result Theorem 5.14 are of central importance to minors and embeddings. 
Conversely, their proofs are based on investigations of embeddings. Previously, 
embeddings on higher surfaces were studied for their own sake. (White’s 
application 1978a of embeddings to design theory is an exception.) The Robert- 
son—Seymour theory shows that graph embeddings have a natural place in general 
graph theory. Also, the proofs of Theorem 5.1 and Theorem 5.14 give much 
additional insight in embeddings, minors and connectivity which is worth 
mentioning. We have indicated how the solution of the k- -path problem for graphs 
of bounded tree width depends on nice separation properties of such graphs. 
Similar (but more complicated) separation properties are used for the k-path 
problem for graphs of bounded genus in Robertson and Seymour (1988). Suppose 


Embeddings and minors 343 


G is a graph on a surface S and suppose s,,5,,.-.,5,,¢,5---,¢, are nodes of G 
and we want to find disjoint paths P,, P,,...,P, such that P, connects s, and ¢, 
fori=1,2,...,k. Robertson and Seymour showed that these paths exist if G has 
the following properties (which we describe informally). 

(i) the vertices s,,...,¢, are “far apart” in the sense that every closed curve 
on S which separates {s,,...,#,} in the topological and sense and every open 
curve which connects two of {s, ,...,¢,} must have many points in common with 
G and 

(ii) G is “dense” on S in the sense that every non-null-homotopic curve on S$ 
has many points in common with G. 

This result was also applied to the following result on minors (again described 
informally). - 

lf H is a fixed graph on S and G is any graph on S which is dense on S (in the 
sense of (ii) above), then G has H as a minor. 

This result was then used to prove the restriction of Theorem 5.1 to graphs on a 
fixed surface and thereby obtain the excluded minor theorem (which is a special 
case of Theorem 5.1) for graphs on a fixed surface. 

In the general k-path problem also the tree width theorem plays an important 
role since the presence of a large grid can be useful for connecting prescribed 
nodes with disjoint paths. 

Although Theorem 5.17 gives a so-called good characterization for describing 
the graphs G that have no fixed graph H as a minor, it would still be desirable to 
have a more precise structural characterization of these graphs of the same type as 
Wagner’s characterization Corollary 4.7 of the graphs having no K, as a minor. 
Robertson and Seymour (1984a) suggested that, for any fixed graph H, there 
might exist natural numbers k and g (depending on H) such that any graph G 
with no minor isomorphic to H contains a set S of at most k nodes such that 
G —S is contained in a graph G’ with the property that each of the simplicial 
summands of G’ can be embedded into S,. However, Seese and Wessel (1985) 
answered this in the negative by the following interesting grid like graphs: 
Consider the graph whose nodes are all pairs (i, /), where 1 <i<n, 1<j <n, and 
whose edges are all ((i, j), (é', j’)) such that |i — i’| + |j —j’| =1. Then add a path 
X,X,°+-x, and join x, to (2,i). The resulting graphs have no K, as a minor and 
yet they do not have the above embedding property. Robertson and Seymour 
(1994a) got around this problem by a structural characterization (more compli- 
cated than suggested above) of the graphs which have no (fixed) graph H as a 
minor. Roughly speaking, those graphs are obtained by pasting graphs of genus 
<y(H) together in a tree-like fashion, except for a few nodes and a few “bad” 
areas. 

The restriction of Theorem 5.1 to graphs of maximum degree <3 is known as 
Vazsonyi’s conjecture. Nash-Williams (1965) suggested the following extensions 
of Vazsonyi’s conjecture, known as the immersion conjecture. 


Theorem 5.20. Suppose G,, G,,... is an infinite sequence of graphs. Then there 
are integers i, j (i<j) and an injective map f: V(G;)>V(G,) and a map g from 
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E(G,) into a set of pairwise edge-disjoint paths of G, such that , for any edge e = xy 
of G,, g(e) connects f(x) and f(y). 


Robertson and Seymour (private communication) have extended Theorem 5.1 
to hypergraphs in such a way that the immersion conjecture follows as a ‘dual 
version” of Theorem 5.1. 

Another problem related to Theorem 5.1 and motivated by K6nig’s problem 
mentioned at the beginning of this section is the intertwining conjecture of Lovasz 
and Milgram (see Ungar 1979). 


Theorem 5.21. For any two graphs G,, G, of minimum degree at least 3, there are 
only finitely many graphs G of minimum degree at least 3 which are edge-minimal 
with respect to the property of containing subdivisions of both G, and G,. 


Robertson and Seymour (private communication) have recently obtained a 
labelled version of Theorem 5.1 from which the intertwining conjecture follows. 

Finally we mention that Theorem 5.1 is not true for infinite graphs, as proved 
by Thomas (1988). It is unsolved whether or not Theorem 5.1 holds for countable 
graphs. 
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1. Introduction 


In the most general terms, a random graph is a pair (G, P) where & is a family of 
graphs and P is a probability distribution over %. 

More specifically take ¢ to be the family of all graphs on n labeled vertices and 
assign to each G € & a probability measure 


P(G) = p'O = pyP™, 


where 0 < p <1 and e(G) denotes the number of edges of G. 

We can describe this model in terms of independent Bernoulli experiments 
performed on the edges of a complete graph K, with n vertices. Namely, delete 
each of the (3) edges with probability 1—p, independently of all other edges. 
Equivalently, one may start with an empty graph on the vertex set and insert 
edges independently, with the same probability p. In the sequel such a random 
graph is denoted G(n, p) and is often called a binomial random graph. Note, that 
in G(n, p) the number of edges is a binomially distributed random variable with 
expectation (5)p. From now on we often assume that the edge probability p is a 
function of the number of vertices, i.e., p = p(n). 

Another fundamental model, having a ‘‘counting” flavor, is obtained by 
considering @ to be the family of all graphs with n labeled vertices and with 
exactly N edges, O< N<(5). To every GE @, assign a probability measure 


pa)=|9) t= (©) 


Intuitively, we sample at random a graph from the family @. Equivalently, one 
can get such a random graph by picking at random a given number N of edges, in 
such a way that all possible (%) choices are equally likely, and removing the 
remaining ones from K,, or by simply inserting N edges at random into an empty 
graph on n vertices. We denote such a random graph by G(a, N) and call it a 
uniform random graph. 

Many natural questions of deterministic graph theory can now be translated 
into a probabilistic language. In random graph theory we are mainly concerned 
with establishing the probability that a random graph possesses a given property, 
i.e., the probability that the random graph belongs to the class of graphs 
possessing it. Interesting properties include connectedness, hamiltonicity or 
containment of a perfect matching, to name only a few. Most often we are 
interested in when the probability of such an event tends to one as the number of 
vertices of a random graph goes to infinity. In this case we say that the random 
graph has this property almost surely (a.s., for short). Moreover, various 
numerical invariants, for instance vertex’ degree, vertex or edge connectivity, 
chromatic number and girth, can be considered random variables when the graph 
in question is random. We then ask for the probability distribution of such random 
variables. Usually we are interested in asymptotic distributions as n— ©. 

In this chapter we are concerned with problems of the above type and we 


354 M. Karonski 


restrict our attention to binomial and uniform random graphs. We aim to give the 
reader an idea of the nature of typical problems in the field, basic methods used in 
proofs, as well as to lead the reader from the origins to most recent results. 

It is widely accepted that the birth of the theory of random graphs is the 
publication of the seminal paper on the evolution of random graphs by Erdos and 
Rényi (1960), followed by a sequence of their other fundamental contributions 
(for the entire collection see Erdés 1973 or Rényi 1976). Since then several 
hundred papers have been published in various areas of random graph theory. 
The growth of the field and its impact on other branches of combinatorics, is 
extensively presented in the excellent monograph of Bollobas (1985). This book is 
highly recommended to all who want to undertake a serious study of the field. 
One may, however, begin with an introductory book by Palmer (1985), as well as 
to consult review papers by Grimmett (1983), Karonski (1982) or Stepanov 
(1973). (For recent developments in research on random graphs we refer the 
reader to Random Structures & Algorithms, the journal solely devoted to that 
topic.) 


2. Evolving graphs 


Let us adopt a dynamic view in which a graph, acquiring edges, grows from empty 
to complete. More precisely, start with labelled vertices {1,2,...,} and add 
edges at random, one at a time, until the resulting graph is complete. This simple 
model of a graph evolving in a random manner was first investigated in detail by 
Erdoés and Rényi (1960). The key question they posed may be stated in general 
terms as follows: what is a typical structure of a random graph during its 
evolution? We say that a structure (or a graph property) is typical if a random 
graph possesses it almost surely. 

Denote by @ the set of all labelled graphs on the vertex set {1,2,...,m}. 
Define a graph process (G,)\? as a sequence of graphs from &, such that e(G,) =t 
and GyCG,C --» CGi.). A graph process is random if G,,, arises from G, by 
adding to it a single edge chosen at random from the (3) —¢ remaining edges, 
0<+t<({) — 1. Equivalently, we may choose at random one particular process 
(G,){® from all (2)! graph processes. Clearly, since G,,, depends only on G,, this 
random graph process has the Markov property. 

By a graph property we mean a subset of the set G. We say that a graph has a 
property if it is an element of #. For example, # may be the set of all 
connected graphs, Hamiltonian graphs, graphs with isolated vertices, graphs with 
a given chromatic number, graphs without a forbidden subgraph, etc. A graph 
property is increasing (decreasing) if the fact that G € »# and G is a spanning 
subgraph (supergraph) of the graph H implies that H € #. The property # = 
{G € : G is connected} for example, is increasing, # = {G € &: G has isolated 
vertices} is decreasing while # ={GE: y(G)=k,k EN} is neither increasing 
or decreasing, i.e., is not monotone. Notice also that the complement of an 
increasing property (“not ’’) is decreasing. if is an increasing property and 
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(G,)<® is a graph process, then a natural question arises: when is the first time G, 
acquires the property 0, i.e., what is the hitting time + = min{t: G,E of} of 7 
Obviously, when the process is random then the problem is to find the limit of 
Prt =N) for N=0,1,...,(5). Although such a “dynamic” discrete time 
approach to the study of the evolution leads to very precise results, we shall take 
an easier ‘static’ approach, looking at the state of the process at a given time 
t= N. Notice that the set of all possible (‘2’) states (graphs) of the process (G, yo) 
at t= N consists of all labelled graphs on n vertices with N edges, and that each 
particular state is equally likely. Hence, Gy can be identified with a random graph 
G(n, N), and instead of finding the hitting time of of in (G,))? we try to specify 
the smallest value of N for which sf is likely to occur in G(n, N). 

One can also view a random graph process as a continuous time process. Let 
{T,}, €€ {{i, j}: i, J=1,2,...,n, i<j}, be a sequence of independent random 
variables with a common continuous distribution on [0, ©]. Each T, can be viewed 
as the time it takes to “grow’’ edge e. Let G,,(t) denote the graph with vertex set 
{1,2,...,m} and any edge e for which T,<#, t€[0, ~). Hence, we treat T, as 
the time of appearance of the edge e in a graph G,(t). Furthermore, notice that 
G,(t) can be identified with a random graph G(n, p) when p = Pr(T, <0), i.e., 
G(n, p) can be treated as a state of the process G,(t) at the time ¢. (In fact 
G(n, N) can be recovered from this process as well by taking G,(7;y), where 
Ty, denotes the Nth smallest order statistic of {7,}.) Hence, we see that random 
graphs G(n,N) and G(n, p) may provide information about the asymptotic 
properties of random graph processes, both with discrete and continuous time at 
fixed moments of the evolution. It turns out however, that in many cases G(n, N) 
and G(n, p) are asymptotically equivalent. Intuitively, one may expect that such 
equivalence should take place when the number of edges of G(n, N), is equal to 
the expected number of edges of G(n, p), i.e., when N= |(3)p] ~n *p/2. 

This natural relationship was described by Erdés and Rényi (1960) but a formal 
proof was given by Bollobas (see Bollobds 1985, Theorem 2, p. 34). Here, we 
present a slight generalization of Bollobds’ equivalence theorem due to Luczak 
(1990a). 


Theorem 2.1, Let 0< py <1, s(n) =(n’p(1 —p))'” > and w(n)— & as n> &. 
(1) Uf & is a graph property and for all NEW such that 


(5) p- oln)s(n)<N< (5) p + w(n)s(n) , 
Pr(G(n, N)E M)—> py, then Pr(G(n, p)\E 4) py - 


ao If & is a monotone property and pe =p +(n)s(n)in’, p” =p — a(n)s(n)/ 
, then the facts that 


Pr(G(n, p-)EM)—>py and Pr(Gin, p*)EA)—> py 


imply that Pr(G(n, N) © #)— py as n>, where N= |(3)p]. 
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Notice that the results for G(m, N) are sharper than results about G(n, p), since 
for any graph property , 
@) 
Pr(G(n, p) Ef) = = Pr(G(n, N) € f) Pr(e(G(n, p)) = N). 


However, G(n, N), unlike G(n, p), suffers from the fact that the edges are not 
independent of each other. Since the calculations are often easier in G(n, p), this 
model is more popular though it is less intuitive than G(n, N). From now on we 
shall study the evolution of G(n, p) as the edge occurrence probability grows from 
zero to one. The reader more used to the graph-theoretic than the probabilistic 
approach is encouraged to “translate” the results by substituting 2N/n” for p, 
whenever Theorem 2.1 allows. 


3. Evolution —- Main epochs 


The process of the evolution of a random graph (or rg for short) consists of clearly 
distinguishable epochs, decisive events and turning points. This subsection is a 
“historical sketch”, which will later serve as a guide, leading us in our “travel in 
time’’ through particular characteristic periods. 

At the beginning (p =0) our universe is empty, i.e., it consists of isolated 
vertices only. Next, for some time, we stay in a forest, where trees grow larger 
and larger. The following theorem shows how long this “forest epoch’’ lasts. 


Theorem 3.1. If np—0 as n—~«&, then G(n, p) is a.s. a forest. 


Proof. Denote by of the set of all graphs on n vertices, with at least one cycle. 
Then 


Pr(G(n, p)E f) < D Pr(X,,>0)< > EX,,, 
k=3 k=3 


where the random variable X, , counts cycles of length k in G(n, p). But, 


EX,4= (Gj) 4k —U!p* < (np) 


and the theorem follows. O 


This simple proof illustrates the usefulness of the so-called first moment method. 
This name is attributed to the immediate consequence of Markov’s inequality, 
that for a non-negative valued random variable X, Pr(X > 0) <EX. It should be 
stressed that the knowledge oi*the asymptotic behavior of the expectation of a 
given numerical characteristic of a random graph is usually of a critical impor- 
tance. A careful analysis of when the expectation tends to zero, to a constant, or 
to infinity as n— ©, often explains the typical structure of a random graph. 

Let us now take a closer look at that period of the evolution of G(n, p) when 
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np — 0. Following the observation above, consider the expectation of the number 
Y,.. of trees on k vertices, contained in G(n, p) as subgraphs. Obviously, 
EY, , =(,)k* *p*"'. Hence, by the first moment method, it follows that, for 
example, when pn’? 0, G(n, p) consists a.s. of isolated vertices only, if pn®/*—>0 
there are a.s. no trees on three vertices, and in general, G(n, p) a.s. does not 
contain trees on k vertices as subgraphs when pa@-D_, 9. Furthermore, as 
noted by Erdds and Rényi (1960), when pat'@— 5 @ then G(n, p) a.s. contains 
a tree on k vertices. However, this cannot be deduced from the behavior of the 
expectation itself and one has to employ a more sophisticated approach known as 
the second moment method (as it makes use of the second moment of a random 
variable by way of Chebyshev’s inequality). 

The ‘forest epoch” ends when np—>c, c > 0, however as long as c <1, as the 
next result shows, the structure of a random graph does not change too much, 
compared with the period when np — 0. 


Theorem 3.2. If npc, as n->%, and 0<c<1, then G(n, p) a.s. consists of 
components with at most one cycle. 


Not surprisingly, in that period of the evolution, trees dominate the structure of 
a rg because the probability that G(n, p) is a forest tends to (1—c)!’? exp(c/2 + 
c’/4). The expected number of vertices which belong to components with one 
cycle tends to a constant. Thus a.s. all but a negligible fraction of vertices are 
contained in isolated trees. Moreover, in that brief epoch, all components are 
rather small since they are O(log x) in size. 

The structure of a rg changes dramatically when the constant c exceeds 1. 


Theorem 3.3. /f np—>c, c>1, then G(n, p) a.s. consists of a unique giant 
component and small components with at most one cycle. 


The unique component is “giant” indeed, since it contains approximately 
(1 —x(c)/c + 0(1))n vertices of G(n, p), where x(c) is the only root of the 
equation xe *=ce “, satisfying 0 <x(c) <1, while the second largest component 
is of size O(log 7). 

Theorems 3.2 and 3.3 indicate that c = 1 is a critical point in the sense that, 
when np 1, G(n, p) very quickly changes from a scattered collection of small 
trees and unicyclic components to a coagulated lump of components that 
dominates the graph in size. This short period when the giant component emerges 
is called the phase transition because of its resemblance to the physical phenom- 
enon of that name. In a subsequent section, we shall provide more details about 
this turning point in the history of the evolution of G(n, p). 

After the phase transition, the giant component begins to consume the small 
components not yet attached to it, and finally G(n, p) achieves the structure 
described below. 
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Theorem 3.4. If np—~™, but np —logn——© as n+, then G(n, p) is ass. 
disconnected and all components, with the exception of the giant one, are trees. 


When we look closer into the period of evolution characterized by Theorem 
3.4, we see (a simple first moment argument) that the giant component gradually 
“swallows” the remaining tree-components in the reverse order (with respect to 
their size) in which they have been emerging in G(n, p) (when np— 0). Indeed, if 
Z,,x» K EN, denotes the number of tree-components on k vertices, then 


EZ, , a CG) KREG spore) 


and one may easily check that EZ, , > 0 as n—~© as soon as np — (logn — (k — 
1) log log n)/k—> ~. So, the asymptotic behavior of EZ, , suggests that, when 
np —logn—c, -~<c<~«, then G(n, p) a.s. consists of the giant component and 
a number of isolated vertices only. This is in fact true, which further implies that a 
tg becomes connected as soon as the last vertex ceases to be isolated. 


Theorem 3.5. If np — logn—®, as n> then G(n, p) is a.s. connected. 


Notice that the expected vertex degree in G(n, p) is equal to np. This means 
that the before connectedness (b.c.) era ends and the after connectedness (a.c.) 
era begins when each vertex has on the average, log neighbors. Also, as soon as 
np—, G(n, p) a.s. contains a triangle K,, but one must wait until the expected 
vertex degree is of the order n'’* to be almost sure that it contains K, as a 
subgraph. In general, complete graphs appear in G(n, p) gradually in a manner 
similar to trees. Namely, when pn*’“~)—>0 then G(n, p) a.s. does not contain 
K,, while for pn*'*~") +, G(n, p) has a.s. at least one subgraph isomorphic to 
K,. 

Increasing the edge probability p further, we make G(n, p) more and more 
dense to such an extent that when p is a constant (i.e., each vertex is, roughly 
speaking, joined to a positive fraction of the remaining vertices) then, for 
example, its diameter is a.s. equal to two. The process of the evolution of G(n, p) 
comes to its end when for p =1, it becomes a “‘dull” complete graph K,,. 


4. Phase transition 


We now take a closer look at the phase transition in a random graph, the most 
fascinating, but still somewhat mysterious, period of its evolution. A distant view, 
offered by Theorems 3.2 and 3.3, is quite clear: when p=c/n, O<c<1, then 
G(n, p) is composed of many small components, mainly trees, while for c > 1, it 
has one large component which dominates the whole graph. Early attempts to 
better understand the nature of the phase transition, concentrated on the analysis 
of the typical structure of G(n, p) at the critical point c= 1, i.e., when p = I/n. 
For example, Erdés and Rényi (1960) proved that for such p, the largest 
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component a.s. has size O(n”’*). Recall that its size is O(log 2) when c <1 and is 


comparable with nm for c> 1. At first glance, this suggests that there is a “double 
jump” in the order of the largest component as c passes 1. But it is only a matter 
of the lens through which we observe G(n, p) at the critical point of its evolution. 
In order to get an undistorted picture of the phase transition, we have to 
significantly magnify the neighborhood of the point c = 1, assuming that c = 1+ e, 
where ¢ = e(n)—> 0 as n->~&, Then, as we shall see from the next theorem, the 
“double jump’’ somehow vanishes. Also, while a distant view suggests the 
existence of the critical point, c = 1, the closer look offered by this theorem shows 
that we should speak about a critical “interval” c=1+e, where e°n =O(1), 
instead. 

Theorem 4.1. Let ¢—>0 but en © as n>. Then 

(i) G(n,(1-)/n) a.s. consists of components with at most one cycle. The 
largest component is a.s. a tree on (2+ 0(1))e ” log(e*n) vertices; 

(ii) Gt, (1+ €)/n) a.s. has the unique large component and a number of small 
components with at most one cycle. The giant one has (2 + 0(1))en vertices and 
(2 + 0(1))e*n edges more than vertices, while small components have less than n="? 
vertices each. 


To get an idea of how the giant component emerges in G(n, p), it is convenient 
to return to the original ‘dynamic’ interpretation of the evolutionary process. 
Therefore, let p, >p, and assume that G(n, p,) arises from G(n, p,) by adding 
new edges to the latter graph with the probability Ap =(p,—p,)/(1-—p,) 
(formally, G(m, p,) can be interpreted as the sum of G(n, p,) and G(n, Ap)). 

Following Theorem 4.1, we start to observe the process just before the critical 
interval begins, by looking at two particular instances, when p, = (1 — 2¢)/n and 
Pp, =(1—e)/n, where e°n—> x as n—>©. Then, one can prove that in G(n, p,) 
there are many components of the same size as the largest one, which, recall, 
consists roughly of « “log e’n vertices. Let us take two such components and 
insert additional edges with probability Ap ~ e/n in order to obtain G(m, p,) and 
ask about the chance that those components will merge together in G(n, p,). By 
the first moment, the probability of such an event is at most (e “log © n)’(e/n) = 
(log e*n)/(e*n) > 0 as n->~%, Thus, when p=(1-«)/n, en, the large 
components are unable to ‘‘swallow” each other and therefore are forced to hunt 
for smaller quarry. Hence large components grow absorbing only small ones and 
no clear favorite to win the race to become the giant emerges. As the edge 
probability p increases, the number of contenders for the role of giant decreases. 
When p=(1-.)/n, e°n =O(1), the probability that two specified large com- 
ponents will form a new component is bounded away from zero, but still is too 
small to ensure the creation of the unique giant component. At the same time, a 
big gap between the orders of large and small components arises which prevents a 
new large component being formed from the small ones. Next, as soon as 
p=(1+e)/n, «0 but en—, all large components a.s. merge together and 
the unique giant component emerges from “chaos” as the unquestioned victor. 
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The critical interval c=1+e, e*n =O(1), which sharply differs from other 
epochs due to its many peculiarities, is the most specific and hard-to-study period 
of the evolution. For example, this is the only time when G(n, p) may have more 
than one component which is neither a tree nor a unicyclic graph. This is also the 
only moment when the size of the largest component is “fuzzy”, (i.e., cannot be 
a.s. determined up to a factor of one, etc.). The critical interval divides the whole 
b.c. (before connectedness) epoch into two phases: subcritical - when com- 
ponents compete to be the largest; and supercritical in which the largest 
component rules. In addition to this fundamental difference, there is a striking 
similarity between the typical structure of G(n, p) in its subcritical phase and the 
structure of the part of G(n, p), in its supercritical phase, which is induced by 
vertices not belonging to the giant component. Indeed, Theorems 3.1—3.4 and 4.1 
show that in both cases all components are rather small and each contain at most 
one cycle. In fact, ties connecting those two phases are much stronger since the 
evolution of the part of G(n, p) which remains outside the giant component 
during the supercritical phase, looks like a backward replay of the evolution of 
the whole of G(n, p) during the subcritical phase. The supercritical phase, 
restricted to that part of a rg which is not absorbed by the giant component, is a 
symmetric reflection of the subcritical phase only when p = (1+ e)/n and e—>0, 
In general, roughly speaking, all structural events which take place when np—c, 
c > 1 have counterparts located in a relatively short period when np —c’, O0<c'< 
1 (in the sense, that a given c corresponds to c’ defined by c'e ©“ =ce ‘°. The 
same “relationship” connects two other epochs: The one when np— (but 
np —logn—»—) with that when np—0 (compare, for example, how tree- 
components of a given size emerge and how they disappear). One can say that 
outside the giant component, in the supercritical phase, “time” flows more slowly 
than in the whole rg during the subcritical phase. 

Let us present a result due to Luczak et al. (1994), which adds substantial 
support to the claim that the phase transition leads to a significant qualitative 
change in the structure of a random graph. 


Theorem 4.2. Let ¢ = e(n)—>0 as n-~. Then G(n, p) is 
(i) a.s. planar, when p =(1—e)/n, en—->©, 
(ii) planar with probability tending to a(A), 0<a(A)<1, as n>, when 
p=(l+te)/n, where end and —2x<A<~ is a constant; 
(iii) @.s. non-planar, when p =(1+ €)/n, en, 


Notice that planarity is a decreasing property of graphs which, by the above 
theorem, implies that G(n, p) is a.s. planar during the subcritical phase while 
after the phase transition it is ¥.s. non-planar. 

A systematic study of the phase transition was originated by Bollobés who 
revealed the mechanism of the formation of the giant component. For an 
extensive account of those results the reader is referred to chapter 6 of Bollobas’ 
(1985) monograph. The reader interested in recent developments in this direction 
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is referred to papers by Kolchin (1986), Stepanov (1987), Euczak (1990b) and, 
first of all, to a seminal paper by Janson et al. (1993). 

Obviously, investigations of the critical “interval”, when the phase transition 
takes place, requires very sophisticated and delicate tools. Janson et al. (1993) 
applied machinery of generating functions with a great precision. They were able 
to study the structure of evolving graphs (and multigraphs) when edges are added 
one at a time and at random, mainly looking at so-called excess and deficiency of 
a graph. To give the reader a taste of their results let us quote the following 
theorem. 


Theorem 4.3. The probability that a random graph of multigraph with n vertices 
and in+ O(n") edges has exactly r bicyclic components (i.e., components with 
exactly two cycles), and no components of higher cyclic order, is 


5\',/2 1 ere 
They also consider the following fascinating problem: What is the probability 
that the component which during the evolution becomes the first “‘complex” 
component (i.e., the first component with more than one cycle) is the only 
complex component which emerges during the whole process. So they ask what is 
the probability that the first bicyclic component is the “seed” for the giant one. 
They prove that it happens quite often indeed. 


Theorem 4.4. The probability that an evolving graph or multigraph on n vertices 
never has more than one complex component throughout its evolution approaches 
51/18 = 0.8727 as n>, 


5. Thresholds, threshold spectra and 0-1 laws 


Erdos and Rényi (1960) were the first to discover that when a graph evolves 
acquiring edges in a random manner, an abrupt change in its typical structure 
occurs as the order of magnitude of the number of edges allocated hits some 
critical values. In the specific case of G(n, p), this means that many properties are 
associated with critical edge probabilities, called thresholds, around which the 
probability that G(n, p) possess a given property jumps from 0 to 1 in the limit. 

Let  C @ be an increasing property of graphs on n labeled vertices. We say 
that p, =p,n) is the threshold for # if Pr(G(n, p)€ #)—>0 when p<< py 
(p =0(py)), while Pr(G(n, p)€.8)—1 if p>> py (py =0(p)). Bollobés and 
Thomason (1986) proved, in a general setting, the following existential statement. 


Theorem 5.1. Every non-trivial increasing property x has a threshold in G(n, p). 


First, consider the following basic increasing property: G(, p) has a subgraph 
isomorphic to a given graph G. Erdés and Rényi (1960) established the threshold 
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for this property under the condition that G must be balanced. This simply means 
that the density of G: d(G) = e(G)/|G|, where |G| denotes the number of vertices 
of G, is equal to its maximum subgraph density m(G) = max{d(H): H C G} (i.e., 
d(H) <d(G) for every subgraph H of G). It is not hard to check that the class of 
balanced graphs includes trees, cycles as well as unicyclic and complete graphs. 

The threshold for balanced graphs played a crucial role in the explanation of 
the nature of the evolution first given by Erdés and Rényi. Twenty years later, 
Bollobads (1981a) extended this fundamental result by, removing the condition 
that G be balanced using a sophisticated method of grading. Here we shall 
present an elementary proof of this result due to Rucinski and Vince (1986). To 
the surprise of many, their argument is similar to the one which Erdos and Rényi 
used to solve the case of balanced graphs. 


Theorem 5.2. The threshold for the property iat arg G(n, p) contains a subgraph 
isomorphic to an arbitrary graph G is hom 


Proof. For the sake of simplicity, denote “Gi, p)2G” the property that 
G(n, p) contains a subgraph isomorphic to G. Moreover, let X,(G) be the 
number of copies of G in G(n, p). To prove that Pr(G(7, p) 2 G)— 0, when 
p<<n~''"® choose a subgraph H of G such that d(H) = m(G). Then 


Pr(G(n, p) > G) <Pr(G(n, p) > H) < Pr(X,(H) > 0) 


n |H|! e(H 
<EX,(H) = (i) aan 


nll p e(H) __ 


lim(G dye) 


=(pn 


where aut(H) is the number of automorphisms of H. Hence n '’”“@ satisfies the 
first condition to be the threshold. To show that Pr(G(n, p) 2 G)—>1 when 
p>>n-"'S) the first moment argument does not suffice. Now we have to apply 


the following bound: 


Var X,(G) 


=0)< 
Pr(X,,(G) =0) {EX,(G))? 
This bound is a simple consequence of the Chebyshev inequality and is widely 
known in the literature as the second moment method (for other applications, see 
chapter 33). 
Denote by G,, G, ay: ae t= ~ (ai) |G|!/aut(G), all copies of the graph G on 
{1,2,...,n}. Let 1, ee ,t, be the indicator of the event “‘G(n, p)D 
G,” Les 


i217 if Gn, p) DG, rs 
a 0 otherwise . 
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Then 


Var X,(G) = 2 Covi 1) = 3D (EGO — (BLE); 


1st, pat 
= DD (Prd? = 1,24? = 1) — p>. 


Observe that the random variables 7°) and I‘ are independent if and only if G, 
and G, are edge-disjoint. Consequently, in those cases the covariance vanishes 
and therefore 


Var X,(G)=< Dn TIGITIAL 2G) eta) 


HCG,e(H)>0 


On the other hand EX,(G)=n'@! p“©, thus 


Pr(G(n, p)BG)=PrX,(G)=0)=O( DY nHlp-een) 
HCG,e(H)>0 
Hence, Pr(G(n, p) 7G) 0 as pn’ & since d(H) <m(G), son") is 
the threshold. O 


The above theorem shows why trees and complete graphs appear gradually in 
an evolving graph. Remember that trees and complete graphs are balanced, hence 
the threshold for a k vertex tree is n*'“~") while for a complete graph of order k 
isn“ The behavior of trees and complete graphs contrasts with that of 
unicyclic graphs whose thresholds, regardless of their sizes, are always equal to 
n 


Theorem 5.2 discusses only the asymptotic behavior of Pr(G(n, p)>G). 
However, in many random graph problems it is important to have a good estimate 
for this probability. Janson et al. (1990) were able to show that for every graph G 
with at least one edge there are constants c,, c, > 0 (depending on G only), such 
that for all p and n 


exp(—c,M)<Pr(G(n, p) DG) = exp(—c,.M), 


where M = M(n, p, G) =min{n!"'p*"?: HCG, e(H) > 0}. 

From the nature of the thresholds, it follows that if is increasing then, in the 
limit, Pr(G(#, p) € &) jumps from 0 to 1 and stays equal to 1 up to the end of the 
evolutionary process. Obviously, when a property is not monotone, its behavior 
in G(n, p) can be much more complicated. For example, let @ be the property 
that the largest component has exactly k vertices. Our current knowledge of rg 
evolution suggests that we should look then at a very early period of the 
evolution, namely at its ‘forest epoch.” Indeed, Pr(G(n, p) € #) remains equal 
to 0 until we reach the threshold n~*’“*~" for a k-vertex tree. Next it jumps to 1 
and remains unchanged as long as larger trees do not appear. This a.s. happens 
when edge probability p >> nn” “*’* and therefore, in the limit, Pr(G(n, p) € #) 
jumps again, now from 1 to 0. In general, one can easily construct properties with 
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finitely many such jumps but, in order to study when they happen, it is necessary 
to extend the notion of the threshold. 

Call a>0 a point of continuity if there is an ¢ >0 and 8 € {0,1} so that if 
n**<p<n*** then Pr(G(n, p)€ 4)—>5 as n—>~. The threshold spectrum 
Spec() is the complement (in [0, ~)) of all points of continuity. The elements of 
Spec(sf) are called points of evolutionary discontinuity. So for the property 
that G(n, p) contains a graph G as a subgraph we have Spec() = {1/m(G)}, 
while Spec(.#) = {k/(kK — 1), (k + 1)/k} when states that the largest component 
has exactly & vertices. 

Theorem 5.1 says that any increasing graph property has a threshold spectrum 
consisting of a single point. Shelah and Spencer (1988) gave a characterization of 
Spec(#) for properties « expressible in the first order theory of graphs (FO) 
which means that the fact “G €.#” can be stated in the language consisting of 
Boolean connectivities, two binary predicates: equality and adjacency, universal 
quantifiers and variables identified with vertices of G (for example, the property 
that G has no isolated vertices is expressible in FO since it can be written as 
V,4,(v <u), where < denotes adjacency). 


Theorem 5.3. If & is expressible in FO then Spec(s) consists only of rational 
numbers and is a well-ordered set under > and has order type less than w”. 


The first statement of Theorem 5.3 follows from another result of Shelah and 
Spencer, which is itself concerned with 0-1 laws, another “hotspot” in current 
research in random graphs (as well as in logic and combinatorics; see Compton 
1989). We say that edge probability p = p(n) satisfies the 0-1 /aw if for all graph 
properties of expressible in FO, Pr(G(n, p) & #) tends to 0 or 1 as n>, 


Theorem 5.4. [f a is an irrational number between 0 and 1 then p =n “ satisfies 
the 0-1 law. 


The reader may expect that, in the limit, Pr(G((n, p) © ) is bounded away 
from 0 and 1 if the edge occurrence probability p ~cp,, -»“<c< ©, where p, is 
the threshold for ». Indeed, as we shall show in subsequent sections, this 
observation does hold and therefore the 0-1 law can be satisfied as long as p falls 
“between the cracks’’ of the spectrum of threshold functions. Shelah and Spencer 
(1988) showed the following. 


Theorem 5.5 If any of the following conditions holds then p satisfies the 0-1 law. 
(i) p<<n”’, 
(ii) nO << p<<n™* ™"* for some positive integer k, 
(iii) n-'* << p<<n™ for all e >0, 
(iv) n°! << p<<n"' logn, 
(v) n! logpn<< p<<n_'** for all « >0, 
(vi) n° << p for all e>0. 
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The statement (vi) is due to Glebskii et al. and Fagin and its elegant proof, 
combining graphs, logic and probability, is presented in Spencer’s lectures on the 
probabilistic method (Spencer 1987, p. 23). 

The results established for threshold spectra and 0-1 laws are restricted to 
properties which are expressible in the first order language. The class is rather 
narrow and does not contain such important properties as, for example, connec- 
tedness, hamiltonicity and planarity. In view of Theorem 4.2, the planarity of 
G(n, p) enjoys a much sharper threshold than n™', which is the threshold 
satisfying the definition from the beginning of this section. 

The same is true in the case of two other properties mentioned above. Indeed, 
Erdés and Rényi (1959) proved the following. 


Theorem 5.6. If np —logn—c, —~<c<©, then 
Pr(G(n, p) is connected)— exp(—e ‘) asn>o. 


The threshold for a Hamiltonian cycle, a long standing open problem of Erdés 
and Rényi, was found by Bollobas (1983) and Komlés and Szemerédi (1983). 


Theorem 5.7. if np — logn —loglogn—c, —-~<c<o, then 
Pr(G(n, p) is Hamiltonian)— exp(—e ‘) . 


Hence, in both instances, a critical range in which the probability jumps from 0 
to 1 is very narrow and reduces to the case when c = c(n) and c(n) tends to ~~ or 
+, respectively. Obviously, the usual thresholds for those properties are 
identical and equal n™ ‘log n. However, notice that the threshold spectrum is even 
less informative since it extracts from the threshold its main factor (7 *) and 
ignores the secondary ones. Therefore, connectedness and Hamiltonicity (but also 
planarity) all have the spectrum {1}. 


6. Distributions 


Another important branch of the theory of random graphs with a pronounced 
probabilistic flavor is concerned with limit distributions of numerical characteris- 
tics associated with graphs. In previous sections we reduced the analysis of the 
evolution to the problem of whether a given property is typical or not. However, 
in many cases we are able to give a more detailed picture. For example, suppose 
we know that G(n, p) contains a copy of a graph G with a probability bounded 
away from 0 as n—>~. Then, one is led naturally to the following question: how 
many copies of G can be found in large G(n, p)? Clearly, this number is a 
non-negative integer valued random variable (rv), therefore the answer involves 
its asymptotic probability distribution. Most often, we encounter two types of 
asymptotic behavior of random variables (rvs) associated with random graphs: 
Poisson and normal, especially when those rvs are themselves sums of indicators 
(Bernoulli rvs). The problem is that, as a rule, those indicators are not 
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independent. Hence, generally speaking, one is forced then to prove results which 
have the form of the Poisson and central limit theorems for sums of random 
variables with a given, quite often weak, dependence structure. In random 
graphs, we deal with various modes of convergence of a sequence of rvs {X,} toa 
rv X. Usually, convergence in distribution, denoted as X,,—> X, is studied and if 
Xx, 4. X we often say instead that X, has asymptotically the same distribution as 
X. For convenience, X,,-> 0 means here that {X,,} converges to a rv degenerate at 
0. Moreover, Po(A) and N(O, 1) represent random variables having the Poisson 
distribution with the expectation A and the standard normal distribution, respec- 
tively. Finally, if X is a rv, then X =(X - EX)/(Var X)"””. 

We begin our discussion with a fairly complete answer to the following 
question: What is the asymptotic distribution of the number X,(G) of subgraphs 
of a rg G(n, p) isomorphic to a given graph G (the number of copies of G). 


Theorem 6.1. Let G be a graph with the density d(G) and the maximum subgraph 
density m(G). Then 
(i) X,(G)30 if p<<n "© 
(ii) X,(G)4 Po(a), A=c" /aut(G) if p~en , ¢>0 and G is strictly 
balanced, i.e., such that d(H) <d(G) for every proper subgraph H of G. 
(iii) ¥,(G)SN(O, 1) if p>>n "© and 1-p>>n”. 


—1/d(G) 


Sketch of proof of (ii). The assertion (ii) was proved independently by Bollobas 
(1981a) and Karonski and Rucitiski (1983). The most useful tool in proving such 
statements is the following fact, called the method of moments, which says that if 
the probability distribution of a rv X is uniquely determined by its moments and 
for every r=1,2,...,EX’—>EX’ then X,3X. To prove (ii) it is more 
convenient to consider a variant of the method of moments and to investigate 
convergence of the factorial moments of X,,(G) (see Bollobas 1985, p. 23). It is 
not hard to notice that the rth factorial moment of X,(G), E, =E,X,(G)= 
E[X,(G)(X,(G) — 1)---(X,(G)-—r+1)], r=1,2,... is equal to the expected 
number of ordered r-tuples (G,,G,,...,G,) of graphs isomorphic to G and 
contained in G(n, p) as subgraphs. Having in mind the underlying dependence 
structure (see the proof of Theorem 5.2), the usual trick is to split E, into two 
separate parts E, and E’, where E; is defined over those r-tuples which consist of 
vertex-disjoint copies of G while E; takes care of the other cases. Next, one 
shows that, under the assumptions of (ii), E’ is negligible. Indeed 


aa ie ae G){(ai) amy)” ; 


where e, is a lower bound jor the number of edges in (Jj_, G, while k= 
|U;_, G. By a purely combinatorial argument one can show that when G is 
strictly balanced then e, = kd(G) + 1/|G|. Hence 


Ey = O((mp*)'p'"7!) = O(p*"7!) = 01) . 
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To complete the proof, note that 
E/={n'*lp* saut(G)}’ , 


which implies that E,X,(G) = (1+ 0(1))a’, for p~en™''“® and r=1,2,.... It 
means that the factorial moments of X,,(G) converge to the factorial moments of 
the Poisson rv with the expectation A, which completes the proof of (ii). DO 


The statement (i) of Theorem 6.1 follows trivially from Theorem 5.2 while the 
central limit theorem given in (iii) was proved by Rucifiski (1988) also by the 
method of moments (though his proof involves more subtle combinatorial 
arguments). It is not hard to verify that (ii) holds if and only if when p ~ cn 1/4 
and G is strictly balanced (examples of strictly balanced graphs are trees, cycles 
and complete graphs). If we assume only that G is balanced, but not strictly 
balanced (for example, G is connected unicyclic graph different than a cycle), 
then it is quite hard to establish the asymptotic distribution of X,(G) and no 
“closed form” solution can be expected. Indeed even in the simplest case when G 
is balanced and has exactly one subgraph H #G such that d(H) =m(G) (for 
instance G is a cycle with a pendant vertex attached to it), the limit distribution of 
X,(G) when p~ cn ''“), is the same as the distribution of a rv £7, Y,, where 

, Y,, Y;,..., are independent Poisson rvs with EZ =c'”'/aut(H) and EY, = 
c!-#laut(H)/aut(G) for i=1,2,... (see Janson 1987). 

In 1970, Stein introduced a powerful new technique for obtaining estimates of 
the rate of convergence to the standard normal distribution. His approach was 
subsequently extended to cover convergence to the Poisson distribution by Chen 
in 1975, Finally, Barbour (1982) ingeniously adapted both methods to random 
graphs. The Stein—Chen approach has several advantages over the more popular 
method of moments. The principal advantage is that a rate of convergence is 
automatically obtained. Also the computations are often easier and fewer 
moment assumptions are required. Moreover, it frequently leads to conditions for 
convergence weaker than those obtainable by the method of moments. 

Let us consider the following mode of convergence of a sequence of non- 
negative integer-valued rvs {X,,} to the Poisson distribution. We say that {X,} is 
Poisson convergent if the total variation distance between the distribution A(X, ) 
of X, and the Poisson distribution with the expectation A, = EX,, tends to 0 as 
n—%, So, we ask when 


A(P(X,), Po(A,)) = sup [Pr(X, € A) - 5 e™|>0 
AcZt : i€A ~* 
asn—>o, Z*={0,1,...}. 
It is important to notice that if {X,} is Poisson convergent and A, 0, A,—>A 
or A, >, then X,-30, X, 3 Po(A) and Xx, 43N(0, 1), respectively. 
The original Stein method investigates the convergence to the standard normal 
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distribution in the following metric: 


4,(P(X,), NO, 1)) =sup [Al] | Ha) dx (2) - | Ae) aoe), 


the supremum is taken over all bounded test functions h with bounded derivative, 
\|A|| = sup|A(x)| + sup|h’(x)|, F, is the distribution function of X,,, while ® denotes 
the distribution function of N(0,1). One can also show that if d,(AC(X,), 
N(0, 1)) +0 as n> then X,->N(0, 1). 

The basic feature and advantage of Stein-Chen approach is that it gives 
computationally tractable upper bounds for distances d and d, even when the 
random variables in question are sums of indicators with a fairly general 
dependence structure. For example, if Z, ,=Z,(7,) is the number of k-vertex 
tree components then Z,(7,) = 2, ¢(,;* /,, where I, = 1 if the k-vertex subset a of 
the vertex set of G(m, p) spans a component being a tree and /, = 0 otherwise. 
Note that the dependence of those indicators is no longer “local” as in the case of 
X,(G) (see Theorems 5.2 and 6.1) since now all I, are pairwise dependent. The 
next theorem combines results of Barbour (1982) and Barbour et al. (1989) and 
gives a better insight into the character of the approximation obtained via the 
Stein—Chen approach. 


Theorem 6.2. If KEN, k =2, then 
(i) dP(Z,,), Po(EZ, ,)) =O(n "EZ, ,) 
(ii) d,(P(Z,4)» N@, 1)) = O(EZ, ,)""”)- 


Notice that for a fixed k, EZ, , ~n*p*~'e *””, the theorem above implies that 
Lak , 0 when p <n her ye or knp — logn — (k — 1)loglogn—, Z,, enjoys 
the Poisson distribution at both “ends” of the “lifetime interval” and Z, , has the 
standard normal distribution “in between”. Knowing the fate of tree-components 
during the evolution from previous sections, one might expect such asymptotic 
behavior of Z, ,. 

Another example of an important numerical characteristic of G(n, p) which 
acts in a similar way as Z, , is the number Y, , of vertices of degree d, dEN. 
Note that neither the property that a graph has a tree component nor the property 
that it contains a vertex of fixed degree is monotone. Although a stronger result, 
analogous to Theorem 6.2, can be proved for Y, , also (see Barbour et al. 1989), 
we shall describe its limit distribution in a weaker but more transparent form. 


Theorem 6.3. if dE N, and n—~, then 
(i) Y, 220 if p<<n tan Die 
(ii) ¥nq-2 Po(d*/d!) if p~ on atta, c>0, 
(iii) Y,, a? N(O, 1) if p>>n@*"4| and np — logn — dloglogn>-~, 
(iv) ¥, 4S Pole” “/d!) if np —logn—d loglogn—c, -~<c<, 
(v) Y, 40 if np — logn — d log logn—», 


Random graphs 369 


The reader may notice that we excluded the case k = 1 in Theorem 6.2 and the 
case d= 0 in Theorem 6.3. Obviously, Z, , = Y,,, and both rvs count simply the 
number of isolated vertices G(n, p). Denote this number by S, and observe that 
the reason why we have to treat this case separately is that the property that a 
graph has an isolated vertex is decreasing. Therefore, as we shall see, S, behaves 
in an opposite manner to X,(G), the rv related to an increasing property (see 
Theorem 6.1). 


Theorem 6.4. If n->~, then 
(i) S, 3N(0, 1) if p>>n”? and np — logn> —~, 
(ii) S, > Po(e™ ‘) if np —logn—c,c>0. 
(iii) S. 40 if np —logn&. 


The Poisson and Normal are the most common limit distributions in random 
graphs since the majority of numerical graph characteristics can be represented as 
sums of weakly dependent indicators, each with a small probability of being 
non-zero. However, especially when G(n, p) is in the phase transition period, 
other types of limit distributions occur. For example, in the case when en—, 
Kolchin (1986) showed that the number U, of unicyclic components and the 
number W, of vertices in such components in the subcritical phase have the 
following limit distributions. 


Theorem 6.5. If p =(1—)/n, ¢ =«e(n) and e’n->& as n->~ then, 

(i) U, + slog €)/(— zlog e)'? 3N(0, 1), 

(ii) ow, / 5 + I(4), where I'(B) denotes the standard gamma distribution with 
the expectation B. 


We should mention, that inside the phase transition interval, i.c., when 

=(1+ e)/n, ¢ = e(n) and e’n = O(1), the limiting results are extremely hard to 
study (see Stepanov 1987). 

Let us conclude this section with another example of the limit theorem, which 
provides the answer to the following question of Paul Erdés: “What is the length 
L,, of the first cycle in an evolving random graph?” The distribution of L,, was 
found by Janson (1987) while the expectation of L, was determined by Flajolet et 
al. (1989). 


Theorem 6.6. (i) L,>L as n-—»©, where L is a random variable with the 
distribution 


1 
P(t =1)=3 | £10)? "4 at for 1=3,4,..., 


(ii) EL, =Cn'’* + O(n'”®) where C = 2.033. 
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7. Extreme characteristics 


Another interesting class of rvs defined on a rg G(#, p) consists of extreme graph 
characteristics such as the minimum and maximum vertex degree, the length of 
the shortest or longest path between two specified vertices, the diameter and 
girth, the length of the longest cycle, the size of the smallest or largest subset of 
the vertex set having a given property, etc. From the point of view of probability 
theory, all the random variables listed above are examples of extreme order 
statistics. 

Let us consider the minimum and maximum vertex degree of G(n, p) denoted 
by 5 and A, respectively, and let X;, i=1,2,...,n, be the degree of the ith 
vertex of G(n, p). Clearly, the X; are identically distributed (but not indepen- 
dent) rvs each having the binomial distribution with parameters n—1 and p. 
Moreover, if X,;), Xj2),--- »Xjny Stand for the degrees arranged in the ascending 
order, then 6=X,,, and A= X,,, are the two extreme order statistics of the 
degree sequence. For a detailed study of the asymptotic behavior of those rvs, the 
reader is referred to chapter 3 of the book by Bollobas (1985). 

Let us concentrate on the limit distribution of the rv 6. Theorem 6.4(i) implies 
that if the edge probability p is such that pn —logn— —o then the minimum 
vertex degree of G(n, p) is a.s. equal to 0. If we raise p slightly, then the 
following result of Erdés and Rényi (1961) holds. 


Theorem 7.1. Let d=0, c,=c,(n) = np —logn—dloglogn and n>, 

(i) Pr(6 =d)—> 1—exp(—e “/d!) and Pr(5 =d + 1)—>exp(—e “/d!) if cy, 
where —w<c<em, 

(ii) Pr(6 =d +1) 1 if cz but ¢,,, 7 —@. 


Ivchenko (1973) proved that the above theorem is true in a more general 
setting, namely 6 can be replaced by X;,, i21, and exp(—e “/d!)=A_ by 
vixeo (A‘/i!) e*. When p <<logan/n then G(n, p) has many vertices of minimum 
and maximum degree. However, Bollobas (1982) showed that if p >> log n/n (but 
1-p<<logn/n) then it a.s. has a unique vertex of minimum degree and a 
unique vertex of maximum degree. In fact even more is true. One can show (see 
Palka 1987) that for a fixed i both sequences Xj,),..., Xj and Xj,-4,---»Xtay 
are a.s. strictly increasing in G(n, p). 

Recall now the well-known theorem of Whitney which states that for any graph 
G, «(G)<A(G) <6(G), where x(G) and A(G) denote the vertex and edge 
connectivity of G, respectively. Erdés and Rényi (1961) asked about the value of 
«(G) and A(G) when G is random. They showed that, at least for the edge 
probability p specified in the «sumptions of Theorem 7.1, a minimal vertex cut in 
G(n, p) is a.s. composed of the neighbors of any vertex of minimum degree. 
Bollobés and Thomason (1985) were able to prove that this holds during the 
whole evolution. Hence we have the following random graph version of Whitney’s 
theorem. 
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Theorem 7.2. Let p = p(n), 0<p<1. Then in G(n, p) 
Prix =A=8)—>1 asn—>on, 


The above result indicates a remarkable fact that in a rg trivial necessary 
conditions expressible in terms of the minimum vertex degree 6 are often a.s. 
sufficient. Indeed, the evolving random graph becomes a.s. k-connected as soon 
as the last vertex of degree k — 1 vanishes. Moreover, one can show that “8 21” 
and ‘5 > 2” are a.s. sufficient conditions for a perfect matching and Hamiltonian 
cycle, respectively (compare Theorems 5.6 and 5.7 with Theorems 6.3 and 6.4). 
In fact, the following very precise and general result, due to Bollobds and 
Thomason (1985) and Bollobas and Frieze (1985) hold. . 


Theorem 7.3. Let KEN and t(#) denote the hitting time for an increasing 
property x. Then almost every random graph process (G,)°? is such that 1((8 = 
k)=7(k =k) =7(M,), where M, is the property that a graph contains |k/2| 


disjoint Hamiltonian cycles plus a disjoint perfect matching when k is odd. 


The proof of the second equality in Theorem 7.3 is based on a sophisticated 
“coloring method ”’ of Fenner and Frieze (1983). The introduction of this tool to 
random graphs was a turning point in the study of matchings and Hamiltonian 
cycles. For a closer look at how the method works, the reader is encouraged to 
consult the paper by Frieze (1989). 

The diameter of a rg is another example of an extreme order statistic since, 
diam(G(n, p)) = Yiy, where Y,, i= 1,2,..., (2) denotes the length of a shortest 
path between the ith pair of vertices in G(n, p). Notice that the problem of the 
limit distribution of the diameter is more complicated as the Y, themselves are 
extreme order statistics. Nevertheless, Burtin (1975) was able to establish the 
asymptotic behavior of this characteristic, but only for relatively large edge 
probabilities p. 


Theorem 7.4, Let d EN, c,=p*n*'-—2logn and n—>~. Then 

(i) Pr(diam(G(n, p)) = d)— exp(—e ‘/2), Pr(diam(G(n, p) =d+1)>1- 
exp(-e °/2) if d2=2 and cy>c, —~<c<~, 

(ii) Pr(diam G((n, p)=d)—>1 if d=3 and cy but cy_,;—7-%, while 
Pr(diam(G(n, p) =2)— 1 if c, >~ but n’°(1—p)>~. 


The results of Bollobas (1981b) indicate that the diameter of G(n, p) is one of 
at most two values when p is just about large enough to guarantee that G(n, p) is 
a.s. connected. Furthermore, he shows that when np =c, and c is sufficiently 
large, then the diameter of the giant component is o(log 7). 

A related problem of the length L = L(G(n, p)) of the longest path in G(n, p), 
for a similar range of p, was considered by Ajtai et al. (1981). 
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Theorem 7.5. if np =c, then there is a function f:(1,%)—(0,1), fc) 1 as 
c—>%, such that Pr(n”'L = f(c)) 1 as n>, 


Frieze (1986), solving the more general problem of the length of the longest 
cycle in G(n, c/n), gave essentially the best possible upper bound on f(c) for large 
c. A natural generalization of these questions is: what is the size of the largest 
induced subgraph of a given type (path, tree, cycle, etc.)? 

We conclude this section with two important results concerning the size 
a = a(G(n, p)) of the largest independent set in G(n, p). Obviously, @ is yet 
another example of the extreme order statistics since a = Uj», where for i= 
1,2,...,2", U,=|V,| if V, is an independent set in G(n, p) and U; = 0 otherwise. 
The reader may expect that, as in all previous cases, the limit distribution of a 
should be concentrated in a small range of possible values. Indeed, Frieze (1990) 
gave an astonishingly precise estimate for the independence number a, when the 
edge probability p = p(n)—>0 as n—>~. 


Theorem 7.6. Let np =c and e >0 be fixed. Suppose c, <c = o(n) for sufficiently 
large fixed constant c,. Then 


nf 


asn>o, 


2. 
a(G(n, p)) ~ = (log c — log log c — log 2 + 1) <) 1 


Systematic studies of the independence number of a rg were originated by Matula 
(1972) who proved that when p is fixed, the limit distribution of a is concentrated 
on at most two possible values. 


Theorem 7.7. For any constant p, QO<p<1 and «>O0, 
Pr(|z((n, p) — €] <a(G(n, p)) < lz(n, p) + e]) 71 asnoo, 
where z(n, p) = 2 log,n — 2 log, log,n + 2 log,(e/2) + 1 and b =1/(1 — p). 


In the next section we shall find both above results very useful in determining 
the chromatic number of a rg. 


8. Coloring 


Over the last decade the chromatic number y(G(n, p)) of a rg G(n, p) has 
attracted the interest of many people (cf. chapter 4). We shall present, briefly, the 
milestones in the history of the struggle to determine y(G(n, p)), showing how 
those efforts have contributed to the most important methodological break- 
through in random graphs in recent years. Observe first that the picture before 
the phase transition is quite clear. 
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Theorem 8.1. Let n—& and xy = y(G(n, p)). Then 
(i) Pr(y =1)>1-e °”? and Pr(y = 2) ‘” if p~cn”?, 
(ii) Pr(y =2) 1 if p>>n? but p<<n' 
(iii) Pr((y = 2) @ = e7'((1 —c)/(1 + c))'* and Pr(y =3) 91-4 if p~cn™ 
0<c<1. 


’ 


Indeed, statements (i) and (ii) trivially follow from Theorems 3.1 and 6.1, 
while the proof of (iii) reduces to the determination of the probability that 
G(n, p) contains an odd cycle. The question, what is the chromatic number of 
G(n, p) when p = cn, c > 1, was raised by Erdos and Rényi (1960). They noticed 
that for such edge probabilities y 3, since G(n, p) contains an odd cycle a.s. 
(now we know that y =3 a.s. if p =1/n). For large c an answer is givén by the 
following result of Euczak (1990c). 


Theorem 8.2. Let np =c and e>0 be fixed. Suppose c, <c = 0(n) for sufficiently 
large constant c,. Then 


c c 
Pr(sioes <x (Gln, p)<0+ OTe)?! asn—>o, 


Notice that a trivial lower bound for the chromatic number of G(n, p) is n/a, 
where a = a(G(n, p)) denotes, as before, the size of the largest independent set. 
Hence, the results of Frieze and Matula concerning a (see Theorems 7.6 and 7.7) 
yield the respective lower bounds on x(G(n, p)). The hard part is to establish the 
upper bound. For a constant p, Grimmett and McDiarmid (1975) were able to 
prove that for any e>0, y(G(n, p))<(1+ €)n/log,n a.s. where b= 1/(1—p). 
They showed that a greedy algorithm, which assigns colors to vertices of a rg 
sequentially, in such a way that a vertex gets the earliest available color, a.s. 
needs, approximately, n/log,n colors to produce a proper coloring of G(n, p). 
Grimmett and McDiarmid conjectured that this algorithm uses twice as many 
colors as necessary. An important step towards a resolution was made by Matula 
(1987). He constructed a refined version of the greedy algorithm, called the 
“expose and merge” algorithm, which uses roughly (2n/3log,m) colors. Next 
Shamir and Spencer (1987) proved that y is sharply concentrated in an interval of 
length of order n'’”, and in their paper, for the first time, a new powerful 
technique based on concentration measure for martingales was applied to random 
graphs. (Let us restrict our attention to discrete rvs and recall that a sequence 
{X;: i =0} is a martingale with respect to the sequence {Y, > 0} if for all i> 1, 
E|X,|<~ and, with probability one, E(X,| ¥,,-.., ¥,_,)=X,-,, where E(-|-) 
denotes the conditional expectation.) In particular, Shamir and Spencer (1987) 
used the Azuma inequality, which states that if X), X,,...,X,, is a martingale 
and |X, — X,_,|<1 for all i, then for any positive A, Pr(X, -X, 2A) <e 7". 

The chromatic number problem, for dense graphs, was solved by Bollobas 
(1988) who showed how the potential of this large deviation theorem could be 
realized. We shall present the main ideas of his ingenious and elegant proof. 
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Theorem 8.3. Let 0<p <1 be fixed and b=1/(1-—p). Then for every e>0 


P(sieea n <x(Gin, PI<(4 7 pea se5)>1 as nao, 


Sketch of proof. It seems to be clear that to get the upper bound on y when p is 
fixed, one has to employ primarily independent sets of essentially optimal order 
(2 + o(1))log,n. 

Indeed, Bollobaés uses independent sets of the cardinality s, = |2log,n — 
8 log, log, + O(1)|, which is only a little bit smaller than the maximal order of 
the largest independent set in G(n, p) (see Theorem 7.7). Next, he sets n. = [n/ 
(logn)"} and shows that every set of nm, vertices of G(n, p) contains an 
independent set of order sg, with probability tending to one as n tends to infinity. 
To prove this last statement, one has to show that 


(np) Pr(X = 0) <2" Pr(X=0)70 asnoe, 


where X is the number of independent sets of order 5, in G(n,, p). To meet this 
requirement, Pr(X =0) has to be exponentially small, which means that the 
expectation EX should be large and the distribution of X sharply concentrated 
around EX. This is, in fact, the key moment of the proof and we shall need 
martingales to proceed further. Notice that a random graph G(m, p) can be 
viewed as a vector of indicators rvs (7,,4,,...,4,)), where Pr(/,=1)=p and 
Pr({, = 0) = 1 — p, independently of other /,. Moreover, if an rv Z = Z(G(n, p)) = 
ZX 11+ +s Xeqy)> Xi E (0, 1}, satisfies 


IZ(x,; noe »X4y) a Z(x}, vee »Xiay)h <1 
whenever (x,,...,X;,)) and (igate ,X()) differ in exactly one coordinate, then 
the same is true for the martingale 

X;=E(Z[Y,), Y,=(1,.+.54); X,=EZ, Xy=Z, 


ie., |X,—X,_,|<1. 


Unfortunately, X does not satisfy the above condition because when two 
sequences (graphs) differ.on exactly one place (edge) the number of independent 
sets can differ substantially (in our case by as much as (70-3). Therefore, 
Bollobas switches to the random variable X*, the cardinality of a maximal 
collection of independent sets of order 1 Sox NO two of them sharing more than one 
vertex in G(no, p). Then EX* = O(n3/(log ny)*) = 73 ~°() Furthermore, notice 
that a single edge can alter the size of such a family by at most one, so Azuma’s 
inequality gives 


Pr(X = 0) = Pr(X* = 0) <Pr(EX* — X* = EX*) 
< exp(—(EX*)?/(mo(rto — 1))) <exp(—ng Y) = 0(27"). 


Hence, in every subset of ng vertices of G(m, p) there is an independent set of 
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order s, a.s. Now, assign the first color to such a set and peel it off from G(n, p). 
Next choose another independent set of order s, in the remainder and again peel 
it off. Repeat the procedure until at most mo vertices remain. Obviously, 
x(G(n, p))<n/sy +n, which gives the required upper bound. 0 


Let us conclude our discussion of the chromatic number of a random graph with 
a few comments. Matula and Kucera (1990) gave an alternative proof of Theorem 
8.3, using the second moment method and “expose-and-merge” approach. We 
should remark that Luczak’s proof of Theorem 8.2 is an ingenious blend of the 
martingale and “expose-and-merge” techniques. The reader may also notice that 
the chromatic number is yet another extreme order statistic. So, we should be 
able to show that, at least for some range of p, the limit distribution of x is 
concentrated on at most two points. Theorem 8.1 shows that it is true when 
p <1/n. Shamir and Spencer (1987) proved that if p<n ‘°')*") when e>0, 
then the chromatic number takes on a.s. at most five different values. Luczak 
(1991) was able to show that for such sparse random graphs, the chromatic 
number is, as expected, a.s. two-point concentrated. 

Now assume that instead of vertices of G(n, p) we color edges. Recall that 
Vizing’s theorem says that the chromatic index y € {A, 4+ 1}. Furthermore, 
from the proof of this theorem, it follows that if y= 4+ 1 then a graph has two 
adjacent vertices of maximum degree A. Since in a rg G(n, p) such an event is 
unlikely (remember that for large p a rg G(n, p) has only a unique vertex of 
degree A), therefore we have the following result. 


Theorem 8.4. Let p = p(n), 0<p<1. Then 
Priy=A)>- 1 asn>~., 


Another interesting class of coloring problems is concerned with Ramsey 
properties of random graphs. There are only some preliminary results in this area, 
although random graphs are a quite common tool used in proofs of Ramsey 
theorems for deterministic graphs (see chapter 33). Here we ask about thresholds 
for the vertex and edge Ramsey properties: G(n, p)—>(G); and G(n, p)—>(G),, 
which formally denotes the fact that for every r-coloring of vertices (edges) an rg 
G(n, p) contains a monochromatic copy of a graph G. The vertex-coloring case is 
much easier than the edge-coloring, in contrast to the previously considered 
problem of the chromatic number and the chromatic index of G(n, p). Luczak et 
al. (1992) were able to establish a sharp threshold for the vertex Ramsey property 
for any r and G. 


Theorem 8.5. For every integer r= 2 and every graph G with at least one edge (G 
non-trivial when r=2, i.e., containing a-path on three vertices), there exist 
constants c, =c, > such that, as n>, 
-1/m*(G)} 

> 


Pr(G(n, Dn. i pear 


-1/m*(G) 


1 if p=c,n 
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where m*(G) = Maxjc¢jn\23 @€H)/(\H| — 1). 


Edge-coloring Ramsey properties of random graphs appeared to be definitely 
more challenging. In the papers by Frankl and R6dl (1986) and Luczak et al. 
(1992) the threshold was established in the restricted case when G = K, andr = 2, 
i.e., when G is a triangle and one colors the edges of a rg by two colors. Only 
very recently, using sophisticated probabilistic and graph theoretic tools such as 
Janson’s inequality and Szemerédi’s regularity lemma, Rod! and Rucinski (1993, 
1995) found the threshold for any graph G. 


Theorem 8.6. For every integer r= 2 and every graph G which is not a star forest, 
there exist constants c,=c,>0 such that, as n>, 


0 if pxcn "© 


Pr(G(n, p)—> @y-{ 


1 if pecn ©, 


where m**(G) = maxycg jyj23(@(A) — 1)/(|H — 2). 


The results discussed above paved the way for further research on similar 
properties of other random discrete structures, such as random hypergraphs, 
random subsets of the set of integers, etc. 


Appendix A. Quasi-random graphs 


As has been noted earlier, there is a large collection of properties which the 
random graph G = G(n, 1/2) a.s. has. Some of these are as follows. 

P,(t): For a fixed > 4, each t-vertex graph H(t) occurs (1 + o(1))n‘/2™ times 
as an (ordered) induced subgraph of G. 

P,: G has at least (1 + 0(1))n7/4 edges and at most (1 + 0(1))n*/16 4-cycles. 

P;: y vevic) | |\nd(u) MN nd(v)| — n/4| =0(n°) where nd(x) denotes the neigh- 
borhood of vertex x in G, i.e., the set of all y adjacent to x. 

P,: For all $ CV(G) with |S| = [n/2], S spans (1 + 0(1))n7/16 edges in G. 

Ps: G has (1+0(1))n’/4 edges; A,\(G)=(1+0(1))n/2 and A,(G)=o0(”), 
where A,(G), A,(G),... denote the eigenvalues of the adjacency matrix of G 
with |A,(G)| > |A,(G)|> +++ 

It turns out that these properties are part of a rather large equivalence class of 
properties, termed quasi-random properties in Chung et al. (1989) (see also 
Thomason 1987a). Any graphs G = G(n) satisfying any one of the quasi-random 
properties must in fact satisfy all of the quasi-random properties. This is perhaps 
somewhat surprising since these properties appear to have rather different 
strengths, e.g., P, looks a lot weaker than P,(1000). Moreover, since it is often 
easy to prove that a particular class of graphs satisfies a certain quasi-random 
property then the graphs in this class must possess all the quasi-random properties 
and so, behave like random graphs in many ways. For example, since it is trivial 
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to show that the Paley graph P(p) with vertex set Z/p (where p is a prime = 1 
(mod 4)) and edges {i, j} for i—j a quadratic residue modulo p, satisfies P,, then 
in fact it must contain (1 + 0(1))p’’/2** induced copies of the Petersen graph, etc. 
Similar remarks apply to the even intersection graph I(n) with all XC 
{1,2,...,m} as vertices, and all {X, Y} with |X M Y| =0 (mod 2) as edges. 

In fact, a more quantitative form of the theory is available which is based on 
the concept of the deviation of a graph G=G(n), denoted by devG. This is 
defined to be 


1 
devG=—; > H(V,, V2) H (02, Vz )H (V3, U4) (V4, 01), 


TM v4, 7,053.04 
where for vertices x, y of G, 


_f-1 if {x, y} isan edge of G , 
w(x, ¥) = ti otherwise . 


Quasi-randomness is equivalent to devG going to 0 (as n->~), and so, all 
quasi-random properties can be expressed in terms of dev G (see Chung and 
Graham 1991). For example it can be shown in the case of property P, that for 
any graph G with 7 vertices and e(G) edges: 

(i) e(G) = (n?/4)(1 - (dev G)'"*), 

(ii) G has at most (n*/16)(1 + (dev G)'*)* 4-cycles. 

The theory of quasi-randomness can be carried over to hypergraphs as well (see 
Chung and Graham 1990, 1991 for details). It turns out that the following 
“threshold” result holds for k-uniform hypergraphs (= k-graph). If G“(n) is a 
k-graph which contains all 2k-vertex k-graphs as (ordered) induced sub-k- graphs 
asymptotically the same number of times (i.e., (1 + o(1))n*/ 2) times) then for 
any fixed t, G“(n) will contain any t-vertex k-graph (1 + 0(1))n'/2™ times as an 
induced sub-k-graph. The value 2k is sharp here; the corresponding statement is 
not true if 2k is replaced by 2k — 1. 

In fact, this philosophy can be applied to a large class of structures (e.g., 
integer sequences, (0, 1)-matrices, tournaments, directed graphs, etc.) although 
many of the details still remain to be worked out. We remark that the threads of 
quasi-randomness can be traced back to early work of Wilson (1972, 1974) on 
block designs, Erdés and Sés (1982) in Ramsey—Turdn problems for graphs and 
hypergraphs, Rédl (1986) and Graham and Spencer (1971) (both on certain 
universality properties of graphs) and more recently Thomason (1987a,b, 1989), 
Haviland (1989) and Haviland and Thomason (1989), who deal with a related 
concept called (p, a)-jumbledness. 
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1. Hypergraphs and set systems 


Hypergraph theory is a part of the general study of combinatorial properties of 
(finite) families of (finite) sets. Hypergraphs are systems of sets which are 
conceived as natural extensions of graphs: elements correspond to nodes, sets 
correspond to edges which are allowed to connect more than two nodes. Usually, 
no distinction is made between isomorphic set systems; for instance, the complete 
r-uniform hypergraph on n nodes, denoted by Kj, is the pair consisting of a 
ground n-set and the collection of all its r-subsets. 


1.1. Historical background 


Influenced by the axiomatic tendencies of the beginning of the 20th centiry, and 
faced with the fast-growing economical interest of discrete mathematics, com- 
binatorialists of the 1960s were urged to develop a systematic combinatorial 
approach of finite sets. 

Earlier fundamental theorems, already known in the 1930s, like the Sperner 
theorem (in 1928), the Ramsey theorem (in 1930) and the Marriage theorem of 
Konig [from Frobenius in 1912... to Hall in 1935: see chapter 3 and Lovasz and 
Plummer (1986)], took progressively their place, with the efforts of the Hun- 
garian school, into a larger field of investigations focusing on “set systems’’; 
chapters 24, 25 and 33 are mostly illustrative of this approach. 

During the same period, set systems also appeared to the French school as 
efficient generalizations of graphs. A theory of hypergraphs would extend, 
simplify, and unify the results of the theory of graphs; in return, the theory of 
graphs would supply a convenient language for a theory of set systems. With the 
term “hypergraph”, suggested by Berge at the Tihany Colloquium (in 1966), and 
with the forerunning paper of Erdés and Hajnal “On the chromatic number of 
graphs and set systems” (1966), the premises of a hypergraph theory were settled. 

The reader interested in the first steps of the theory is referred to Berge 
(1970, 1973, 1976), Berge and Ray-Chaudhuri (1974), and Zykov (1974). Recent 
general references are Lovdsz (1979), and Berge’s textbook (1989). Packing and 
covering problems for hypergraphs are extensively treated in Schrijver (1979a,b), 
and in a survey by Firedi (1988). 


1.2. Hypergraphs nowadays 


Inheriting the usual terminology, and hence the usual tools, of graph theory, 
hypergraph theory gains two major advantages: (1) hypergraph structure is 
preserved under the basic set-theoretic operations, and (2) in a set system the 
elements (= the nodes or the vertices) and the sets (the edges) play dual roles. 
The generalization to set systems of graph parameters (such as 
A, w,a,t, p,v, x,y), Constitute, at the present time, the distinctive feature of 
hypergraph theory. Actually, they have led to new fields of investigation, which, 
more powerful and more unified than the original ones, are of independent 
interest. 
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Some outstanding graph-theoretic results have been obtained via hypergraphs. 
A celebrated example is Lovasz’s use of “normal hypergraphs” to solve Berge’s 
“weak perfect graph conjecture”: the complement of a perfect graph is perfect 
(see Theorem 3.19 and chapter 4). Hypergraphs also appear explicitly in 
constructions of Ramsey graphs and graphs with large chromatic number and 
girth (see section 5). More recently, Robertson and Seymour proved a conjecture 
by Nash-Williams on “graph immersions” by the means of an extension to 
hypergraphs of their “Graph minor theorem” (see Theorem 2.6 and chapter 5). 

Hypergraphs also turn out to be remarkably efficient in discrete optimization 
problems: offering a convenient setting to numerous minimax-type theorems (see 
section 3), they lead to a pertinent formulation of general inequalities on packing 
and covering parameters (section 4). 


1.3. Structural-type properties 


Section 2 introduces the reader to difficulties and advantages in generalizing graph 
problems to hypergraphs. There, some hypergraph analogues of important graph- 
theoretic results involving basic notions are mentioned: degrees, cycles, edge 
contractions, cliques, etc. An undeniable success is the above mentioned result on 
hypergraph embeddings. A typical difficulty arises with edge intersection prob- 
lems: a (connected, simple) graph on more than three nodes is fully determined 
by edge-to-edge incidences but a hypergraph is not. The point is that families of 
pairwise intersecting edges in a hypergraph can have a very intricate structure. 
Even for simplicial complexes (hypergraphs in which every nonempty subset of an 
edge is an edge), the maximum size of an intersecting family of edges is unknown 
(see Chvatal’s conjecture 2.18 and chapter 24). Among possible simplifying 
hypotheses, the Helly property (i.e., “pairwise intersecting edges share a common 
node”) provides an efficient way of investigating graphs via set systems. 


1.4. Hypergraphs and min-max relations 


A variety of hypergraph classes is presented in section 3. Those set systems 
generalize either trees or bipartite graphs; they frequently arise in the polyhedral 
approach of Combinatorial Optimization problems (cf. chapters 3 and 30): certain 
hypotheses — embeddability in a highly structured set, coloring properties, forbid- 
den configurations — imply general min—max type relations, namely when assump- 
tions warrant the integrability of optima in some related linear programs. The 
interest in “‘tree-like” hypergraphs (e.g., arboreal hypergraphs, totally balanced 
hypergraphs, ...) has been largely motivated by greedy heuristics: the idea is to 
handle a structure which is preserved under some elimination scheme (like 
successive removals of leaves of a tree). 

Balanced hypergraphs and farther generalizations of bipartite graphs (such as 
normal hypergraphs, Mengerian hypergraphs) share the K6nig property: the 
minimum number of nodes needed to cover all the edges (i.e., the blocking 
number) equals the maximum number of pairwise disjoint edges (i.e., the 
matching number). Actually, numerous min-max type properties amount to say 


Hypergraphs 385 


that a certain hypergraph (such as paths or cliques of a graph, pieces of a pattern, 
intervals, chains of a partially ordered set, ...) possesses the K6nig property. 


1.5. Packing and covering 


The world of matching and blocking numbers (denoted by v and 7) forms the 
cornerstone of discrete packing and covering problems, discussed in section 4. 
This field is the most active part of the theory of hypergraphs: almost all 
combinatorial optimization problems can indeed be reformulated as the de- 
termination of 7 or v in a certain hypergraph. 

Deep results in this area, inequalities or estimates, have been obtained by a 
powerful combination of classical “hypergraph-style” reasonings (extensions of 
K6nig’s theorem, averaging methods, set theoretic operations and‘ incidence 
properties) with linear algebra, probabilistic methods and polyhedral com- 
binatorics. For instance, with the linear programming approach of optimization 
problems, parameters 7 and v are viewed as integral optima of dual linear 
programs whose rational optima coincide. Extremal situations frequently involve 
block designs. 


1.6. Colorings 


Hypergraph coloring problems constitute, because of their intrinsic difficulty, a 
very attractive subject, with natural connections with other fields (e.g., Ramsey 
theory): less unity, more invention. The central question, treated in section 5, is 
to find the chromatic number of a set system: how many colors do we need in 
order to color the nodes so that at least two different colors appear in every edge 
of cardinality 2 or more. 

As a clear demonstration of ‘‘French style” incidence methods, we give in full 
the proof of a bicolorability theorem, 5.8, showing that a nonbicolorable 
hypergraph must contain a special type of odd cycles. 

For uniform hypergraphs, the best upper bounds to date on the chromatic 
number (in term, for instance, of their size) were obtained by “Hungarian style” 
probabilistic methods (e.g., see Theorem 5.17). 

A famous coloring problem, originating with Kneser’s conjecture in 1955, was 
to determine the minimum number of classes in which the A-subsets of a given 
n-set can be partitioned such that no class contains ¢ pairwise disjoint members. A 
general solution has recently been obtained with the help of topological methods, 
inspired by Lovasz’s solution of Kneser’s conjecture. See Theorem 5.24. 

Some partitioning problems, occurring in design theory and in discrepancy 
theory (cf. chapters 14 and 26), as the “factorization problem” for complete 
uniform hypergraphs, have been successfully treated by hypergraph methods on 
edge colorings: see Theorem 5.29 and Corollaries 5.30-5.32. 


1.7. General terminology 


(1.1) Let V= {v,,...,u,} be a finite set. A hypergraph on V is a pair # = 
(V, &), where @ is a family (E,),-, of subsets of V. Elements of V (respectively 
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members of @) are called the nodes (or vertices) of # (respectively the edges of 
#). The numbers n and |/| are frequently referred to as the order and the size of 
#, respectively. Notice that repeated edges, empty edges and isolated nodes (= in 
no edge) are permitted. One-element edges are called loops. 

In fact, when # is defined as a family of sets, # =(E,);<,;, or by the list of its 
edges, H=(E,,E,,...,E,,), we implicitly assume (in accordance with Berge 
1989) that the vertex set V(#) is the union of the edges. 

(1.2) We may view a general hypergraph on V as a multiset of V-subsets: every 
V-subset appears in # with a certain multiplicity, a nonedge corresponding to 
multiplicity 0. 

Hypergraphs correspond as well to 0-1 matrices. With hypergraph # is 
associated in a canonical way its edge—vertex incidence matrix M(d#): rows 
correspond to edges, columns correspond to vertices; entry m,; , equals 1 or 0, 
depending on whether £, contains v, or not. The pairs [v, i] such that v € E,, form 
the edges of the bipartite incidence graph of #. 

(1.3) The dual of #, denoted by # ', corresponds to the transposed matrix. We 
have #° = (I, (#(v)),<y), where H#(v), the star of v, denotes the set of all iE J 
such that vu € E;. 

(1.4) The letter # refers, if no other specification is made, to the generic 
hypergraph H=(V,€), with €=(E£,),<-;; n and m denote its order and size, 
respectively. Note that, in general, n and | U, E,| may be different. 

(1.5) The image $(#) of # by a mapping ¢: VW is the hypergraph (¢(V), 
($(E,)):c,). If @ is one-to-one, hypergraphs # and $(3) are isomorphic (we 
write #=(H#)). Throughout the chapter, we do not distinguish between 
isomorphic hypergraphs, unless otherwise specified. 

(1.6) The product (direct product in Lovasz (1979)) of hypergraphs # = 
(V,(E;);<,) and H=(W, (F,)jes) 18 the hypergraph # @H:=(V x W,(E, x 
Fu iyerxs) 

(1.7) Subhypergraphs correspond to submatrices: a hypergraph X’ = (V’, @') is a 
Subhypergraph of #=(V,@) if V’ CV and if every H’-edge is the intersection 
with V’ of some #-edge. A subhypergraph may have empty edges or isolated 
nodes. Let ACV and JCJ. The subhypergraph of # induced by A [or the 
restriction of # to A, Lovasz (1979)] is the hypergraph #[A] = (A, (E,;N A: 
E;7 A#¥$),-;) [an alternative notation is %,, Berge (1973, 1989)]. Notice that an 
induced hypergraph has no empty edge. The notation #[V\x] is abridged into 
#\x. A partial (sub)hypergraph of # is a hypergraph that corresponds to a 
subfamily of edges. A partial hypergraph has no isolated vertex. The family 
(E,),¢, constitutes the partial hypergraph determined by J (or generated by J, 
Berge 1989). The partial hypergraph determined by /\{i} is denoted by #\E,. 

(1.8) The partial hypergraph formed with all edges of # that are contained in A 
is called the trace of # by A, denoted by HA] (the term section was used in 
Berge 1989, with notation # x A). 

(1.9) The k-section of # (or the kth shadow, cf. chapter 24), denoted by %,,, is 
the hypergraph on V(#) formed with all k-sets F such that F C E for some edge 
E of &. 
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(1.10) The line-graph L(#) of # [or representative graph (Berge 1989), intersec- 
tion graph] of # is the 2-section of #": nodes of L(#) are H#-edges, edges of 
L(#) are pairs of intersecting cAeagee 

(1.11) A sequence o =v,E,v,E,---v,E,u,,, (where the E; are distinct #-edges, 
the vu, are distinct #-vertices for 1<i<p such that v,;,u,,,€ £, for 1<i<p) is 
called a path of length p (joining v, to v,,,) in # if u,,, is different from the 
other v,’s. The hypergraph # is connected if any two vertices are joined by a 
path. If v,,, =v,, the sequence o is a cycle of length p in #. lt is sometimes 
useful to consider a more restrictive notion: if, in addition, the sets E,,,\E, (with 
p+1=1) form a partition of E£,UE,U---UE,, the cycle will be called a 
hypercycle. A graph-theoretic cycle will be referred to as a polygon. 

(1.12) # is simple if there are no repeated edges. If, moreover E, Z E; for i ¥j, 
the hypergraph # is called a clutter or an antichain or a Sperner system. # is 
separating if #" is a clutter or, equivalently, if, for every node v, the intersection 
of all #-edges containing v is {v}. 

(1.13) The rank (respectively corank) of # is the maximum (minimum) cardinali- 
ty of an edge. # is r-uniform if |E,| =r for all i. An r-graph is a simple r-uniform 
hypergraph. 

(1.14) # is linear if |E, N E,| <1 for ij. 

(1.15) # is intersecting if its edges pairwise meet. For positive integers k, t, with 
k22, # is k-wise t-intersecting whenever any k distinct edges have at least ¢ 
common nodes. If t=1, then ¢ is omitted. Intersecting partial hypergraphs of # 
(or intersecting families, Berge 1989) are referred to as “cliques” of # by some 
authors (compare with (1.27)). 

(1.16) # is a star if all its edges have a common node. The star centered at a node 
v, denoted by #(v), is the family of all #-edges that contain v. 

(1.17) # has the Helly property if every intersecting family of edges forms a star. 
If #' has the Helly property, # is said to be conformal. 

(1.18) The degree in # of a node v, denoted by deg,(v), is the size of the star 
Hv). The size of the largest linear star containing v is the linear-degree of v. The 
maximum degree of #-nodes (respectively the maximum linear-degree) is 
denoted by A(#) (A,(#)). 

(1.19) A set SCV(#) is stable (respectively strongly stable) if no edge is 
contained in S (respectively if |S M E|<1 for every edge E). A hypergraph is 
r-partite if its vertex set can be partitioned into r strongly stable sets. The stability 
number a(#) is the maximum cardinality of a stable set. The maximum 
aad of a strongly stable set is denoted by a,.)(#). We have a.\(#) = 
a(#,,,) 

(1. 20) A set BCV(#) is a blocking set of H [or a node-cover, Lovasz (1979), 
Fiiredi (1988), or a transversal, Berge (1973, 1989)] if it meets every edge. 
Inclusion-minimal blocking sets are the edges of the blocker of #, denoted by 
Bl(#). The minimum cardinality of a blocking set is referred to as the blocking 
number and is denoted by 7(#). Alternative terms and notation are transversal 
hypergraph Tr(#) and transversal number, Berge (1989), or node-covering 
number, Lovasz (1979), Fiiredi (1988). 
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(1.21) An edge-cover of # is a node-cover of #'. The edge-covering number 
p(3#) is the minimum size of an edge-cover. 
(1.22) A matching of # is a set of pairwise disjoint #-edges. The matching 
number v(#) is the maximum number of edges in a matching. 
(1.23) A fractional blocking set of # (or a fractional (node-)cover, a fractional 
transversal) is a weight function p: V->R, such that ,¢, p(v)>1 for every 
edge E. Dually a fractional matching is a function g:/—>R, such that 
Diver q(i) <1 for every vertex v. The numbers Y1,<y p(v) and ee q(i) are the 
respective weights of p and q. The fractional blocking number +*(#H) is the 
maximum weight of a fractional transversal. It turns out (see section 3) that 
1*(#) coincides with the fractional matching number v*(#), i.e., the minimum 
weight of a fractional matching. 
(1.24) A (proper) k-coloring of # (or k-coloration, Lovasz 1979) is an assign- 
ment of “colors” to nodes so that (i) each node receives exactly one color, (ii) at 
most & colors are used, and (iii) no nontrivial edge is monochrome (thus loops 
and empty edges do not play any role). The chromatic number x(#) is the 
smallest integer k such that # admits a k-coloring. If y(#) =k (respectively 
x(H) <k, x(#) <2), then # is said to be k-chromatic (k-colorable, bicolorable). 
(1.25) A (proper) edge-coloring of # is an assignment of colors to #-edges so 
that distinct intersecting edges receive different colors. The chromatic index y(#H) 
(or edge-coloring number) is the smallest number of colors needed. Note that 
WH) = x(L(#)). 
(1.26) A hereditary hypergraph is a simplicial complex, i.e., a simple hypergraph 
in which every nonempty subset of an edge is an edge. The hereditary closure of 
H#, denoted by &, is the smallest hereditary hypergraph containing #. 
(1.27) If a hypergraph # is endowed with a partition V,,V,,....V, of its node 
set, we say X is partitioned (or p-partitioned to be more precise). Given a 
mapping M:N?—N and integers n, = |V,|, the complete p-partitioned hypergraph 
Ke oe is the multiset where every subset E of V occurs with multiplicity 
M(ENV,|, IENV,|, . value 1 for a single p-tuple 
(r,,7,,--+,7,) and is 0 otherwise, the standard notation is K* te "zee , abridged 
to K*.. ee if 7, =r, =---=r,=1 (complete p-partite p- graph) and to K’ if 
=1 (complete r-graph of order n). A clique of an r-graph G is any subset A of 
wees such that G[A]= K/,). 


1.8. Notation 


EE€H  E is an edge of #&. 

HCH HK is a partial subhypergraph of #. 
V(#) vertex set. 

M(#) mxXn edge-vertex insidence matrix. 
al dual of #. 

o(#) image hypergraph of # under ¢. 
# =H isomorphic hypergraphs. 

HOH product hypergraph of # and %. 
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degy(x) degree of node x in #. 

A(#) maximum degree. 

A(#) maximum linear-degree. 

#[A] — subhypergraph of # induced by A (3, in Berge 1989, Lovasz 1979). 
XA] trace of # by A. 

(x) star centered at node x. 

HVE partial subhypergraph obtained after deletion of the edge E. 

HX induced subhypergraph obtained by deletion of x: #\x = #[V\x]. 
Hex k-section of #. 

L(#) __ line-graph of #. 

a(#) stability number. 

@2,(#) maximum cardinality of a strongly-stable set. . 
t(H) blocking number (or transversal number, node-covering number). 
v(#) matching number. 

p(x) edge-covering number. 

x(#) chromatic number. 

y(#) chromatic index (or edge-coloring number). 

K’ complete r-graph of order n. 


2. Hypergraphs versus graphs 


This section focuses on results which are directly inspired by graph theory and 
deals essentially with edge—-node incidences. The generalizations concerning 
cycles (cyclomatic number, acyclicity) or hypergraph minors will sound familiar to 
graph theorists. They will be probably more puzzled by innocent looking 
questions involving degree sequences, line-graphs or intersecting families. A way 
of modelizing graphs in hypergraph theory is shown in section 2.5, where the role 
of the Helly property is emphasized. 


2.1. Trees, cycles and their relatives 


Hypergraphs with no cycles constitute an obvious generalization of the concept of 
a tree. As easily seen, a necessary and sufficient condition for a hypergraph 
# = (E,),<, to be cycle-free is that, for every nonempty subset J C /, the following 
inequality holds: 


> 2 IE|- JU (2.1) 


More generally, Acharya and Las Vergnas (1982) have extended to hypergraphs 
the notion of a cyclomatic number by taking into account the cardinality of the 


intersections E,M E;. Let us assign to every edge {E,, E, ;} of the line-graph L(#) 
the weight |E, E,| ‘and denote by w(#) the maximum weight of a forest in L(#). 
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Then the parameter 


— w(#) (2.2) 


w(et)= D1 |- | 


is nonnegative. It is named the cyclomatic number of #. Hypergraphs with 
cyclomatic number 0 are called acyclic hypergraphs. An inequality, proved by 
Lovasz in 1968, shows that hypergraphs with no cycle of length = 3 are acyclic. 
More generally, we have the following characterization. 


Theorem 2.3 (Acharya and Las Vergnas 1982). A hypergraph # is acyclic if and 
only if # is conformal and for every cycle of length 23 in #, some edge of # 
contains 3 vertices of that cycle. 


Equivalently, the edges of an acyclic clutter represent the maximal cliques of 
some chordal graph (i.e., a graph in which every cycle of length > 4 has a chord, 
see chapter 4). The edges of an acyclic hypergraph can be represented by the 
nodes of some tree T such that the star of every node induces a subtree of T (sce 
Theorem 3.8). 

(2.4) Let a=1,b be fixed integers. Given an arbitrary hypergraph # = (E;),¢;, 
the set function f: 2’ Z defined by f(X)=al U,cy E,|+ 6 for X CJ is sub- 
modular. Such functions induce matroidal structures (see chapters 9 and 11 for 
details) that generalizes graphic matroids (the case a = 1, b = —1, |E,| =2). Then 
standard methods of matroid theory lead to the following generalization of (2.1). 


Theorem 2.5 (Las Vergnas 1970, see Berge 1973). Let r2=2 be an integer and 
(E,),c; be @ hypergraph. There exists a cycle-free r-graph #' =(E;),<, such that 
ECE, for every i€ I if and only if one has 


lu é| >(r—-1)|J| forevery JCI. 
jes 


This yields a sufficient condition for a hypergraph to admit a partition of its 
vertex-set into r blocking sets [a cycle-free hypergraph is balanced, hence 
Theorem 3.5 (vi) applies; for r = 2 cf. Theorem 5.12]. 


2.2. Hypergraph minors 


A graph G’ is a minor of a graph G if (up to isomorphism) G’ is obtained from G 
by successive node-deletions, edge-deletions and edge-contractions (in any order). 
In a series of outstanding papers, Robertson and Seymour proved the following 
conjecture of Wagner: In every Ynfinite sequence of (finite) graphs there is a graph 
which is a minor of another (see chapter 5). Robertson and Seymour recently 
obtained a generalization of their result to hypergraphs. 

Without entering into details, let us say that a hypergraph is a minor of another 
hypergraph # if it arises from #H as the result of successive elementary 
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operations, performed in any order. Elementary operations include the deletion 
of a node, of an edge, the identification of two nodes of an edge (elementary 
‘‘collapsing”), and the replacement of an edge by any subset of this edge 
(‘‘shrinking’”’). 


Theorem 2.6 (Robertson and Seymour 1987). Let ¥ be a family of hypergraphs 
such that no hypergraph in ¥ is isomorphic to a minor of another hypergraph in 
¥. Then F is finite. 


Such a generalization was highly motivated by its application to the following 
conjecture of Nash-Williams on graph embeddings (see chapter 5 for details and 
references): Any infinite list of graphs contains a graph that can be imrhersed in 
another graph of the list. Here, an immersion of a graph G into a graph H consists 
of a pair (a, 8) of mappings with the following properties: 

(a) a: V(G)— V(H) is one-to-one. 

(b) To every edge e = {x, y} of G corresponds a path B(e) of H with endpoints 
a(x) and a(y). 

(c) Paths of the form P(e) are pairwise edge-disjoint. 

A proof of the Nash-Williams conjecture is promptly derived from Theorem 
2.6, by considering hypergraph duals of the graphs of the list. 


2.3. Degree sequences and edge cardinalities 


(2.7) Degree-sequences of graphs are easily characterized by a system of 
inequalities (see chapter 1). From the fundamental theorem on the existence of a 
compatible integral flow in a network (see chapter 2) follows a generalization to 
hypergraphs. 


Theorem 2.8 (Gale 1957, Ryser 1957; see Berge 1989). Given n + m nonnegative 
integers 7, 2r,2:::2r,, d,2d,2:::2d,, there exists a hypergraph 
(E,,E,,...,E,,) on a set {x,,x,,...,X,} such that |E,|=r, for 1\<j<m and 
deg,(x;)=d; for 1<i<n if and only if 


k k 
24,<D 7% fork<n, (1) 


Sd=Sr-Sy,. (2) 


The Kruskal—Katona theorem (see chapter 24) yields a characterization of 
sequences (|F,|);-,., for simplicial complexes and, with little more effort, for 
clutters (Clements, Daykin, Godfrey, Hilton) and intersecting clutters of rank at 
most in (Greene, Katona, Kleitman): see chapter 24 for details and references. 
Characterizations of degree sequences have been obtained for certain finite set 
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systems arising in combinatorial convex geometry: face complexes of simple 
polytopes (Stanley, Billera and Lee); finite families of convex sets in R* (Kalai): 
see chapters 34 and 18. 


Open problem 2.9 (Berge 1989). Determine, for r23, the possible degree- 
sequences of r-graphs (= simple r-uniform hypergraphs). 


2.4. Line-graphs and edge intersections 


To what extent is the structure of a hypergraph reflected by its line-graph or more 
generally by a quantitative informative information on edge intersections? 
(2.10) Every graph G is the line-graph of some hypergraph (see (2.17) for a 
canonical construction). Let 2(G) denote the smallest order of a hypergraph with 
line-graph G. If G has a vertices, the edges of G can be covered by at most [4n7] 
cliques (Erdés, Goodman, Posa, see chapter 1); hence 2(G) <[ja 7]. The exact 
value of 2(G) can be interpreted as the chromatic number of some associated 
graph, Berge (1989). 

(2.11) While the line-graphs of graphs and multigraphs can be characterized by a 
finite number of forbidden induced subgraphs (see chapter 1), no such characteri- 
zation is possible for the line-graphs of r-uniform hypergraphs for any r= 3. 
Constructions of infinitely many forbidden subgraphs are known in the case r =3 
(Nickel 1973 (unpublished), Gardner 1977 (unpublished), Bermond et al. 1977, 
Berge 1989.) Nevertheless, Naik et al. (1982) obtained the following result. 


Theorem 2.12. For any integer r = 3, there exists a smallest integer f(r) such that, 
among the graphs of minimum degree = f(r), the line-graphs of linear r-graph can 
be characterized by a finite list of forbidden induced subgraphs. Moreover, 
f(r) = O°’) and f(3) <69. 


(2.13) In 1932, Whitney proved that the “claw” K, , and the triangle K, are the 
only nonisomorphic connected simple graphs with isomorphic line-graphs. More- 
over, if G and G’ are connected simple graphs of order greater than 4, any 
isomorphism L(G)— L(G‘) comes from an underlying isomorphism G—G’. 
First generalizations of Whitney’s theorem were obtained by Berge and Rado who 
constructed, for every integer p >2, a pair of “nearly isomorphic” hypergraphs 
&, and 2, as follows. Let P=1,..., p. Denote by B (respectively D) the set of 
P-subsets with even (odd) cardinality. Define @, and @, as the dual hypergraphs 
of (P, B) and (P, D), respectively. The next theorem is a further generalization of 
Berge and Rado’s results. 


Theorem 2.14 (Fournier 1984+ Gardner 1984). Let H=(V,E,,...,£,,) and 
H' =(V,E},...,E,) be two hypergraphs with the same vertex set and the same 
size. Suppose p is an integer such that both # and #' have rank less than 2°"'. 
Then there exists an isomorphism ¢:v—>v' mapping every E; to E; if and only if 
the following two conditions are fulfilled: 
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(1) For all JC (1,...,m} with |J|<p, we have |M ic, E;| =|Nie, E'l- 
(2) # and X' do not contain subhypergraphs H,{X] and #,{X'] (with |J| =p 
and |X|=|X'|=2°~'), respectively isomorphic to 8, and @,, or vice-versa. 


Theorem 2.14 implies that certain hypergraphs (E;),-, are determined (up to 

isomorphism) by their “intersection matrix” (i.e., with entries |E,M E,|). Interval 
hypergraphs and, more generally, totally balanced hypergraphs (cf. section 3) do 
have that property. 
(2.15) The famous Ulam-Kelly graph reconstruction conjecture (see chapter 27) 
can be extended to hypergraphs in an obvious way. Within the context of 
hypergraphs, edge-reconstruction and vertex reconstruction become dual prob- 
lems. The example of pairs 2, and %,, as defined above, shows that additional 
hypotheses are necessary to imply the reconstructibility of hypergraphs. More 
sophisticated nonreconstructible hypergraphs are known. 


2.5. Helly property 


A family of sets has the k-Helly property (we omit k for k =2) if every k-wise 
intersecting finite subfamily is a star. Helly’s celebrated theorem asserts that 
convex sets in R*~’ have the k-Helly property. Actually, the Helly property 
occurs in numerous mathematical fields. In arithmetics, the “Chinese remainder 
theorem” amounts to say that arithmetical progressions have the Helly property. 
Other classical examples of Helly families are families of intervals in a lattice (sets 
of the form {x:a@<x<b6}) and families of connected subsets of a (topological) 
tree (cf. arboreal hypergraphs in section 3). Notice that the k-Helly property is 
preserved under the product of hypergraphs. For instance, Euclidean boxes (i.e., 
parallelepipeds with faces paralle]) to the coordinate axes) possess the Helly 
property (see Theorem 4.40 for more). 


Theorem 2.16 (Berge and Duchet 1975). Let # be a hypergraph. Then 

(1) % has the k-Helly property if and only if, for every set A of k + 1 nodes, all 
the edges E such that |EQ A|=k share a common node. 

(2) #", the dual of #, has the k-Helly property if and only if the C-maximal 
edges of # are precisely the C-maximal cliques of some k-graph. 


(2.17) In particular, with any graph G can be associated a separating Helly hy- 
pergraph %'(G) such that L(%"(G)) = G: hypergraph X '(G) is the dual of the 
clutter formed with the maximal cliques of G. That one-to-one correspondence 
constitutes a bridge between graphs and hypergraphs and appears to be very 
useful. Properties of a given class of graphs, say G, may be derived, with the tools 
of hypergraph theory, from more general properties of Helly classes of hy- 
pergraphs. By a Helly class we mean (Duchet 1978) a class of Helly hypergraphs 
whose line-graph lies in G@ and which is closed with respect to certain operations. 
Numerous examples of Helly classes are provided in section 3. 

Various problems on set systems are drastically simplified when restricted to 
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Helly hypergraphs. Theorem 4.40 gives an example of a covering problem; let us 
conclude this section with a celebrated intersection problem. 


Conjecture 2.18 (Chvatal 1972). The hereditary closure # of any hypergraph # 
contains at most A(#) pairwise intersecting edges. 


From a result of Schénheim, it follows that the conjecture holds true when # is 
a Helly hypergraph. For further information on Chvatal’s conjecture, we refer to 
chapter 24 (see also the discussion on the “edge-coloring property” in section 5), 


3. Remarkable hypergraphs and min-max properties 


In many cases, a satisfactory solution of a combinatorial optimization problem, 
such as a min-max relation, amounts to saying that certain 0-1 matrices (= 
hypergraphs) have particular structural properties. Such matrices may, for 
instance, express constraints in a linear program, and some special properties 
imply the integrality of rational optima. It is often useful to consider those 0-1 
matrices as hypergraphs, although, in a large part of the extensive literature on 
the subject, the hypergraph structure is not explicitly mentioned. 

The same classes of hypergraphs, and related min—max type properties, may be 
seen in a different way. Either they generalize remarkable classes of graphs 
(bipartite graphs, trees), or they correspond to interesting patterns of some 
combinatorial structure (paths of a graph, branchings, . . .). 


3.1. A hierarchy of hypergraph properties 


The synoptic diagram in fig. 3.1 visualizes remarkable properties generalizing 
either those of trees or those of bipartite graphs. In the diagram, all possible 
unrefinable implications are marked by arrows. 


Balanced hypergraphs (section 3.3) form a very natural generalization of 
bipartite graphs. They are defined as hypergraphs for which no subhypergraph is 
an odd polygon. Balanced hypergraphs are bicolorable (i.e., they admit a proper 
2-coloring (1.24)) and have the Helly property (i.e., pairwise intersecting edges 
must share a common point (2.16)-(2.17)). Moreover, they possess the main 
min-max type properties of bipartite graphs (viz. chapters 3 and 4): 

- The Konig property: the blocking number 7 coincides with the matching number 
vy (see definitions (1.20) and (1.22); 

~The dual K6nig property: the edge-covering number p coincides with the 
maximum cardinality of a strongly stable set a,., (see (1.21) and (1.19)); 

- The edge-coloring property: the chromatic index coincides with the maximum 
degree A (see definitions (1.25) and (1.18)); 

~The Gupta property: the edges can be partitioned into 6 edge-covers, where 6 
denotes the minimum degree. 
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Figure 3.1. 


Investigation on tree-like structures was motivated by practical applications. 
Families of intervals of Z (Interval hypergraphs, see section 3.4) and systems of 
paths in a tree occur in numerous seriation problems (e.g., location of genes in a 
DNA molecule, phylogenetic trees, . . .). Arboreal hypergraphs (section 3.4), i.e., 
families of connected subsets of a tree, generalize the preceding classes. They are 
used to organize information retrieval systems in relational data bases. They also 
offer a natural setting to some “‘greedy-type” algorithms, based on the existence 
of ‘removable nodes’’. Totally-balanced hypergraphs (section 3.4), defined by the 
absence of any polygon as a subhypergraph, were encountered in the polyhedral 
approach of some location problems. They get together the advantages of 
arboreal structures and of balanced matrices. 

Normal hypergraphs (section 3.5) were introduced as Helly hypergraphs 
corresponding (viz. section 2.5) to perfect graphs (see chapter 4). A hypergraph 
# is normal iff every partial hypergraph #’ C # has the edge-coloring property. 
The “perfect graph theorem” amounts to saying that normal hypergraphs have 
the K6nig property. 

Mengerian hypergraphs and Fulkersonian hypergraphs (section 3.6) provide a 
general setting for important minimax theorems encountered in the polyhedral 
approach of Network Theory (Menger theorem, two-commodity flows, . . .). They 
are defined by integrality conditions on fractional blocking and matching numbers. 
It is more convenient here to extend definition (1.23) using the polyhedral 
approach. 


3.2. A glimpse of the polyhedral approach 


Given a hypergraph # =(V, @), with incidence matrix H, and vectors a€Q*, 
b EQ‘, the fractional blocking-set polyhedron FB(a) and the fractional matching 
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polyhedron FM(b) associated with # are respectively defined by: 
FB(a) = {x €Q)|Hx>a}, FM(b)={yEQ'|yH <b}. (3.1) 


Vectors in FB(@), FM(6) are called fractional a-blocking sets and fractional 
b-matchings, respectively. The adjective “‘fractional” is omitted if we restrict 
ourselves to integral vectors. We omit a,b in the cases a=1,b=1. Integral 
1-blocking sets and 1-matchings are respective incidence vectors of usual match- 
ings and blocking sets. We recall that the clutter consisting of C-minimal blocking 
sets is the blocker BI(#) of #. 

By linear programming duality, we have 


min{b- x |x € FB(a)} = max{y-a| y © FM()} . (3.2) 


Hence, the fractional blocking number r* and the fractional matching number 
v* coincide: 


7* = min{1-x |x FB} = max{y-1|y €@ FM} = »*. 


The blocking number 7 (respectively the matching number vy) corresponds to 
the integral optimum of the ‘‘min” (‘“max’’) in (3.2), with a= 1 and b= 1. More 
generally, the integral optima 


7,(#) = min{1-x|xEZ* 1 k(FB)} 
and 
v,(#%) = max{y-1| y € Z* Nk(FM)} 


are the k-blocking number and the k-matching number of # respectively. A 
k-matching corresponds to a set of edges such that each node belongs to at most k 
of them. A k-blocking set is a set of nodes such that each edge contains at least k 
of them. Remark that repetition of nodes and edges is permitted. We have 
fundamental inequalities (obtained in 1968 by Berge, Simonovits and Lovasz, see 
Berge 1989, Lovasz 1979 or Schrijver 1979a): 


_ ¥(9) v(t y,(#) 
H(96) = 9 (30) = mig AF gyp HD pay “HO? — v2(ae) 
* _ 1H). . H(H) 7,(#) 
=7*(#) = min - = inf — < max 7 =1,(#) =1(#). 
. (3.3) 


The aim of the hypergraph “‘fractional theory’’ is to find the integers & that 
force min-max relations of the type v* = »,/k or 7* =7,/k. (More on the subject 
in section 4; viz. chapters 28 and 30.) 
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3.3. Balanced hypergraphs 


Since balanced graphs are defined by the absence of odd polygons, they form a 
self-dual class. Berge introduced this important class of hypergraphs in 1969 and 
proved bicolorability under the stronger following form (existence of a “good 
k-coloring”’). 


Lemma 3.4. For every integer k > 2, a balanced hypergraph admits a k-coloring of 
its nodes such that the number of colors appearing in any edge E is exactly 
min(k, |E]). 


The next theorem summarizes the minimax types properties of ,balanced 
hypergraphs (see Berge 1989, Lovasz and Plummer 1986 or Schrijver 1979a for 
proofs). 


Theorem 3.5 (Berge 1970, Berge and Las Vergnas 1970). For a hypergraph H, the 
following properties are equivalent: 

(i) # is balanced, 

(ii) #', the dual of #, is balanced; 

(iii) Every subhypergraph of 2 is bicolorable,; 

(iv) Every subhypergraph of # has the Kénig property; 

(v) Every subhypergraph of # has the edge-coloring property; 

(vi) The blocker of any subhypergraph of 3 has the Konig property. 


Applying, for instance, (vi) to duals of bipartite graphs, we obtain Gupta’s 
theorem. Theorem 3.5, together with a result of Fulkerson et al. (1974), see 
Berge (1989) and Schrijver (1979a), implies that either in case a € {1, +~}*, 
bEZ* or in case a€ Q*, bE {1, +~)", both dual programs in (3.2) have integral 
optimum solutions (hence, balanced hypergraphs are Mengerian: see (3.21)). 
These results can be formulated equivalently in terms of total dual integrality, 
blockers, antiblockers, etc. (see chapters 30 and 28). 

No polynomial recognition algorithm for balanced hypergraphs was known until 
Conforti et al. (1990) succeeded, by the use of a decomposition theorem relative 
to bipartite incidence graphs (as defined in (1.2)). 

In a bipartite graph B, a 2-complete cut is a cut (= an edge set whose removal 
disconnects B) which consists of the union of the edge sets of two node-disjoint 
complete bipartite subgraphs. A 2-star cutset is a pair of adjacent nodes whose 
removal disconnects B. 


Theorem 3.6 (Conforti et al. 1990, cf. Conforti et al. 1994). The incidence 
bipartite graph of a balanced hypergraph admits a 2-complete cut or a 2-star cutset. 


(3.7) Unimodular hypergraphs correspond to totally-unimodular 0-1 matrices, 
treated in depth in chapter 30. Unimodular hypergraphs are balanced, moreover 
both polyhedra FB(a) and FM(b) have integral vertices for every integral vectors 
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a, b. Ghouila-Houri’s characterization theorem which asserts the existence of an 
“equitable 2-coloring’’, can be extended as follows (De Werra 1971, see Berge 
1989): For any integer k, the nodes of an unimodular hypergraph can be 
partitioned into k stable sets S, such that, for each edge E, the cardinalities |E 0 S,| 
differ by at most \. 

Hypergraphs with no odd cycles are unimodular (Berge 1989): examples are 
bipartite multigraphs, and subhypergraphs of any ‘‘grid-hypergraph”’ consisting of 
rows and columns of a two-dimensional array. 

Interval hypergraphs (section 3.4) and more generally systems of oriented paths 
in an oriented tree (3.18) are unimodular hypergraphs. See chapter 30 for further 
examples. A polynomial-time algorithm to test whether a matrix is totally 
unimodular results from the work of Seymour on regular matroids (see chapter 
10). 


3.4. Arboreal hypergraphs 


A hypergraph 2 is said to be arboreal when there exists a tree T on node set 
V(#) such that every edge of # induces a 7-subtree. Such trees are called 
representative trees for 3€. As shown by the structural characterization which 
follows, a hypergraph is arboreal iff its dual hypergraph is acyclic (viz. Theorem 
2.3). 


Theorem 3.8 (Duchet 1978, Flament 1978, Slater 1978). A hypergraph # is an 
arboreal hypergraph if and only if 2 has the Helly property and every cycle of 
length 23 possesses three intersecting edges. 


Arboreal hypergraphs form a Helly class (2.17) corresponding to chordal 
graphs. Since chordal graphs are perfect (see chapter 4), arboreal hypergraphs 
share the min-max type properties of normal hypergraphs (Theorem 3.19, see 
also Theorem 4.40 for related results). Algorithms concerning arboreal structures 
frequently use an “‘elimination scheme”: the nodes of an arboreal hypergraph can 
be ordered as v,,...,v, such that every subhypergraph #[v,,u,,,,...,U,] is 
arboreal. To handle representative trees, Duchet (1985) introduced an efficient 
decomposition method; see Theorem 3.16 and (3.18) for some applications. 
(3.9) Totally balanced hypergraphs. A hypergraph # (equivalently, its incidence 
matrix 1) is said to be totally balanced if no subhypergraph is a polygon. Observe 
that, by Theorem 3.8, totally balanced hypergraphs are arboreal [see Lehel 
(1985) for a construction of the representative trees] and form a self-dual subclass 
of balanced hypergraphs. If H is an m Xn totally balanced matrix, the general 
weighted b-matching problem 


max{y-a| yEQ”, y<y, and Hy <b} (3.10) 


has an integral optimal solution for every a, y,,b =0 (such problems arise as 
duals of some location problems, for instance, if, on a tree, distributing centers 
are to be chosen in order to serve clients at a minimum cost: see (3.14)). 
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Moreover, the solutions of (3.10) can be found by a “greedy” algorithm: 
coordinates are optimized one by one according to a certain reordering of rows 
and columns of M, based on the following theorem. 


Theorem 3.11 (Lubiw 1985, Hoffman et al. 1985). A 0-1 matrix is totally 
balanced if and only if rows and columns can be reordered such that the resulting 
matrix has no submatrix of the form 


1 1 
r=(; 4). 

An elegant way of proving Theorem 3.11 is to consider the colexicographic 
order on vectors: (@,,@3,. ++) <coiex (b,, 62, -- .) if a, <b, for the largest index k 
such that a, # b,. By suitable permutations of rows and columns any matrix can 
be arranged in “‘colexical form” (i.e., both rows and columns of the transformed 
matrix are in colexicographic order). For a totally balanced matrix, every 
colexical form is F-free! Hence the method also provides a fast algorithm to test 
whether a given matrix is totally balanced. A straightforward consequence of 
Theorem 3.11 is the following property, due to Brouwer and Kolen: any totally 
balanced hypergraph contains a “‘nest point’’, i.e., a node vu such that the edges 
containing v are totally ordered under inclusion. 

To prove that a hypergraph is totally balanced, the following criterion, due to 
Lubiw, is useful. 


Proposition 3.12. The 0-1 intersection matrix of the edges of a totally balanced 
hypergraph is totally balanced. 


Line-graphs of totally balanced hypergraphs (‘‘strongly chordal graphs”) have 
been extensively studied. The above results were obtained independently by 
Farber. For instance, Farber showed that a graph is totally balanced if and only if 
its adjacency matrix is totally balanced (compare with Proposition 3.12) and he 
characterized those graphs by a simple list of excluded induced subgraphs. 
(3.13) A chordal bipartite graph is a bipartite graph for which every cycle of 
length >4 has a chord. Chordal bipartite graphs were discussed by Golumbic 
(1980) in relation with perfect Gaussian elimination for systems of linear 
equations. Actually, chordal bipartite graphs coincide with the bipartite incidence 
graphs of totally balanced hypergraphs. Perfect elimination schemes correspond 
to F-free orderings; to the first non-zero entry of the first row of a I’-free form of 
the matrix corresponds a bisimplicial edge of the graph, i.e., an edge {i, j} such 
that all vertices adjacent to i or j induce a complete bipartite graph. 

(3.14) Consider a tree T and let SCV(T). For every node x let 0<dj<d) <= 
d<--- denote the sequence of distances from x to elements of S and let 7, 
denote the minimal subtree containing x and the elements s € S with d(x, s) < dj. 
For pEN denote by S(x, p) the set of all nodes y of 7, such that d(x, y)<p 
where k is chosen so that d)_,<p<dj. Tamir (1985) showed that the hy- 
pergraph (E,),-,;:= {S(x, p)|x EV(T), p EN} is totally balanced. By Proposition 
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3.12 the hypergraph (£,),-, on J, where E,={jE/|E,NE, #9}, is totally 
balanced too. For S = V, we obtain a “neighborhood system” (Giles). For |S| = 1, 
hypergraph (E,) consists of paths of an arborescence while the sets £, form a 
“bipath-system”’ (Frank). 

For more information on totally balanced structures, see Anstee and Farber 
(1984), Lubiw (1985), Hoffman et al. (1985) and their references. 


Definition 3.15. # is an interval hypergraph if, for some total ordering of its 
nodes, every edge forms an interval. Interval hypergraphs constitute a Helly class 
(2.17) which is closed under passing to subhypergraphs and corresponds to 
interval graphs (see chapter 4). More precisely, a hypergraph # is an interval 
hypergraph if and only if it has the Helly property and the line-graph of the 
augmented hypergraph #*, resulting from # by adding all singletons as new 
edges, is an interval graph. 


A characterization of interval hypergraphs by forbidden configuration (Tucker 
1972) can be promptly derived from the following ‘“‘betweenness theorem” (in 
hypergraph #, a node x lies between two other nodes y, z if every y—z path has 
an edge containing x). 


Theorem 3.16 (Duchet 1978). # is an interval hypergraph iff every set of 3 nodes 
contains a node which lies between the two others. 


Investigating the Kénig property for polyominoes (see (3.20)), Gydri (1984) 
found a nice min-max property of interval hypergraphs. 


Theorem 3.17. Let # =(E,),., be a family of intervals of a totally ordered set V. 

Let a,(#) denote the maximum length of a sequence i,,...,i, such that 

E,, ZE,,UE,U+*UE, forl<k<p. Let 6,(#) denote the minimal number 
447 k 

of V-intervals we can choose so that every edge of X is expressible as a union of 

chosen intervals. Then 


a (H) = O(#). 


(3.18) Paths in trees. Families of node-sets of paths in a tree (7P-hypergraphs), 
families of oriented paths in an oriented tree (OTP-hypergraphs) or in a rooted 
tree (RTP-hypergraphs) generalize interval hypergraphs. The problem of charac- 
terizing such structures was raised by Renz in 1970. By (3.7), OTP-hypergraphs 
are unimodular. RTP-hypergraphs are preserved under taking subhypergraphs 
[hence they have the additional property of being totally balanced, compare with 
(3.14)], but this is not true for‘OTP-hypergraphs and TP-hypergraphs. Neverthe- 
less, in each case, a characterization by a (huge) list of excluded configurations 
can be given: a hypergraph is in the class iff it is arboreal and has no 
subhypergraph in the list (Arami and Duchet 1988). It also appears that a 
TP-hypergraph is an OTP-hypergraph if and only if it is balanced. Polynomial 
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recognition algorithms are easily derived. Similar results on the corresponding 
line-graphs follow from a suitable “Helly class representation method” (see 2.17). 


3.5. Normal hypergraphs 


Following Lovasz (1972), let us call a hypergraph # normal if (i) it has the Helly 
property, and (ii) its line-graph L(#) is perfect (i.e., x(G’) = w(G’) for every 
induced subgraph G’; see chapter 4)). Arboreal hypergraphs are normal, since, 
by a theorem of Gallai, chordal graphs are perfect. 

For #' C #, L(#’) is an induced subgraph of L(#). Since x(L(#')) = y(H’) 
and w(L(#’)) = A(%’), we see that # is normal iff every partial hypergraph 
#' CH has the edge-coloring property: y(#’) = A(#'). Hence, balanced hy- 
pergraphs are normal. 

By (2.17), to every perfect graph G corresponds (canonically) a normal 
hypergraph # = X '(G); clutters of the form X(G) are called conormal clutters. 
Matchings of # correspond to stable sets of L(#); blocking sets correspond 
(because of the Helly property) to clique-covers of G. Equivalently, »(#) = 
a(G)=«(G) and 7(#)=(G). Thus, the implication “G perfect > G per- 
fect” (Lovasz’s perfect graph theorem) is a part of the following general theorem. 


Theorem 3.19 (Lovasz, see Lovasz 1979, Berge 1989, Schrijver 1979a). For any 
hypergraph 2, the following conditions are equivalent: 
(i) # is normal; 
(ii) Every partial hypergraph 2' C # has the Kénig property: t(#') = v(#'); 
(iil) 7*(#’) = 7(2') for every H' CH; 
(iv) 7,(#') = 21(H') for every H' CH; 
(v) v(3')A(H') <|2'| for every every #' CH. 


See chapters 3 and 30 for polyhedral proofs of this theorem. An easy (but 
crucial!) point is that multiplication of edges in a normal hypergraph preserves the 
edge coloring property. It follows, that # is normal iff the maximum in (3.2) has 
an integral optimal solution for b = 1 and every a Z* (equivalently the fractional 
matching polyhedron is integral). Actually, the system defining FM is totally dual 
integral (Lovasz 1972, Chvatal 1975, Fulkerson 1972). Inequality 3.19 (v) may be 
viewed as the ‘‘width-length inequality” of antiblocking pairs of polyhedra (the 
duals of normal clutters corresponding to complementary graphs form an 
antiblocking pair: viz. chapter 30). See chapter 4 for more information on perfect 
graphs. 

(3.20) Polyominoes. Let S denote the set of unit squares (= cells) with integer 
vertices in R?. A polyomino is a finite subset of S (here, a polyomino need not be 
connected). A box is a rectangular polyomino. A hypergraph #(P) is associated 
with any polyomino P as follows: the nodes are the cells of P, and the edges are 
the C-maximal boxes contained in P. Polyomino-hypergraphs have the Helly 
property and are conform (Theorem 2.16). If P is simply connected, then J¢(P) is 
normal (Shearer 1982, Berge 1989). That is the case when P is horizontally 
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convex, i.e., when cuts of P by horizontal lines are intervals (Berge et al. 1982). 
Theorem 3.5 has the following consequence: hypergraphs corresponding to 
horizontally convex polyominoes have the dual K6nig property: p(3€(P)) = 
1. (H(P)). For simply connected polyominoes, Gyéri proved: p(A(P)) < 
2aj2(H(P)) — 1. 


3.6. Mengerian and Fulkersonian hypergraphs 


Let # =(V, €) be a hypergraph. Expanding a node v EV by an integer k means 
replacing v by k new vertices v,,...,v, and each edge E containing v by k new 
edges E—v Uu,,..., £—v U,. For instance, if uv is the set of arcs of a directed 
graph D with two given nodes s,t, and # is the collection of s—t paths, then 
expanding v by & corresponds to replacing, in D, arc v by k parallel arcs, 
Expanding a vertex by 0 is equivalent to considering the partial hypergraph 
Hv. 

More generally, for wEZ‘, the expanded hypergraph #” arises from # by 
expanding, successively, every node by the corresponding component of w. Let us 
consider the following properties: 


(a) v(#")=v*(H") foreverywEZ° , 


3.21 
(b) 1(%#”)=7*(#") foreveryweEZ:. 2) 


As easily seen, (a) (respectively (b)) amounts to saying that the maximum 
(minimum) in (3.2) has integral optimal solution for a = 1 and every & = w. Thus 
(see chapter 30), condition (a) expresses that the system defining the fractional 
blocking set polyhedron FB is totally dual integral, while condition (b) means that 
FB is integral. Hence v and z do not play a symmetric role: (a) is stronger than 
(b). 

Hypergraphs which satisfy (3.21) (a) are said to be Mengerian (or hypergraphs 
with the Z,-max-flow min-cut property, Seymour (1977); the term ‘‘Mengerian”’ 
refers to the flow version of Menger’s theorem: see Example 3.24(1)). Mengerian 
hypergraphs are characterized as follows. 


Theorem 3.22 (Lovasz 1976, see Schrijver 1979a, Berge 1989). The following 
conditions are equivalent: 
(i) # is Mengerian; 
(ii) Every expanded hypergraph #” has the Kénig property; 
(iii) v,(9€") = 2»(H") for every wEZ*; 
(iv) For some integer k = 2,v,(#") = kv(#”) holds for every wEZ*. 


Hypergraphs which satisfy (3.21)(b) are said to be Fulkersonian [= hypergraphs 
with the Q.,-max-flow min-tut property in Seymour (1977), paranormal hy- 
pergraphs in Berge (1989)]. We underlined above the importance of anti-blocking 
polyhedra for normal hypergraphs; for Fulkersonian hypergraphs, the theory of 
blocking-polyhedra, developed by Fulkerson (1971) and Lehman (1979) (sev 
Schrijver 1979b, 1983, Berge 1989, and chapter 30), plays a similar role. 
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Theorem 3.23 (Fulkerson 1971, Lehman 1979). Let # be a hypergraph and let 
H = BUH) denote its blocker. The following conditions are equivalent: 
(i) # is Fulkersonian; 
(ii) +*(H”) is an integer for every wEZ*; 
(iti) 7*(#") = 7(#") for every wEZ‘; 
(iv) 7(3€”)1(H') < wel for every w1EZ"; 
(i)’-(iii)’ obtained from (i)—(iii) with X instead of #. 


Observe that condition (iv) of this theorem (the ‘“‘width—length inequality”) is 
preserved under taking blockers, since BI(BI(#)) consists of the C-minimal edges 
of # (see (4.4)). 

Seymour showed that if a clutter € is Mengerian or Fulkersonian, «hen any 
contraction—deletion minor of € has the same property. Here, a contraction— 
deletion minor €' arises from € by taking disjoint subsets V,,V,CV(€@) and 
defining €’ to be the collection of minimal sets in 


{E\V, |EE 6, ENV, =}. 


The clutter Q,=K, is an example of a minimal non-Mengerian clutter. Its 
Blocker BI(Q,) is Mengerian. It was conjectured that if € has no Q, minor and if 
BI(€) is Mengerian, then € is Mengerian. This is, however, contradicted by an 
example due to Schrijver (1980, 1983). To characterize Mengerian or Fulkerso- 
nian hypergraphs by excluded minors seems to be a difficult problem; see 
Theorem 3.25 for a partial answer. 


Examples 3.24. By Theorem 3.5, balanced hypergraphs and their blockers are 
Mengerian. Several min-max theorems in Graph Theory provide interesting 
blocking pairs of Fulkersonian clutters €, BI(€@) [see chapter 30, and Schrijver 
(1979a, 1983) for details and proofs]. 

(1) Given a directed graph D = (V, A) and s,t€V, let V(€) = A; edges of @ 
are minimal s-f directed cuts, edges of BI(@) are s-t minimal directed paths. 
Clutters @ and BI(€) are Mengerian (max-potential min-work and max-flow 
min-cut theorems). 

(2) Given a digraph D =(V, A) with root r, let V(€)=A; edges of @ are 
minimal r-cuts, edges of BI(€) are r-arborescences. Again, @ and BI(€) are 
Mengerian (Fulkerson’s optimum branching theorem and Edmonds’s disjoint 
branching theorem). 

(3) Given a graph G=(V,E) and s,,¢,,5,,t,€V, let V(€)=E; edges of © 
are minimal ‘‘two-commodity cuts” which separate each of the pairs 
{s,,t,}, {S2, t}, edges of BI(@) are minimal sity paths and minimal s,-1, paths. 
Seymour and Hu proved that € and BI(€) are 5-Mengerian, i.e., in both cases, 
the maximum in (3.2) has half-integral optimal solution fora=1,bEZ* 

(4) Given G = (V, E) and SCV, let V(€) = E; edges of © are minimal S- -cuts, 
edges of BI(€) are minimal S-joins. The clutter € is 1-Mengerian (Edmonds and 
Johnson 1973). 
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(5) Given D=(V, A), let V(@) = A; edges of € are minimal directed cuts 
(= sets of arcs entering a nonempty proper subset S such that no arc of D leaves 
S), edges of BI(€) are minimal coverings by directed cuts. The Lucchesi- 
Younger theorem asserts that € is Mengerian; generally BI(€) is not Mengerian 
(Schrijver 1980, 1983, Berge 1989). 


Seymour was able to characterize Mengerian hypergraphs for an interesting 
class of clutters. A clutter @ is called binary if for all E,,£,,...,E, in @ with k 
odd, the set £,AE,A--- AE, includes some edge of €. The blocker of a binary 
clutter is binary again. Examples 3.24 (1), (3) and (4) are examples of binary 
clutters. 


Theorem 3,25 (Seymour 1977). A binary clutter is Mengerian if and only if it has 
no Q. minor. 


No characterization of Fulkersonian binary clutters is known. The binary clutter 
O(G) formed with the edge-sets of odd circuits of a graph G is Fulkersonian if G 
is planar, but O(K,) is not Fulkersonian. 


3.7. Further refinements 


The following refinements of Theorems 3.19, 3.22 and 3.23 provide general 
conditions under which the k-matching number », coincides with the k-blocking 
number. 


Theorem 3.26 (Lovasz 1975a, 1977, see Schrijver 1979a, Fiiredi 1988). Let KE 
{1,2,3} be given. If kr*(#’) is an integer for each partial hypergraph X' C #, 
then 1,(3#) = v,(#). 


Theorem 3.27 (Schrijver and Seymour 1979, Schrijver 1979a, Fiiredi 1988). Let k 
be a positive integer. Each of the following conditions implies 1,(#) = v,(#): 

(i) kr*(3") is an integer for every wEZ\™), 

(ii) »,(%") = 2,(H”) for every wE ZY). 


Theorem 3.26 does not hold for every &. Schrijver and Seymour gave an 
example for which 207*(9¢’) EN for every #'CH but 7,.(H) #207*(H) and 
Veg(HC) ¥ 60V*(H). 


4. Stability, transversals ancematchings 
For a general hypergraph, the problem of determining its packing and covering 


parameters such as a, Qo), ¥, p, and 7 (viz. definitions in section 1.7) is NP-hard 
(practical algorithms may be found in Nemhauser and Wolsey 1988); hence 
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estimates are useful. Basic inequalities are 


(i) v<-and, for7-graphs,7<rv , 

(ii) m/As7, 
(iii) =» <p for uniform hypergraphs , (4.1) 
(iv) vt(p—v)/r<n/r<p—(p — v)/r for r-graphs , 


(v) t<rr*—r+1 forr-graphs. 


Equality is possible in each case. 

Gallai observed that the identity a + + =n holds for every graph. That trivially 
extends to hypergraphs, since stable sets correspond to blocking sets under taking 
complements. Considering dual hypergraphs, we have the additional identities: 


P(H)=1(H™), — %,(H€) = v(H") = aH). (4.2) 


Consequently, we may focus without loss of generality on blocking number 
(node-covering problems) and matching numbers (edge-packing problems), except 
for few cases in which other parameters appear to be more natural. The 
K6nig—Hall theorem and its relatives (systems of distinct representatives, ‘“‘trans- 
versal theory”, cf. chapters 3 and 9) has widely inspired the earliest results on 
blocking and matching numbers. Although this point of view is still active (e.g., 
Theorem 4.15), important inequalities were obtained by linear algebra, averaging 
methods and probabilistic arguments. Polyhedral combinatorics [see (3.1)—(3.3) 
and chapter 30] are of a particular efficiency since the fractional packing problem 
(finding a maximum matching with weighted edges) and the fractional covering 
problem (finding a minimum blocking set with weighted nodes) appear as dual 
linear programs. Recalling (3.3), we have 


v(H#) = v,(9) = min 4) < max vA) v*(#) 
=7*(#) =min nie) <max n%) =1,(#) =7(#) (4.3) 


Section 2 was mainly devoted to the equality cases in (4.3). Here we wish to 
estimate how good the inequalities are, so the central question becomes: how far 
is + from v? 


4.1. Examples of packing and covering problems 


(4.4) Blocking clutters. Let # and HX be two clutters on the same node-set V. 
Then # = BK(#) if and only if, for every bicoloring of the nodes, say in pink and 
green, either # has a pink edge or (exclusive) #’ has a green edge. This property, 
which implies BI(BI(#))= 2, is the discrete version of the “width—length 
inequality” which characterizes blocking pairs of polyhedra (see Theorem 3.23 
and chapter 30). In a matroid, for instance, bases and cocircuits form a pair of 
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blocking clutters and the above property is usually referred to as Minty’s Lemma 
(see chapter 9). Various other examples were given in section 3.6. 

Let #, = Bl(#) be a pair of blocking clutters and f be any real-valued 
function on V. The following duality formula (“bottleneck extrema’) was 
obtained by Edmonds and Fulkerson (1970) in the context of facility location 
problems (see chapter 35): 


max(min f(v)) =min(max f(v)) . (4.5) 


(4.6) Self-blocking clutters (i.c., # = BI(3#) holds) coincide with intersecting 
3-chromatic clutters (cf. section 5). Obvious examples are K},_, (for BI(K’,) = 
K"'**), the Fano plane) and the fan, on ground-set 1,...,” with edges 2,.. ,n 
and 1,é for i=2,...,. The following less trivial example is due to Erdés and 
Lovasz: as node-set, let us take the union of r pairwise disjoint sets V, with |V,| =i 
for 1<isr; the edges have the form V,U 7, where 1<i<r, and 7 is any set 
such that |7]=r—i and |TNV,|=1 for i<j <r. See (5.27). 

(4.7) Let # be the hypergraph whose edges are the hyperlanes of the k- 
dimensional affine space over GF(q). The trivial lower bound on the blocking 
number is in fact the right value: 7(#) = k(g — 1) + 1 (Jamison 1977, see chapter 
32 for a proof). If lines are considered instead of hyperplanes, the blocking 
number is not known, even for q = 3. 

(4.8) The Ramsey number R(p,q) (see chapter 25) may be viewed as the 
optimum of a matching problem (Erdos et al. 1971, see Berge 1989): 


R(p, q) =1+ max(v(# @ #')| oH) <p, v(H#') <q). 


Turan’s problem 4.9. For 1<r<n, let T(n, p,r) denote the minimum number 
of edges of r-graph # of order n with a(#) <p. T(n, p,r) is the blocking number 
of the hypergraph K} | K;, whose edges are the copies of K’ in Kj. Inequality 
(4.1) (ii) gives (7)(2)' <T(n, p, 1). 

In fact, the ratio T(n, p,r)(") ' has a limit ¢(p, r) as n goes to infinity (Katona 
et al. 1964). The determination of ¢(p, r) is one of the most challenging problems 
in extremal set theory (see chapter 24). Let a(n,r, p)=(")— T(n, p, r) denote 
the stability number of Kj|K/,. Erdés and Sauer (see Erdés 1981) made the 
following conjecture. 


Conjecture 4.10. The edges of any r-uniform hypergraph can be partitioned into 
at most a(n, p,r) edges and copies of Kj. 


For partial results see Theorem 4.15 and its consequences. 
ww 
Zarankiewicz’s Problem 4.11. The following problem, raised in 1951 is not yet 
completely solved: what is the smallest integer z(p,q) such that every boolean 
Pp X q-matrix with z(p, q) entries equal to 1 necessarily contains an r x s-submat- 
rix of which all entries equal 1? Again, this is equivalent to determining a 
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blocking number: z(p, q) = a(K, @ Ki) +1=pq+1-—7(K,@K%). See chapter 
24 for further information. 


4.2. Stability number 


The stability number appears in situations where a given pattern is to be avoided 
in a large part of some structure. The lower bound on the Turan number given 
above, easily implies the following inequality: 


-I/r 
a=nm 


for any r-uniform hypergraph . (4.12) 

For loop-free hypergraphs, inequality a >n/(4+ 1) is obvious. If ¥ is loop- 
free, the minimum number of stable sets needed to cover V(#) is precisely the 
chromatic number y(#). Hence, from inequalities (5.14) and (5.15), we have the 
following strengthening: 


a Fae for any loopless hypergraph . (4.13) 
f 

Here A, stands for the maximum finear degree, i.e., the maximum size of a linear 

star. Various similar bounds can be obtained by simple counting arguments, Berge 

(1989). For instance, if @' denotes the minimum cardinality of an inclusion- 

maximal stable set, Meyer (1975) and Lorea (1972) (see Berge 1989) obtained the 

following inequality, where A denotes the minimum cardinality of an edge: 


n(h-1) 
rc ere se 
Parameter a’ is intimately related to the Helly property: let #=(E,,...,£,,) 


be a hypergraph on a set V of p + g elements. Put #° =(V—E,...,V—E,,). By 
Theorem 2.16, we have a'(#)=>p if and only if #* has the q-Helly property. 

The following interesting extension of the K6énig-Egervary theorem for 
bipartite graphs was proved by Lehel (1982). 


Theorem 4.15, Suppose # has no isolated node and every ACV(#H) contains a 
stable set of # of cardinality at least 4|A|. Then p(#)<a(X). 


Applying this theorem to suitable hypergraphs, we obtain: 


Corollary 4.16. For 1<r<p, any r-graph G can be edge-covered by a(G, p.r) 
edges and copies of K),, where a(G, p,r) is s the maximum number of edges of a 
K/-free partial hypergraph of G. 


Corollary 4.17 (Lehel and Tuza 1982). Let F be a nonbipartite graph and G an 
arbitrary graph. Then the edge-set of G can be covered by a,(G) edges and copies 
of F, where a,(G) is the maximum number of edges of an F-free subgraph of G. 
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Corollary 4.17 answered a conjecture of Bollobas: with the notation of Turan’s 
Problem and Conjecture 4.10, the edges of any r-uniform hypergraph can be 
covered by a(n, p, r) edges or copies of K,. Lehel conjectures that in fact we can 
pack: the copies of K> could be chosen to be pairwise edge-disjoint. This would 
imply the Erdés—Sauer Conjecture 4.10. 

More precise lower bounds on @ are known under additional assumptions. 


Theorem 4.18 (Ajtai et al. 1982). Suppose an r-graph X of order n has no cycle of 
length less than S and has average degree t with r<t <n. Then, for some constant 
c, depending only on r, we have 


a(#)=c,nt~ ‘(log 1)!” . 


4.3. Fractional optima 


The series of inequalities (4.3), obtained by the linear programming approach of 
packing and covering problems, yields basic bounds on the common fractional 
optimum 7* = v* of the blocking number and the matching number. Let X be a 
blocking-set of hypergraph # and E€#. Then, for any partial hypergraph 
HH’ CH, we have 


(4.19) 


As Lovasz observed, equality occurs on the left-hand side (respectively the 
right-hand side) if the automorphism group of # is transitive over the edges 
(nodes). Moreover, for regular r-uniform hypergraphs, we have 

m n 


= + _ Tk 
ZTaHvatt= 
A 


(4.20) 


~ 


An r-uniform hypergraph # is said to be quasi-regularizable (respectively 
regularizable) if, for some natural integers k(E) with positive sum (respectively if, 
for some positive integers k(E)), a regular r-uniform hypergraph #%’ can be 
obtained by multiplying each edge £ of # by k(£). Quasi-regularizable r-uniform 
hypergraphs are precisely those for which 7* equals n/r (see Berge 1989). The 
Structure of regularizable hypergraphs is not known. 

Berge and Simonovits (Berge 1989, Lovasz 1979, Fiiredi 1988) gave further 
corollaries of (4.3), relative to hypergraph products (as defined in (1.6)). 


Corollary 4.21. For any pair &, G of hypergraphs, we have 
V(H)vV(G) < o(H# @ FY) <o(H)v*(F) < v*(H)v*(F) = v*(H@G) 
=7T*(H@S)=1*(H)t*(G) <1(H)t*(F) < 1(H OY) 
<1(H)1(¥). 
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Corollary 4.22. For every hypergraph 3, we have 


«(9¢) =mi HOG) VHOG)  , 
T(#)=min Tey MEK Sy 
where & runs over all hypergraphs. 


From the left-hand equality in this corollary it follows that 7(#) = 7*(#) holds 
if and only if we have 7(#@G)=7(#)7(Y) for every hypergraph Y. The 
right-hand equality was first obtained in implicit form by Rosenfeld (Firedi 
1988). 

What are the possible values of 7*? Lovasz (1975b) proved that any rational 
number (+1) is the fractional blocking number of some hypergraph. More 
precisely, given positive integers a, b, c and d such that a = b/c <d, b/c >1, there 
exist a hypergraph for which v = a, r* = b/c and 7 =d. 

Chung et al. (1988) brought a new insight. Given any rational t,0<¢<1, a 
rank 3 hypergraph exists with 7* — [7*] =¢. Nevertheless, if p/q (with gcd( p, qg) = 
1) is the fractional blocking number of a hypergraph of rank at most r, then 

Po 2 log g 


= 


gq rilogr’ 


Hence the set N, of possible values of r* for hypergraphs of rank at most r 
forms a discrete sequence. For instance, 


_ 435 27 9 
N,= {1, 4, 3,3,3,2,2,...}. 


We shall see below that further information on 7* = v* can be obtained by 
standard tools of linear programming. Let us mention here an interesting 
consequence of Farkas’s Lemma of linear programming. 


(4.23) 


Lemma 4.24 (Fiiredi 1981). Any hypergraph # of rank at least 2 contains a partial 
subhypergraph X' =(V', ') such that r*(H') =1*(H) and |€'|<|V'|. 


4.4. Comparison between fractional and integral optima 


The first general result that bounds the ratio 7/7* in terms of maximum degree 
was obtained by Lovdsz (1975a), see Berge (1989), Lovdsz (1979) and Furedi 
(1988), and in a slightly different form, by Stein (1974), see Fiiredi (1988) [we 
refer to Lovasz (1975b) and Firedi (1988) for other basic bounds on 7 that 
involve 7* and the ratios 7,/kr*]: 


1 
re(1tgth4---4+q)r*<(1 tog Ayr*. (4.25) 


Inequality is sharp, up to a constant factor. 


Proof. In eq. (4.25), 7 can be in fact replaced by |T|, where 7 is any minimal 
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blocking set obtained by a greedy algorithm. Let T= {x,,...,x,} be a blocking 
set defined as follows: node x, has maximum degree A, in hypergraph #, = #; 
node x,;,, has maximum degree 4,,, in the partial hypergraph %,,, = 2, — x,. 


Observe that A, =m, —m,,,, where m, is the size of #, (with m,,, =0). Writing 


Dye ltie ahs ay 
A, oA, Aone yeregly 


which easily implies (4.25). O 
Using (4.25) and Corollary 4.21, we easily obtain the following. 


Corollary 4.26 (McEliece and Posner 1971, see Berge 1989, Lovasz 1979). Setting 
H°? =H @+--Q@H (p factors), we have 


7*(9) =lim V1,(#*). 


The corresponding statement about v is not true: numbers y v, (3°?) go toa 
limit $(#), called the Shannon capacity of #, but, in general, S(#) # v*(9) (see 
chapter 1). 

For A-regular r-graphs, inequality (4.25) can be improved under rather weak 
“‘sparseness” assumptions. It is more convenient from now on to deal with the 
dual problem about the edge-covering number p and the corresponding fractional 
optimum p* which, by (4.19), is p* =n/r =m/A = v*. Let an arbitrary s-graph ? 
be given and denote by | Ki, the “factor hypergraph” whose edges correspond 
to copies of # in K,, (hence, the nodes are the s-subsets of a N-set). A celebrated 
conjecture formulated by Erdés and Hanani in 1963 (cf. chapter 14) amounts to 
saying that the ratio »(K; | K})/p(K, | Ky) tends to 1 when N->~ (p,s fixed). 
Roughly speaking, near perfect matchings and coverings exist (i.e., »=v* and 
p= p*). Rodi found in 1985 an ingenious probabilistic method which succeeded in 
proving the Erdés—Hanani conjecture. Frankl and R6dl (1985) generalized Rédl’s 
method and result in a hypergraph setting. They proved that near perfect 
edge-coverings exist in nearly regular r-graphs if the number of edges containing 
an arbitrary pair of nodes is not too large; the existence of near perfect matching 
follows since, in view of (4.1)(iv), any inequality of the form p < (1+ e)””” implies 
vy =(1-e(r—1))"”. A similawresult was previously obtained for random r-graphs 
(De La Vega 1982). Recently, Pippenger and Spencer obtained a simpler and 
more powerful version. 


Theorem 4.27 (Pippenger et al. 1989). For all integers r>2, reals k1=1, and e>0 
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there exist £ >0 so that: if an r-graph of order n has, for some d, the following 
properties: 
(i) A(#) <Ad, 
(ii) (1 — £)d < deg,,(v) < (1 + €)d holds for all but at most én nodes, 
(iii) any pair of different nodes lies in less than &d edges, 
then p(#)< (14 e)nir. 


Corollary 4.28 (Frankl and Rodl 1985). Suppose P is a fixed r-graph of size M 
and N tends to infinity. Then 


v(P | Ky) = (1 of 1))(¥)/M 


For further information and applications, see chapter 33 and Fiiredi (1988). 
Perfect decompositions of K}, are treated in chapter 14, see also Theorems 5.28 
and 5.29 and Corollaries 5.30—5.32. 

Returning to general r-graphs, we now compare r* with v. The trivial inequality 
7* <rv is not sharp: as shown by the hypergraph formed with disjoint copies of a 
projective plane of order r—1 (lines have r points), the ratio +*/y can reach 
r—1+1/r (if such a plane exists). Actually, that is precisely the extremal case. 


Theorem 4.29 (Furedi 1981). Let # have rank r=2. Then 
ert] 
(i) *(#) = »(H#) <<" t) 


holds with equality if and only if 2 consists of v(#) disjoint copies of a projective 
plane Y, of order r—1. More precisely, if r=3 and if # does not contain p +1 
copies of a P., then we have 


(ii) -7*() = (9) < (7-1) (00) +2 


Theorem 4.29 answered questions which had been considered at their time 
difficult; the case of intersecting regular r-graphs had been solved by Lovasz 
(197Sb) and the case of regular graphs by Béllobas and Eldridge (chapter 23). 
Nevertheless, Fiiredi’s proof (mainly by induction on v) is surprisingly simple. We 
show below the (crucial) case v = 1. 


Proof (for v = 1). Let # be an intersecting r-graph with nodes and m vertices. 
We prove r*<r— 1. By Lemma 4.24 we may suppose m <n. 
Case 1. There exists an x, © V(#) with degree k <r. Set H(x)) =(E,,-.., Ex) 


and 
0 ifx =x, , 
(x)=41 
@) % 40%) ifx#xXx,, 


where d,(x) stands for the degree of x in #(x,). As easily seen, ¢ is a fractional 
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blocking. Hence 
1 1 
T(H)<Z > a(x)=Elr-Wk=r-1. 
x¥Xy 


Case 2. # has minimum degree 6 =r. We have 


bn< > d(x)=rm<m, 
xEV 


hence # is regular and m equals n. By (4.19), 7* = m/r. We prove m= r’—r. Let 
E, € &. Since # is r-regular, we obtain 


m#)<1+r(r-1), 


with equality only if |EM £,|=1 for any other edge E. Hence m <r? —r holds 
except when % is a linear intersecting r-graph with m=n=r°—r-+1, these 
requirements force # to be a projective plane. O 

Fiiredi’s theorem was generalized in two directions. 


Theorem 4.30 (Frankl and Flredi 1986). Let # be a (2-wise) s-intersecting 
hypergraph of rank r 22. Then, either 


: * r-1 1 = r-s 

(i) m*< 7 Weds? 

i -1 1 
(ii)  # is a symmetric (r, s)-design and r* = ae 


r 


It is conjectured that Theorem 4.30 still holds if (i) is changed to r* <(r — 1)/s. 
The next theorem concerns the fractional matching polyhedron FM associated 
with a hypergraph, as defined by (3.1), and the (integral) matching polyhedron 
IM which is the convex hull of the set of integral vectors in FM. 


Theorem 4,31 (Furedi et al. 1993, Fiiredi 1988). (a) Any hypergraph # admits a 
matching Al C # such that 


rHH#H)< D> (Ge 1 +i) 


LEK 


(b) Suppose # is r-uniform; if I and F are the matching polytope and the 
fractional matching polytope, respectively, then 


we 


1 
rMc(r—1+=)IM. 


Using a different approach, Aharoni et al. (1985) obtained the following 
powerful inequality involving v and v*. 
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Theorem 4.32. Suppose hypergraph 3 has corank h. Then we have: 


@y ot? 


n—(h—-1)(vtl'/m” on 


pv =f(v*,m,n)= 
There are infinitely many cases where equality holds. 


This theorem is effective in the sense that we can find a matching of size 
f(v*, m,n) in time polynomial in m,n. 


Proof (Sketch). One starts with the following straightforward inequality: 


p'M'Mp<n, (4.32a) 


where p is an optimal fractional matching. The matrix M'M is estimated as 
follows: 


M'M 2diag(|E,|-1)+J-B, (4.32b) 


where J is the m  m matrix with all entries equal 1 and B is the adjacency matrix 
of the graph G =L(#1). Consider separately the three terms in (4.32b). Then 


p'Jp=(v*). (4.32c) 
The Cauchy—Schwartz inequality, joined to the fact that |E,|<h, yields 
p' diag(|E,|—1)p>=(h-1)(*)’. (4.32d) 


To estimate the term p' Bp, remark that cliques of G correspond to matchings 
of # and apply to this graph an idea developed by Motzkin and Strauss in their 
proof of Turadn’s Theorem (see chapter 23): The maximum of x' A(G)x, subject to 
conditions x =0 and Y; x,=1, is achieved when x is the incidence vector of a 
largest clique of G. One obtains 


p Bp< (1 -=)ery. (4.32e) 


The inequality of Theorem 4.32 follows. O 


4.5. Blocking number versus matching number 


Of course, we may compare t with » by combining inequalities that involves the 
fractional optima. But direct comparisons are better in most cases. A possible 
method consists in keeping one of the parameters 7 and » fixed, then estimating 
Separately the other one. For instance, a greedy heuristic to find a k-matching 
yields: 
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Theorem 4.33 (Lovasz 1977, 1979). For every hypergraph of rank r we have 
ry, &=kr+(k-1(r-1). 


It follows that, if # is assumed to be ¢-wise intersecting, we have 


—1 
(30) <<; + 1. (4.34) 


See chapter 24 for further results of that kind. 
Following the lines of Theorem 4.32, Aharoni et al. (1985) made a direct 
comparison between 7 and v. 


Theorem 4.35. Set = eVnyv (with e = 2.71828. . .). If # has at least ef edges, then 


7<min(n, 3é Vlog n/é) , 


moreover, this bound is, up to a constant factor, best possible whenever m > en. If 
# has less than e& edges, then + can be greater than m/5. 


Proof (Sketch). The theorem is effective in that a blocking of the required 
cardinality is exhibited in polynomial time. Three arguments are involved. 

(1) If ‘‘small” edges exist, one considers a maximum matching among them; 
the set of nodes in the union meets all small edges. 

(2) The generic step of the construction is a greedy one (as in (4.25)): one 
picks a node of maximum degree, remove it and the edges containing it. 

(3) Towards the end of the process, one estimates the number of steps by 
saying that it does not exceed the number of remaining edges. OU 


The comparison of blocking numbers with matching numbers in multipartite 
hypergraphs has received the most attention, since extensions of the K6nig 
theorem for bipartite graphs are eagerly expected. The following striking 
conjecture of Ryser (1970, unpublished) would be a remarkable generalization of 
the K6nig—Egervary theorem on bipartite graphs. 


Conjecture 4.36. Every r-partite r-graph satisfies 7 <(r — 1)». 


Let p.(#) denote the minimum number of pairwise disjoint edges and 1- 
element sets whose union covers V(#). Lehel (1982) showed that Conjecture 4.36 
may be restated as follows. 


wv 


Conjecture 4.37. Every r-partite r-graph satisfies py < a. 


A weaker form of Conjecture 4.36 was independently stated stated by Lovas, 
(1975b). A stronger form is due to Meyniel (1984, unpublished): 
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Conjecture 4.38. If #, and every subhypergraph of #, is k-colorable, then 
T(#) <(k — 1)v(#). 


Very little is known on Ryser’s conjecture. Szemerédi and Tuza settled some 
particular cases: r = 3, » <4, 7 =4, »S2;r=5, v=1. For r= 3, the current best 
bound is 7 <8». See Tuza (1987). 

Gyarfas proved the “fractional version”: 7*<(r—1)y. In fact, this is a 
corollary of Theorem 4.29. Ryser’s conjecture would imply 7 = (r — 1)7*. Actual- 
ly, a much stronger statement holds. 


Theorem 4.39 (Lovasz 197Sb). For any r-partite r-graph, t <irr*. 

The Helly property (2.16)—(2.17) suggests another natural direction towards 
interesting relationships between 7 and v, for a hypergraph 2% has the Helly 
property if and only if implication »(#')=1 > 7(#')=1 holds true for every 
x’ C 2. A class of hypergraphs is said to be r-bounded if there exists a “binding 
function” f(x) such that 7(#)<f(~(#)) for every # in the family. Two 
interesting classes of hypergraphs, denoted here by ©, and %,, were considered 
by Gyarfas and Lehel (1983). 

(a) HEQ, iff H has the Helly property and its line-graph has no induced 
subgraph isomorphic to the k-dimensional octahedron graph Q, (obtained from 
the complete graph K,, by deleting k independent edges). 

(b) # € 8, iff there exists a tree T on V(#) such that every edge of # induces 
a subforest of 7 with at most & components. 


Theorem 4.40. Classes 0, and %, are r-bounded. 


Surprisingly, the presence of octahedron graphs in statements of that kind is, in 
some sense, unavoidable. If 8 denotes the Helly class associated to “Berge 
graphs” (in a Berge graph, neither odd polygons nor their complements arise as 
induced subgraphs), the Strong Perfect Graph Conjecture (see chapter 4) would 
imply that members of 8 are normal hypergraphs (hence +=», by Theorem 
3.19). But it is not even known whether 8 is 7-bounded. 


4.6. Other estimates of blocking and matching numbers 


Most upperbounds for blocking numbers and lower bounds for matching numbers 
are derived from the preceding results. For instance, the following inequalities on 
the matching number are trivial consequences of Firedi’s Theorem 4.29: 


2p t] 
(i) (9) << a(oeyo(9¢) if 9€ has rank r, 
(iil) = n(#) s (r? —r+1)v(#) if # is regular r-uniform . (4.41) 


Equalities are possible only for disjoint copies of projective planes. 
The matching problem appears to be more easily handled than the covering 
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problem. For graphs, a min-max formula and a fast algorithm for the matching 
number are known (chapter 3). For uniform hypergraphs, some interesting lower 
bounds are obtained by counting arguments. For r-graphs of order n, the 
inequality 


nee) uy 


holds for sufficiently large n (Hajnal and Rotschild 1973, Lovasz 1979). The case 
v=1 of (4.42) is the celebrated Erdés-Chao-Ko-Rado theorem (chapter 24). 
For linear hypergraphs more can be said: 


Theorem 4.43 (Seymour 1982). Suppose 2 is a linear hypergraph without multiple 
loops; then 


y2—, 
n 

The case vy = 1 is the famous De Bruijn—Erd6s—Ryser inequality for intersect- 
ing linear set-systems (chapter 14). Compare with Theorem 5.12. It had been 
conjectured that the inequality m <x holds for any intersecting r-graph which is 
maximal in the following sense: the addition of any new edge (with r elements) 
causes the matching number to increase. This is however contradicted by a 
construction due to Blokhuis, see Fiiredi (1988). The construction was improved 
by Boros, Fiiredi and Kahn who exhibited a maximal intersecting (g¢ + 1)-graph 
of size ig’ + O(q) for any prime power q, q =5S (mod 6), g > 20. 

To obtain bounds in the other direction (upper estimates of v, lower estimates 
of 7) we may look for a small part of the hypergraph which forces the matching 
number to be small or which shows that the blocking number is big. That is the 
aim of the notions of v-critical or 7-critical hypergraphs. 

A hypergraph # is v-critical when any substitution of an edge by a smaller new 
one causes the matching number to increase. A hypergraph % is r-critical 
whenever 7(#\E) <7(#) for every edge EZ. Main problems about such structures 
are of extremal nature and we refer to chapter 24 and Fiiredi (1988) for a detailed 
account on the subject. We just mention here a few basic methods and results. 

The investigation of r-critical graphs was initiated by Erdés and Gallai in 1961. 
Lovasz (Berge and Ray-Chaudhuri 1974) observed that in a 7-critical hypergraph 
we have strict inequality 7* <7. 

Given an integer r, there are only a finite number of 7-critical r-graphs; An 
upper bound on the size of a r-critical hypergraph is given by the following result. 


Theorem 4.44 (Bollobds 1963; Jaeger and Payan 1971, see Berge 1989, Lovasz 
1979). A t-critical r-graph has at most ("%25 71) edges. 


For a proof and extensions, see chapter 24. 
Rather good estimates of the maximum number of nodes of a +-critical 
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hypergraph with given blocking number are derived from the next lemma 
originated in Gallai’s work. 


Lemma 4.45 (Gyarfas et al. 1982). if S is a strongly stable set in a 7-critical 
hypergraph H, then |{(S)| >= |S| + d(x) — 1, where I(S) is the set of all subsets 
E-s for EEX,sES. 


In the case of graphs this was proved by Lovasz and by Surdnyi. Recently, Tuza 
determined the maximum order of a r-critical r-graph, apart from a constant 
factor. Notice that determining the minimum size of a t-critical r-graph on n 
nodes is precisely Turdn’s Problem. 

It is also true that for a given r, there are only a finite number of v-critical 
r-graphs. The earliest result is due to Calczinska—Karlowicz who proved in 1964 
the existence of some function f(r) such that, for every intersecting hypergraph of 
rank r, there is a set S with no more than f(r) elements which induces still an 
intersecting subhypergraph. With other terms, the function f(r) is the maximum 
order of a v-critical intersecting r-graph. The current best bounds for arbitrary v 
are due to Tuza who found astonishing connections with bounds on intersecting 
set—pair systems. 


5. Coloring problems 


We already encountered in section 3 some important properties relative to 
hypergraph node-colorings (bicolorability) or to edge-colorings (the ‘‘edge-color- 
ing property’). We now turn to a more comprehensive approach of those 
hypergraph coloring problems. This field has various applications (timetabling 
and scheduling problems, planning of experiments, multi-user source coding, . . .) 
and offers rich connections with other combinatorial areas: probabilistic methods, 
Extremal Set Theory, Ramsey Theory, Discrepancy Theory, etc. 

When speaking about hypergraph node-coloring problems, loops play no role 
(hence they are tacitly deleted). The chromatic number y(#) of a hypergraph 3 
is the smallest number of colors needed to color the nodes so that no mono- 
chrome edge occurs. # is k-colorable (respectively k-chromatic) if y(#)<k 
(x(#) =k); if # is k-chromatic but any edge-removal causes the chromatic 
number to drop, # is said to be k-chromatic critical (or k-critical for short). These 
concepts are close analogues of graph theoretical concepts (chapter 4) but they 
differ in spirit; frequently, they have been at the origin of new ideas or methods. 

Any assignment of “colors” to the edges of a hypergraph & is referred to as an 
edge-coloring. An edge-coloring is proper if any two intersecting edges receive 
different colors. The chromatic index of # [i.e., the least number of colors used 
in a proper edge-coloring, denoted by y(#)] coincides with the usual chromatic 
number of the line-graph L(3¢). Parameter y(#) might be thought more relevant 
to Graph Theory. In fact, important results and conjectures, related to the 
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“edge-coloring property” (i.e., y(#)=A(#H)) refer fundamentally to the hy- 
pergraph structure; we close the section with them. 


5.1. Some instances of hypergraph coloring problems 


Some examples enlighten connections and contrasts with Graph Theory; others 
show how hypergraph coloring problems arise in various domains. 


(5.1) The 4-color problem and hypergraph bicoloring. Associate with any graph 
G a hypergraph # as follows: nodes of # correspond to edges of G; edges of # 
correspond to odd cycles of G. Then, G is 4-colorable if and only if # is 
2-colorable (Stein, and Woodall 1972). 

(5.2) The bicolorability problem for hypergraphs is NP-complete (cf. chapter 29). 
Let F be a boolean formula in conjuctive normal form with V, as set of variables 
and €, as set of clauses. Define a hypergraph # = (V, @) as follows: 


= {x|xEV,} U (z[xEV,} U{#}, 


where # is a new symbol. For each clause C in €; let e(C) be the set containing 
# and every variable appearing in C, taking with its sign (thus, for example, if 
C=x,v Xx, then e(C) = {#,x,, x,}). Set 


6 = {e(C)|CE4,) UL{x, XH} |x EV,}. 


Then F is satisfiable if and only if # is 2-colorable (Linial and Tarsi 1985). 
(5.3) Hypergraphs do not jump. A real A,0<A <1, is a jump for an integer r > 2 
if, for every positive «€ and every integer p>r, any r-graph with order n> 
ny(e, p) and with size not less than (A+ .«)(") contains necessarily a partial 
r-graph of order p with at least (A + c)(%) edges, where c is a positive constant 
depending only on A. Graphs “jump”, i.e., for r=2 every A is a jump; an 
explanation for this phenomenon is that, for any 2-graph G, the ratio a(G | K>)/ 
n? {where a(G| K*) denotes the maximum number of edge-disjoint copies of G in 
K*) asymptotically tends to 


2xo=1) 


Frankl and Rédl (1984) proved that hypergraphs do not jump: for r = 3, & = 2r, 
the number (k’~' — 1)/k’~' is not a jump. For details and connections with the 
Erd6s-Stone density theorem and Turdn’s Problem, the reader is referred to 
chapters 23 and 24. 

(5.4) Ramsey graphs. Roughly speaking, a Ramsey graph is a graph with neither 
a large clique nor a large independent set. Let R,(p) denote the smallest integer 
n such that, for any partition of the edges of K’ into k parts (the ‘“‘colors’’), some 
part contains a (monochrome) copy of Kj. To estimate Ry Pp) amounts to 
estimating the chromatic number of the factor hypergraph Kj, | K’ (where nodes 
are the r-subsets of an n-set and edges correspond to copies of Kj): we have 
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Ri(p) > n iff x(K; | K’,) =k. Lower bounds for Rj, thus follow from upper bounds 
on the chromatic numbers of hypergraphs; for k=2, the existence of large 
Ramsey graphs follows from the bicolorability of 7-graphs with few edges. 
Actually, numerous results in Ramsey Theory use hypergraphs. Since the 
pioneering work of Erdos on Ramsey graphs, existence theorems and estimates 
are frequently obtained by probabilistic methods: see eq. (5.16), Theorem 5.17, 
Corollary 5.18 and eq. (5.19) and chapter 25. 

(5.5) Consider a p X q-matrix M. Erd6s and Rado asked in 1956 for the largest 
integer & such that, for every k-coloring of the entries of M, there exists an r Xs 
submatrix all of whose entries have the same color (see chapter 25). Again, this is 
a Ramsey-type problem, related to Zarankiewicz’s Problem (see chapter 23). The 
required number is y(K, ® K{) — 1 whose precise value is unknown. Berge and 
Simonovits (1974), see Berge (1989), proved that K,@Kj has maximum 
chromatic number among products of a p-colorable r-graph by a q-colorable 
r-graph. 

(5.6) In a positional game (such as HEX) two players, say Red and Blue, mark 
alternatively the vertices of a hypergraph #. The first who obtains a monochrome 
edge of his own color is the winner, Berge (1989). The possibility of a draw 
depends upon the existence of a uniform bicoloring (the numbers of red and blue 
nodes differ by at most 1). Erdds and Selfridge (1973) showed that the second 
player has a strategy warranting a draw if # satisfies the inequality 
m(#) + A(#) <2", where h is the minimum cardinality of an edge; compare with 
(5.16). 

(5.7) Let st(#) denote the family of stable sets of a hypergraph # (with no 
loops). Then y(#) = p(st(#)). For instance, if @,, and %,, denote the family of 
circuits and of independent sets of a matroid M(E), respectively, we have, by the 
Edmonds—Nash-Williams theorem on unions of matroids (see chapter 9), 


xX(€y,) = PCF) = max([|A|/rank,,(A)]; 84 ACE). 


5.2. Bicolorable hypergraphs 


What properties of bipartite graphs do bicolorable hypergraphs share? We already 
noticed that excluding odd polygons as induced subhypergraphs insures 
bicolorability (Theorem 3.5) on balanced hypergraphs). The problem, raised by 
Lovasz, of deciding whether normal hypergraphs (see section 3.5) are bicolorable 
or not was answered into the affirmative by the following deep result, due to 
Fournier and Las Vergnas (1972, 1974), which gives some precise information on 
the obstructions to bicolorability. 


Theorem 5.8. Suppose a hypergraph # does not contain any odd cycle 
E,,...,Ey94,,£, with the following properties: 

(a) |E,NE,,,)=1 fori=1,...,2¢, 

(b) \Eo ast nN E,| = 1, 

(c) E,NE,NE, =@ for every distinct indices i, j, k. 
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Then H is bicolorable. 


The hypotheses of this theorem are satisfied by normal hypergraphs and more 
generally by hypergraphs having no odd hypercycle (also named “‘pseudobalanced 
hypergraphs’, Berge 1989). 


Proof. The idea is to perform color interchanges on disjoint subsets until either a 
bicoloring or an odd cycle of the forbidden type is obtained. We suppose # is a 
3-critical hypergraph. We choose an edge E, and a vertex b,€E,. By the 
hypotheses, 2#\E, admits a proper 2-coloring (X,, Y,) and we may suppose 
E,c X,. Setting B, = {by}, we form a sequence of triples (X,, Y,, B,) for i= 
0,1,..., p, as long as possible, with the following properties (let us recall that 
HA] denotes the trace of # on a subset ACV(#H), formed with #-edges 
included in A): 


(i) (X,, Y,) is a partition of V(#); 
(ii) X,=Y,_, UB,_, and Y, = X,_, — B,_, (for (> 1); 
(iii) Y, is a stable set of #; 
(iv) B, is a minimal blocking set of #X,] ; 
(v) B, CX, for i even, and B, C Y, for i odd; 
(vi) By, B,,..., B; are pairwise disjoint. 


All conditions are satisfied by (X,, Y,, B,). No X; is stable, otherwise (X;,, Y;) 
would be a 2-coloring. Hence B, @ and condition (vi) insures the existence of a 
largest p. Conditions (5.9) imply: 


(vii) X;, = X,—- By UB,-:-—B,_,UB,_, for i even; 
(viii) X, = Y, UB, — B,---—B,_,UB,_, for i odd; 
(ix) X, X,,, = B,. 


By the definition of a minimal blocking set, for any 6, € B, some edge E, of 
HX,] is such that E,O B, = {b,}. Since E,C X,=Y,_,UB,_, there is a vertex 
b,-, in E,NB,_,. Again, some edge E,_, of #[X,_,] is such that F,_,B,_, = 
{b,_,}. Hence by (ix) we have E,N E,_, = {b;_,}. Iterating the argument, we 
obtain a sequence 

b,E,b; 


erik 


E;_,b;_, ieee bE, bE, 


such that |E,;  E,,,|=1 for i=j>1. Such a sequence is named a cascade. The 
following point is crucial: 


Lemma 5.10. Edges of any cascade are distinct. Moreover, two edges of a cascade 
are disjoint when their indices Have the same parity. 


It is not difficult to see that if some cascade contradicts this lemma, then it 
contains the edges of an odd cycle with properties (a)—(c). Hence, to finish the 
proof of Theorem (5.8), it suffices to obtain a contradiction when assuming 


Hypergraphs 421 


Lemma (5.10). The contradiction follows from the construction of a new triple 
(X,+19 Yoav By+1) that satisfies (5.9). 

We put X,,,= Y,UB, and Y,,, =X, — B,. By (iii), the set Y,,, is stable, 
hence the trace LX sail is not empty. 
—If p is odd, every edge of #[X,,,]] meets X, (since Y, is stable), hence X, 
contains a minimal blocking set Be, of 7% all By (v), By «1 is disjoint from 
B,,B,,...,B,. Since B,,, CX,,, =X,_, — B,, the set B,,, is also disjoint from 
B,,By,.-- Be ,- Then, conditions (5.9) are Pied 
- If p is even, we have E, B, = @ (otherwise, any vertex b,E EN B, would be 
the beginning of a cascade with E,0E, #0, a contradiction with Lemma 5. 10). 
Hence E, ZX p+ (since X, pti =y, UB,, and Y, is stable). Since Ey is the only 
edge of # to ‘be contained in Xx, every edge ‘of HX, 4,] meets Y,, hence Y, 
contains a minimal blocking set B,,, of #1X,,,]- As for | P odd, we check that the 
sets X41, Y,41, 8,4, satisfy (5.9). O 


An interesting generalization has been conjectured by Sterboul in 1972 
(unpublished). 


Conjecture 5.11. A nonbicolorable Nypereraph contains an odd hypercycle 
E,,...,E,,, such that |E,N E,,,|=1 fori=1,...,2k. 


As implied by Theorem 5.8 the conjecture holds for 3-graphs. 

We shall see later on some sufficient conditions for bicolorability as specializa- 
tions of general upper bounds on the chromatic number. Let us mention here 
some other results on bicolorable hypergraphs which mainly rely on the existence 
of a matching in some associated bipartite graph. 

Lovasz (1979) observed in 1967 that a hypergraph whose every partial 
subhypergraph has fewer edges than nodes is bicolorable. This proposition, which 
is easily derived from Theorem 2.5, led Khatchatryan (1982) to give a general 
upper bound on y(#) in terms of the parameter maxy.cy(mn(#’') — kn(#’)), 
where k is any fixed integer. Woodall (Berge 1989) exhibited a 3-chromatic 
hypergraph # such that m(#H) = n(9#) but m(#’) < n(H') holds for every #’ C 
#. Seymour (1974) gave a characterization of 3-chromatic critical hypergraphs 
such that m(#) =n(#) and strengthened Lovasz’s result as follows. 


Theorem 5.12. Every 3-critical hypergraph satisfies m(#) = n(#). Moreover, the 
bipartite incidence graph admits a node-to-edge matching. 


Seymour’s proof is by linear algebra (see chapter 33 and compare with 
Theorem 5.26). Aharoni and Linial (1986) extended Theorem 5.12 to the infinite 
case and gave a purely combinatorial proof, using matchings and transversals. 
Theorem 4.15 of Lehel (1982) is obtained with similar tools and has the following 
corollary (which extends the Kénig-Egervary theorem on bipartite graphs to 
bicolorable hypergraphs): 
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Theorem 5.13. Every bicolorable hypergraph # satisfies p(#) =< a(#H). 


5.3. General upper bounds 


A simple examination of the greedy coloring algorithms leads to the following 
inequality, as observed in 1968 by Tomescu and Lovasz, see Berge (1973, 1989): 


xX(#) SAH) +1, (5.14) 


where A, denotes the maximum linear degree, i.e., the maximum size of a linear 
star. Inequality (5.14) has various applications (see Berge 1989); for instance, it 
leads to estimates of the stability number (such as (4.13)), in conjunction with the 
following trivial inequalities: 


M(H )a(H) Bn, X(H) + a(#)<nt+1. (5.15) 


Brooks’s graph-coloring theorem (see chapter 4) can be extended to linear 
hypergraphs: a connected linear hypergraph # forces equality in (5.14) if and 
only if # is an odd polygon or a complete graph (Lepp and Gardner 1973). No 
exact analogue is known for gencral hypergraphs: equality in (5.14) is obtained 
with complete r-graphs but also with nontrivial structures (see, e.g., Lovasz 1968, 
Berge 1973, p. 145). 

For uniform hypergraphs, substantially stronger results have been obtained by a 
probabilistic approach of the following fundamental question: estimate m,(r), the 
minimum size of a non k-colorable r-graph. 

The basic result amounts to Erdés (1964), see Erdés and Spencer (1974) and 
chapter 33. Let us color randomly the nodes of an r-graph #, using & colors (each 
color has probability 1/k to be assigned to any given node). If # has no more 
than k’~' edges, then, with positive probability, a proper k-coloring is obtained. 
Hence a proper k-coloring does exist! With other words, we have 


m(r2k'.” (5.16) 


With each edge FE may be associated the event ‘‘E is monochrome”. To disjoint 
edges correspond independent events, hence the line-graph L(#) is the depen- 
dency graph for those events. Applying Lovasz’s “local lemma” (see chapter 33), 
we obtain (with e = 2.71828. . .): 


Theorem 5.17 (Erd6s and Lovasz 1975). A non k-colorable r-graph # must 
contain an edge which meets at least k’"'/e other edges. Moreover, if # is linear, it 
contains at least k"~”/e(r — 1) nodes with degree at least k’~*/e(r — 1) and contains 
at least k’*/er(r — 1) pairwise disjoint edges. 


Corollary 5.18. For any r-graph, x(#) <(erA)'”. 


The lower bound on m,(r) was improved by Beck (1978) [for a simple proof, 


Hypergraphs 423 


see Spencer (1981) and chapter 33}: 


-1,143 
aot 2 


m,(r) = ck for some constant c . (5.19) 


An upper bound for m,(r) can be obtained as follows. For every k-coloring I 
of a given set S, let us denote by %,. the collection of all monochrome r-subsets of 
5. Then an r-graph (5, €) has no proper k-coloring if and only if @ meets every 
€,. Applying inequality (4.25) on blocking numbers, we obtain (Erdés and 
Schmidt 1964): 

m,(r) er (log k)k? . (5.20) 


Other interesting estimates (weaker, in most cases, than those above) have 
been given. 3 
Theorem 5.21. 

rk’ . —k+ ‘\ 


Pekka) KO r 


The upper bound is due to Herzog and Schénheim (1972). The lower bound 
(Johnson 1976) can be deduced from a more involved formula due to Hansen and 
Lorea (1978), see Berge (1989): 


Theorem 5.22. The following inequality implies the k-colorability of #: 


2 
> ka + x “). > kl, 
EEwr 


[E[ / ex 


5.4. Lower bounds via topology 


To obtain lower estimates of chromatic numbers a powerful topological method 
was initiated in 1978 by Lovdsz. To any r-graph #, associate a simplicial complex 
C(#) as follows: the vertices of C(#) are all the m!m(#) ordered r-tuples 
(v,,¥,,...,0,) of vertices of #, where {v,,...,v,}E#. A set of vertices 
(v;,.-.,0;);e, Of C(#) forms a face if there is a complete r-partite subhyper- 
graph of # on the (pairwise disjoint) sets of vertices V,,..., V, such that vu; EV, 
for all iG J and 1<js <r, 

Recall that for s=0 a topological space T is s-connected if for all O<p<s 
every continuous mapping from the p-dimensional sphere S’ into T can be 
extended to a continuous mapping from the (p + 1)-dimensional ball B” + with 
boundary S$’, into 7. 


Theorem 5.23 (Alon et al. 1986). For any r-graph H, where r is a prime, we have 
X(H) > k if CCH) is (kK —1)(r — 1) — 1-connected. 


This theorem extends a previous result of Lovasz (case r = 2) and is conjectured 
to hold for every positive integer r. A (far nontrivial) consequence of Theorem 
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5.23 is the solution of a conjecture of Erdds that generalized Kneser’s famous 
conjecture (see below and chapter 4). As a matter of fact, a self contained 3-page 
proof of a stronger result was recently obtained by a different use of simplicial 
complexes: 


Theorem 5.24 (Sarkaria 1990). Let n,h, j,k, r be positive integers. If n(j —1)= 
(k —1)(r - 1) + rh, then for every k-coloring of the h-subsets of an n-set there is at 
least one r-tuple of h-sets having the same color such that any j of them have empty 
intersection. 


Erdés’s conjecture was the case j = 2, while Kneser’s was the subcase j = 2, r= 
2. The r-graphs whose nodes are the h-subsets of a given set and whose edges 
correspond to r-tuples of pairwise disjoint h-subsets are often referred to as 
Kneser hypergraphs. The key in the proof of Theorem 5.24 — as in every proof of 
the special cases mentioned above - is a variant of the Borsuk—Ulam theorem (on 
the existence of pairs of antipodal points in some member of an arbitrary covering 
of S’ by p+1 open sets, see chapters 4 and 34). Indeed, following Sarkaria, 
Theorem 5.24 can be seen itself as a ““Borsuk—Ulam result”. 


5.5. Highly chromatic hypergraphs 


As we have seen above, to prove that a chromatic number is large is by no means 
easy. The question is widely related to Ramsey theory. As indicated by (5.4), any 
Ramsey type result implies that the chromatic number of a certain hypergraph is 
large. For instance the r-dimensional subspaces of the affine space GF(q)” form 
the edges of a hypergraph with large chromatic number if 7 is sufficiently large. 
Most of the constructive or existential proofs of Ramsey Theory rely on highly 
chromatic graphs or hypergraphs. Of course, for estimates on Ramsey numbers, it 
is important to “‘minimize” the construction, i.e., to obtain k-chromatic critical 
hypergraphs with as few edges as possible (see section 5.6). 

Lovasz (1968), and then Ne%etril and Rédl (1979), gave a constructive proof of 
the following important result of Erdés and Hajnal (1966), see Berge (1989). 


Theorem 5.25. For all integers r,k,s with r=2, there exists an r-graph H with 
x(#) = k in which no cycle is shorter than p (i.e., the girth is at least p). 


Interestingly, the earliest constructive proof of this theorem uses a double 
induction on p, k: the construction of a graph of girth p with chromatic number 
2k depends on a previous construction of r-graphs (with r large) of girth p — 1. 

From Theorem 5.25 can be derived for instance the following result: given a 
graph G of order n and a posite integer k, there exists a graph R(G) with the 
same clique number as G such that for every k-coloring of the nodes of R(G), 
some monochrome induced copy of G occurs (see chapter 25). 

Highly chromatic Steiner systems are known (see chapter 14). For highly 
chromatic infinite hypergraphs, see chapter 42. 
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5.6. Chromatic critical hypergraphs 


Let us recall that a hypergraph # is k-chromatic critical, or k-critical for short, 
when y(#) =k and removing any edge causes the chromatic number to drop. As 
mentioned above with regard to Theorem 5.23, constructions of y-critical r- 
graphs often use r’-graphs with r' > r: a consequence of various constructions of 
this type (cf. chapter 4) is the existence of y-critical r-graphs with arbitrarily large 
minimum linear degree, for y = 4 and r = 2 (Toft, see chapter 4). Moreover, in 
contrast with the case of graphs, Muller, Rodl and Turzik exhibited, for any 
integers r=3 and A, a 3-critical r-graph with all linear-degrees at least h and 
prove the existence of a linear hypergraph with the same properties. Notice that a 
k-critical r-graph on n vertices exists for n>(k—1)(r—1)+1 (Abbott and 
Hanson for r= 3, Toft (op. cit.) for r= 4). 

By use of linear algebra, Lovasz found an upper bound for the size of 3-critical 
r-graphs: 


Theorem 5.26. For any 3-critical r-graph 3 on n vertices, we have 
n 
meee) (,",). 
This bound has the correct order of magnitude, as justified by Toft’s construc- 
tions. 


There are only a finite number of 3-critical intersecting r-graphs. Bounds on 
their maximum size M(r) were given by Erdés and Lovasz (1975): 


Ls 
s=1 


The lower bound is conjectured to be exact. 


r! 
st 


= [(e-1)r!| <M) <r’. (5.27) 


5.7. Edge-colorings 


The chromatic index y(#) of a hypergraph # cannot be less than the largest size 
of an intersecting subhypergraph and a fortiori cannot be less than the maximum 
degree A(#). The “edge-coloring property”, ie., y(#)=A(#), holds for 
bipartite graphs (K6nig’s theorem) and for some much more general classes of 
hypergraphs described in section 3. 

For regular hypergraphs, to find a proper edge-coloring in 4(#) colors amounts 
to finding a decomposition of the edge set into perfect matchings, i.e., a 
“1-factorization”’. Factorization problems appear in the theory of block designs (a 
1-factorization of K}, is a solution of the famous “Fifteen Schoolgirls Problem” of 
Kirkman: see chapter 14) and in Discrepancy Theory (see chapter 26). As shown 
by Pippenger and Spencer (1989), near 1-factorizations exist for near regular 
hypergraphs (compare with Theorem 4.27): 


Theorem 5.28. Let # be an r-graph of order n such that any node belongs to 
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d(1+0(1)) edges and any two different nodes are in o(d) edges (r is fixed, 
d=d(n)—>~). Then 

(i) y(3#)-d(1 + 0(1)), and 

(ii) #% edges can be partitioned into d(1 + o(1)) edge covers. 


A major contribution to factorization problems was given by Baranyi who 
showed how network flows and “integer making lemmas” can be used to find the 
chromatic index of complete hypergraphs Kj, and of their hereditary closures [see 
chapter 26 and Berge (1989) for more information on Baranyai’s method]. 
Haggkvist and Hellgren recently developed a purely combinatorial approach 
unifying Baranyai’s results with natural hypergraph extensions of KGnig’s edge- 
coloring theorem. They obtained a far-reaching generalization to hypergraphs of a 
theorem of Ryser on when a proper n-edge-coloring with n colors of the complete 
bipartite graph K, , can be extended to a proper n-edge-coloring of K,, ,. 

Let us be given a complete p-partitioned hypergraph # with partition 
V,,V,,..-,¥,, aS defined in (1.27). The tincture of a subset ECV(#) is the 
p-tuple with components |E  V,| for 1<k <p. Notice that, since # is complete, 
the multiplicity of an edge only depends on its tincture. To any (possibly 
improper) edge-coloring of # corresponds a color-chart A which matches to every 
color c the multiset A(c) formed with the tinctures of the edges colored c; for 
1<k<p, the sum of the kth components of A(c) is denoted by A,(c). The 
following theorem means that a proper edge-coloring of a subhypergraph 39¢’ can 
be extended — a term to be taken in a self-explanatory way — to a proper edge- 
coloring of # with given specifications, provided that each color can be extended 
separately. (Remark the role of the empty edges: in a proper coloring, any empty 
edge can share the color of an other edge.) 


Theorem 5.29 (Haggkvist and Heligren 1993). Let # =(V, (E,),<;) be a complete 
p-partitioned hypergraph with partition V=V,UV,U---UV,. Let V’CV and 
Vi=V'NV,. Let A' denote a proper edge-coloring, with color chart A', of the 
subhypergraph X' =(V',(E,NV'),<,) and assume that A’ extends to some (pos- 
sibly improper) edge-coloring of #, with color chart A. Then, a necessary and 
sufficient condition for X' to be extendable to a proper edge-coloring of X is that, 
for every color c, we have 


Ay(c) ~ Affe) <|V,|—|V{|_ for <k <p. 


This theorem has many interesting consequences, such as completion theorems 
for incomplete Latin squares (Cruse, Hoffman, Andersen, etc. See chapter 14). 
Below are some applications in hypergraph theory. 


Corollary 5.30 (Baranyai 197€). The complete hypergraph Kj, has the edge- 
coloring property if and only if r|n. 


Proof. Choose p = 1 (each edge has tincture 7), V'=@ and color chart A with 
A=(“ }) colors, each color to be used n/r times. O 
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Corollary 5.31 (Baranyai 1979). Let # be a complete hypergraph of order n in 
which every edge of cardinality i has multiplicity m(i). Then # has the edge 
coloring property if and only if there exist positive integers t,, (i=1,2,...,n, j= 
1,2,...) such that 


Dt = m1) and D it, =n. 


Proof. Apply Theorem 5.29 with p=1 (so tincture i occurs exactly m(i)(") 
times), V’ = § and color chart A such that exactly 4, i-edges receive color j. O 


Corollary 5.32 (Generalization of Ryser’s theorem). Let #= K? ag q ve the 


Then every ‘ue cloning of #, where each color occurs q times on be extended to 


all of #. 


It follows from Theorem 5.29 that complete p-partite p-graphs and also their 
hereditary closure have the edge-coloring property (Berge, and Berge and 
Johnson, see Berge 1989). The equality between (#) and A(%#), relative to the 
hereditary closure of a hypergraph 2, is of special interest: it implies that 
Chvatal’s conjecture 2.18 is valid, in a strong sense, for #. Equality holds true 
when # is a star (Berge 1989) and, as Berge observed, is equivalent to the 
famous Vizing theorem (see chapter 4) when # is a graph. 

Attempts to extend Vizing’s theorem have inspired interesting open problems. 


Conjecture 5.33 (Berge 1989). If # is a linear hypergraph, then 
y(H) = ACH) . 


Conjecture 5.34 (Berge 1985, Fiiredi 1986, Meyniel, unpublished). If # is a 
simple linear hypergraph, then 


WH) < ACH.) + 1. 


Conjecture 5.34 would imply the following much publicized problem. 


Conjecture 5.35 (Erdés, Faber and Lovasz, see Erdés 1976). Let 3€ be a linear 
hypergraph # of order n vertices. Then y(#) <n. 


The original formulation of this conjecture asserts that if the edge set of a graph 
is the edge-disjoint of n complete graphs on 7 vertices, then its chromatic number 
equals n [the equivalence was pointed out by Hindman (1981) and Seymour 
(1982)]. Conjecture 5.35 was proved up ton = 10 (Hindman 1981), also in case # 
is a cyclic Steiner system (Colbourn and Colbourn 1982) and in case X# is 
intersecting (Firedi 1986). Berge and Hilton (1990) proved Conjecture 5.34 if the 
edges of # with more than 2 elements are assumed pairwise disjoint; if, im 
addition, no edge has more than 3 elements, Conjecture 5.32 holds. Seymour’s 
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inequality (Theorem 4.43) is a step towards Conjecture 5.33 and suggests the 
following related problems. 


Conjecture 5.36 [(i) Seymour 1982, (ii) Fiiredi 1986]. Let w: #—R* be a 
nonnegative function on the edges of a linear hypergraph #%, then there exists a 
matching  C # such that 


(i) 2 w(E)> 2 w(E)in, 


(ii) x w(E) > > w(E)/(A(9%)) + 1). 


Conjecture 5.37 (Origin unknown). Suppose any partial hypergraph #'C # 
satisfies »(H#') = m(#’)/[A(H#') + 1]. Then y(#) <= A(H#H) + 1. 
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Introduction 


Interest in finite partially ordered sets has been heightened in recent years by a 
steady stream of theorems combining clever ad hoc arguments with powerful tech- 
niques from other areas of mathematics. In this chapter, we present a sampling of 
results exhibiting these characteristics. In those instances where we do not present 
a complete proof, we outline enough of the general contours of the argument to 
allow the reader to supply the missing details with little difficulty. We also outline 
anticipated research directions in the combinatorics of partially ordered sets, and 
we discuss briefly some of the most interesting open problems in this field. 

Since this Handbook contains chapters on Extremal Set Theory and Enumer- 
ation, we have limited our discussion to results on general partially ordered sets. 
Still some difficult choices had to be made concerning results to be included — 
espccially in view of our emphasis on proof techniques. West’s survey articles (West 
1982, 1985) offer more of a catalogue of theorems in the area and have extensive 
bibliographies. Also, we recommend the recent books by Anderson (1987), Fish- 
burn (1986), Stanley (1986), and Trotter (1992) as well as the conference volumes 
(Rival 1982, 1985) for additional material on partially ordered sets and related 
topics. 


1. Notation and terminology 


Formally, a partially ordered set is a pair (X,P) where X is a set, and P is a 
reflexive, antisymmetric, and transitive binary relation on X. The set X is called 
the ground set and P is called a partial order. Throughout this chapter, we use the 
short form poset for a partially ordered set. Many researchers choose to drop the 
adjective “partially” and use ordered sect to mean a poset. A poset (X, P) is finite 
if the ground set X is finite. In this chapter, we will be concerned primarily with 
finite posets. 

In some settings, we find it convenient to use a single symbol such as P to denote 
a poset (X,P). This notation is particularly handy when both the ground set X 
and the partial order P remain fixed. In other settings, especially when we have 
several partial orders on the same ground sct, we will usc the ordered pair notation 
for posets. 

The notations (x,y) € P,xPy,x < yin P, and y > x in P are used interchange- 
ably. The notation x < y in P means x < y in P and x # y. Distinct points x,y 
are comparable when either x < y or y < x in P. Otherwise, we say x and y are 
incomparable and write x||y in P. When using a single symbol like P for a poset, 
we will write x < y in P, x|ly in P, etc. 

A poset P = (X,P) is a chain (also a totally ordered set or a linearly ordered 
set) if each pair of distinct points is comparable. We will use the symbols R, Q, 
2 and N to denote the reals, rationals, integers and positive integers, respectively. 
Each of these posets is a chain. 

Dually, P = (X, P) is an antichain if each pair of distinct points is incomparable. 
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If Y c X and Q is the restriction of P to Y, then the poset Q = (Y, Q) is called 
a subposet of (X,P). A subset Y C X is also called a chain (antichain) if the 
subposet (Y,Q) is a chain (antichain). The height of a poset is the maximum 
cardinality of a chain, and the width is the maximum cardinality of an antichain. 

When P = (X,P) and Q = (Y,Q) are posets, a map f: X — Y is called an 
embedding (of P into Q) if x) < x2 in P <=> f(x,) < f(x.) in Q. An embedding 
f:X — Y is an isomorphism when f(X) = Y. In this chapter, we prefer not to 
distinguish between isomorphic posets and to write P = Q to indicate that the 
two posets are isomorphic. Similarly we say that P is contained in Q (also P is a 
subposet of Q) when there exists an embedding of P in Q. 

We say y covers x in P and write x <: y in P when there is no z for which both 
x <zandz < yin P. The cover graph associated with the poset P = (X, P) is the 
graph G = (X, E) whose edge set F consists of the pairs xy for which x <: y in P. 
A drawing of the cover graph G = (X, £) in the Euclidean plane is called a Hasse 
diagram (or order diagram) of the poset P = (X, P) if x is lower in the plane than 
y whenever x <: y in P. 

Here are some frequently encountered examples of posets. Any family of sets 
is partially ordered by set inclusion; a set of positive integers is partially or- 
dered by division; and a subset of R" is partially ordered by (@),4@2,.-.,4n) < 
(b), b2,...,8n) <== a; <b; in R for é=1,2,...,n. In fig. 1.1, we show particular 
instances of these examples. Each has height 4, and their respective widths are 3, 
2, and 4, 

If P and Q are partial orderyon the same ground set X, Q is called an extension 
of P when P C Q. The partial order Q is called a linear extension of P if Q is an 
extension of P and (X, Q) is a chain. 

When P = (X,P) is a poset, an element x € X is called a maximal (minimal) 
element if there isno y € X for which x < y in P (y < x in P). The set of maximal 
(minimal) elements is denoted MAX(X, P) (MIN(X, P)). The subsets MAX(X, P) 
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and MIN(X, P) always determine antichains, although neither may be as large as 
the width of (X, P). 

When Y Cc X, the set {z € X: y < z in P for every y € Y} is called the set of 
upper bounds for Y. Note that this set may be empty. When the set of upper 
bounds of Y is nonempty and has a least element, this unique point is called the 
least upper bound of Y and is denoted |.u.b.(Y). Dually, the greatest lower bound 
(if it exists) of Y is denoted g.1.b.(Y). 

A poset P = (X,P) is called a lattice when each nonempty subset Y C_X has 
both a least upper bound and a greatest lower bound. When P = (X, P) is a lattice 
and x,y € X, we write x Vy for Lu.b.{x,y} and x A y for g.l.b.{x, y}. The binary 
operations V (join) and A (meet) are commutative and associative. The lattice is 
distributive if x A(y Vz) = (Ay) V(X Az) for all x,y,z EX. 

When P and Q are posets, the disjoint sum of P and Q, denoted P+Q, is 
obtained by taking the union of disjoint copies of the two posets with no compara- 
bilities between the points in one and points in the other. A poset is disconnected 
if it is the disjoint sum of two proper subposets; otherwise it is connected. The 
maximal connected subposets of a disconnected poset are components. 

The cartesian product of P = (X,P) and Q =(Y,Q), denoted P x Q, consists 
of the ordered pairs (x, y) where x € X and y € Y with partial ordering (x,,y1) < 
(x2, y2) <=> X41 < x2 in P and y; < y2 in Q. The cartesian product of n copies of P 
is denoted P”. 

Given posets P = (X,P) and Q=(Y,Q), a function f:X — Y is an order 
preserving (or monotone) map from P to Q if x, < x2 in P = f(x1) < f(x2) in 
Q. The set of all order preserving maps from P to Q is partially ordered by 
fi < fo <> Ai) < fa(x) in OQ for every x € X. This poset is denoted Q”. 

Throughout the chapter, we use k to denote a k-element chaind< 1<2<---< 
k ~ 1. The poset 2” is isomorphic to the set of subsets of an n-element set partially 
ordered by inclusion. A poset P is a distributive lattice if and only if there is a 
poset Q so that P is isomorphic to 2® [see chapter 3 in Birkhoff (1973)]. 

When P= (X,P) is a poset and ¥ = {P, =(Y;,Qx): x € X} is a family of 
posets indexed by the ground set of P, the lexicographic sum of ¥ over P is the 
poset whose ground set is {(x, y): x € X,y € Y,}. The partial ordering is defined 
by (11,1) < (x2, Yo) > (x1 < x2 in P) or (x) = x2 and y; < yo in Q,,). A lexico- 
graphic sum is nontrivial if |X| > 2 and if at least one Y;, satisfies |Y,| 2 2. A poset 
P is decomposable if it is isomorphic to a nontrivial lexicographic sum; otherwise 
P is indecomposable. Note that the disjoint sum of two posets is a lexicographic 
sum over a 2-element antichain. 


2. Dilworth’s theorem and the Greene—Kleitman theorem 


Dilworth’s decomposition theorem (Dilworth 1950) has played an important role 
in motivating research in posets, as evidenced by results discussed in this section 
as well as in sections 3, 6, 7 and 8. Also, Dilworth’s theorem surfaces in a variety 
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of extremal problems (see, for example, Duffus et al. 1991). There are several 
elementary proofs; the one we present is pattcrned after Perles (1963). 


Theorem 2.1. if P= (X,P) is a poset of width n, then there exists a partition 
X =C,UC,U---UC, where each C; is a chain. 


Proof. We proceed by induction on |X| and note that the result is trivial when 
|X| = 1. Assume validity when |X| < k and consider a poset P with |X| =k. We 
may assume that the width n of P is larger than 1. 

Choose x € MAX(P) and y € MIN(P) with y < x. Let Q be the poset obtained 
by removing x and y from P. If the width of Q is less than n, then we can partition 
Q into fewer than n chains which together with the chain {x,y} form a partition 
of X into (at most) 1 chains. So we may assume that Q has width n. Thus y < x 
in P. Choose an n-element antichain A = {a),42,...,a,} in Q. : 

Then let U = {u € X: u 2 a; for some a; € A} and D = {d€ X: d < a; for some 
a, € A}. Evidently x € U — Dandy € D — U. Thus there are chain partitions U = 
Cy, UC, uU---UC, and D = C/ UCJU---U C7. We may label these chains so that 
a, € C/N C! for i=1,2,...,n. Then C; = C/UC7’ is a chain for each i and the 
desired partition is X =C,;UC),U---UG,. D 


In introductory combinatorics texts, Dilworth's theorem is grouped with other 
max-min theorems having a common theme: P. Hall’s marriage theorem, the 
K6nig-Egervary theorem, Menger’s theorem, and the max flow-min cut theorem 
for network flows. This last result most clearly captures the linear programming 
core common to all. (See chapters 2 and 3 by Frank and Pulleyblank for additional 
material.) 

Dilworth’s theorem has a trivial dual version for antichains. 


Theorem 2.2. /f P = (X,P) is a poset of height n, then there exists a partition 
X = A, UA2U---UA, where each A; is an antichain. 


Proof. Set A; = MAX(P). Thereafter set A;,; = MAX(P;) where P; is the sub- 
poset obtained by removing the antichains A;,A2,...,A; from P. 


The first major result in this chapter is an important generalization of Dilworth’s 
chain partitioning theorem due to Greene and Kleitman (1976). The proof we give 
here is patterned after algorithmic proofs given by Saks (1979) and Perfect (1984). 
An alternative proof using network flows is given in this volume in chapter 2. 

We need some preliminary notation and terminology. Let P = (X,P) be a 
poset and k a positive integer. A subset S c X is called a Sperner k-family if 
S does not contain a chain of (k +1)-elements. The maximum cardinality of 
a Sperner k-family is denotea d,(P). When € = {C,,C2,...,C,} is a family of 
chains forming a partition of X, we define e,(@) = )7)_, min{k, |C;|}. If S is any 
Sperner k-family and € = {C), C2,...,C,} is any chain partition of X, we note 
that |S C;| < min{k, |C;|}. Thus [S| < e,(@), so that d,(P) < e,(€@). The chain 
partition © is said to be k-saturated if d,(P) = e,(€). 
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We also need a preliminary lemma whose elementary proof is omitted. Let 4 (P) 
denote the set of all maximum antichains of P. Define a partial order on .4(P) by 
A < B <=> for every a € A, there exists b € B witha < b. 


Lemma 2.3. The set A((P) of maximum antichains of a poset P =(X,P) has a 
unique greatest element. 


With this background, here is the Greene-Kleitman theorem. 


Theorem 2.4, Let P be a poset and k a positive integer. Then there exists a chain 
partition € of P which is simultaneously k-saturated and (k + 1)-saturated, i.e., 
dy (P) = ex (€) and dy.i(P) = ex1(@). 


Proof. We first show that d|(P x k) = d,(P) for every k > 1. Let A be a maximum 
antichain in P x k, and let A; = {x € X: (x,i) € A}. Then each 4; is an antichain 
in P, so the set S = A; UA) U---U A, is a Sperner k-family. Furthermore, \s| = |A| 
since A; MA, = @ when i F j. Thus di (P x k) < d,(P). 

Conversely, let S be a maximum Sperner k-family in P. Partition $ into k 
antichains by setting Ay = MAX(S) and A;,, = MAX(S — (A; UA?) U-+:UA;)). 
Then A = {(a,/): a €A;} is an antichain in P xk with |A| = |S|. This shows 
d\(P xk) > d,(P). Thus d,(P x k) = a, (P). 

For the remainder of the proof, we fix a positive integer k. Then we make several 
definitions concerning chain partitions of P x (k +1). When € = {C),Co,...,C,} 
is a chain partition of P x (k + 1), we let M(€) = {MAX(C;): 1 <i < ¢}. We say 
y covers x in © if there is some C; € € so that y covers x in the “chain C;. When 
S cP x (k+4), the set {x € X: (x,i) € S for some é} is called the projection of S 
on P. For each i, the subset 5 1 (P x {i}) is called level i of S. The projection on 
P of level i of M() is denoted by M,(%). 

A chain partition © of P x (k+ 1) is special if the following two conditions hold: 

(i) Mo(€) > M\(€) > M2(€) D ++» > My_1(); 

(ii) If x © M,(%) — M,_;(@), then (x, k) covers (x,k — 1) in €. 

A special chain partition of P x k+1 is very special if it also satisfies the fol- 
lowing two conditions: 

(iii) Exactly d,,,(P) — d,(P) of the chains in @ are subsets of level 0; and 

(iv) |€| = dy(P x (k+1)). 

When © is special, it follows from the second condition in this definition that 
M,(€) = N,(€) UN2(6) where Ni (6) = M,(€) 0 My_1(€). If x € N2(€), then 
(x, k) covers (x,k —1) in ©. 

We now show that the theorem follows whenever P x (k +1) has a very spe- 


cial chain partition. To see this, let €,,; = {Ci,Co,...,C;} be a very special 
chain partition of P x (k +1) where ¢ = d\(P x (k+1)). Set s =d,.,(P) — d,(P). 
We assume that C;,C2,...,C, are subsets of level 0. For each j = 1,2,...,¢, let 


D, = {(x, i): (x,é+1) € Cj}. Of course, D\, D2,...,D, are all empty. Let €, be 
the collection of all nonempty D,’s. Then ©, is a chain partition of P x k and 
|€,| <1—s =d,(P) =4\(P xk). Thus |€,| = ¢,(P). Furthermore, it is easy to 
see that €, is a special chain partition of P x k. 
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Now level & in P x (k +1) forms a copy of P and the |M,;(€,,1)| chains in €,,, 
which intersect level k determine a chain partition of P which we denote by ©. 
We show that @ is both k-saturated and (k + 1)-saturated. 

Now let j € {k,k +1}. Then 


j- 
d,(P) = e\(€,) = |M(€,)| = S_ |Mi(€))I 


Ly 
i 4 
> Gj — 1)]M;..2('6))| + |Mj_-1(6;)| 
= (fF — DIMj 2(&))| + (Mj-1 (66) 9 Mj_-2(G)| + |No(€)| 
2 JMj-1(€;) 9 Mj-2(€;)] + |No(€;)| 
= JIN: (€;)| + |No(€;)| 


> > min{j,|E|} 


Ec€é 
=,(€) > d,(P). 


Thus @ is both k-saturated and (k + 1)-saturated as claimed. To complcte the proof, 
we need only show the existence of a very special chain partition of P x (k +1). 
Set ¢ = d,,,(P) = d,(P x (k + 1)). Of all partitions of P(X + 1) into ¢ chains, choose 
one having as many chains as possible as subsets of level 0. Call this partition 
6 = {C;,C2,...,C,} and label the chains in © so that C,,C2,..., Cy are subsets of 
level 0 but Cy,,, Cs42,..., C, are not. Since the last : — s chains in € cover a copy of 
P x k, we know t ~— s 2 d\(P x k) = a, (P), so s < dy,\(P) — d,(P). We show that 
s = d,,,(P) — d,(P). Suppose to the contrary that s < d,,,(P) — d,(P). 

Let Q= Px (k+1)—(C, UC,uU---UC,). Clearly, the width of Q is ¢ — s. Let 
A be the unique greatest element in the poset 4((Q) of maximum antichains in Q. 
A contains at least d,.:(P) — d,(P) —s elements from level 0 since the width of 
the top & levels is only d,(P). Choose an element aj € A which comes from level 
0. Without loss of generality ay € C,,;. Let Q’ = Q — {c € Cy,1: € < ag}. 

We claim that the width of Q’ is less than ¢ — s, for if Q’ contains a (tf —s)- 
element antichain B, then B contains an element 5 with ay < 6 in Q x (k+1). 
This contradicts our choice of A. It follows that we can partition Y’ intot—s-—1 
chains which together with C,,C2,...,C, and {c € C,,,: ¢ < ao} form a partition of 
P x (k + 1) into r chains. In this partition, there are s + 1 chains which are subsets 
of level 0. The contradiction shows s = d,,,(P) — d,(P). 

We now proceed to transform € into a very special partition by a series of 
operations called insertions argi switches. At this moment © satisfies properties 
(iii) and (iv), and both operations preserve these properties. 

We first perform a series of insertions. Choose points (x, i), (y,/) so that (x, /) 
covers (y,j) in @ andi > j. If i #j+1 or x 4 y, remove (y,j +1) from the chain 
to which it currently belongs and insert it in the chain containing (x,/) and (y, j). 
Repeat until no further insertions are possible. 
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Next, we perform a series of switches. For an integer / > 1, locate a point (x, /) € 
M(6) so that either: (1) j < k and (x,j— 1) ¢ M(€); or (2) j =k and (x, /) does 
not cover (x, j — 1) in €, and (x,j — 1) ¢ M(€). 

Let (y,i) be the point covering (x,j—1) and let C be the chain containing 
(y,é+1). Let C” consist of those points in € which are less than (y,i+1) and 
let C” = C—C’. Then let D be the chain containing (x, j) and set D’ = DUC". 
Replace C and D in € by C’ and D’. Repeat until no further switches are possible. 

It is obvious that the series of insertions must stop, but it takes a moment’s 


reflection to see that this is also true for the series of switches. For j = 1,2,...,k, let 
v; count the number of points x € X for which (x, j) covers (y, i) in € and (x, j — 1) 
covers (y,i—1) in @. Each time we perform a switch, the vector (v,,v2,..., vg) 


increases lexicographically. Since v; < |X| for each j, the procedure stops. 


This theorem has many significant applications. The following corollary follows 
easily, but we know of no simple proof avoiding the use of Theorem 2.4! 


Corollary 2.5. Let P be a finite poset. Then for each k > 1, d,(P) — dy_\(P) 2 
dy1(P) — ax (P). 


3. Kierstead’s chain partitioning theorem 


In this section we outline the proof of a theorem of Kierstead (1981) which asserts 
that for each n > 1, there is a ¢ = #(n) for which there exists an on-line algorithm 
which will partition any poset P of width at most 7 into ¢ chains. By an on-line 
partition, we mean that the poset and the partition are constructed one point at a 
time. An adversary (infinitely clever) constructs the poset and we must devise the 
partition. At each round, the adversary presents the new point and describes its 
comparabilities and incomparabilitics to all preceding points. We must then add 
the new point to one of the sets making up the partition. Both players’ moves are 
permanent. 

As a warm-up, we first present the on-line version of the dual to Dilworth’s 
thcorem. The result is an unpublished theorem of Schmerl, although a short proof 
is given in Kierstead (1986). 


Theorem 3.1. For each n > 1, there exists an algorithm which will construct an 
on-line partition of a poset of height at most n into n(n + 1)/2 antichains. 


Proof. When the new point x is added to the poset, let r = r(x) be the maximum 
number of points in a chain having x as least element, and let s = s(x) be the 
maximum number of points in a chain having x as greatest element. Assign x to 
the set A(r,s). Clearly, each A(r,s) is an antichain. Since r+s —1 <n, there are 
n(n +1)/2 such sets. © 


Szemerédi produced a simple argument to show-that Theorem 3.1 is best pos- 
sible, and we invite the reader to reconstruct his proof. Full details are given in 
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Kierstead (1986). As a first step, show that there exists a strategy for constructing 
a poset P of height at most n which will force any opponent producing an on-line 
partition into antichains to use at least n(n + 1)/2 antichains in covering P and at 
least n antichains in covering MAX(P). 

Here is Kierstead’s on-line chain partitioning theorem (Kierstead 1981). 


Theorem 3.2. For each n 21, there exists an algorithm which will construct an 
on-line partition of a poset of width at most n into (S" — 1)/4 chains. 


Proof. The argument proceeds by induction on x with the case n = 1 being trivial. 
The heart of the argument is the case n = 2 where we have to partition a width-2 
poset into 6 chains. 

We first construct a greedy chain C;. As a new point enters the poset, we insert 
it in C, whenever it is comparable to all other points previously placed in C;. 
Thus for every x € X — C), there is a nonempty set /(x) of points from C; which 
are incomparable to x. Although /(x) may grow with time, it is always a set of 
consecutive points from C,. When x,y € X — C,, we write I(x) < /(y) when u <u 
for every u € I(x) and every uv € I(y). Note that if x and y are incomparable points 
in X — C,, then the following condition holds: 

(K). When the latter of x and y enters, either 


I(x) < 1(y) or I(y) < T(x). 


In fact, when n = 2, the qualifying phrase “when the latter of x and y enters” 
can be dropped since I(x) NI(y) = @ whenever x || y. Regardless, we choose the 
weaker statement since it is crucial to the inductive step. 

We define a partial order, called the *-order, on X — C, as follows. When the 
new point x enters, we set x *y if 

(1) x<yin P—C,, or 

(2) x || y and I(x) < I(y). 

Similarly, we set y * x if 

(3) y<x in P—(C, or 

(4) x || y and I(y) < I(x). 

With this definition, it is straightforward to verify that (X — C,, *) is a chain, i.e., 
* is a linear extension of the original partial order on X — C,. Next, we define an 
equivalence relation on P — C;. Just as is the case with the *-order, the definition 
of this equivalence relation is on-line. The relation will satisfy: 

(a) each equivalence class is a set of consecutive elements of X — C, in the 
*-order, and 

(b) if x and y are consecutive elements belonging to the same equivalence class, 
then f(x) NI(y) # 9. 

When a new point x enters X — Ci, we put x in the same equivalence class as 
y if x <: yin * and J(x) NJ(y) # @. If no such y exists, we put x in the same class 
as z if z <: x in * and I(x) N/(z) #9. If neither of these results in the assignment 
of x to an existing class, start a new equivalence class whose only element (at this 
moment) is x. 
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Note that if x enters between two consecutive points y and z belonging to the 
same class, then I(x) NI(y) 4 9 4 I(x) N1(z). This insures that property (b) will 
be preserved when x is added to this class. 

To complete the proof of the width-2 case, we need to verify the following claim 
whose proof we leave as an exercise. 


Claim. /[f S,; and S; are equivalence classes of X — C,, and there are at least two 
other equivalence classes between them in the *-order, then S, U S, is a chain in P. 


Once the chain is verified, we may use it to devise a simple strategy for construct- 
ing an on-line partition X — C; = C2 UC,U---UC.. Each of these five chains is 
the union of equivalence classes, and any two classes which are subsets of the same 
chain have at least two other classes between them in the *-order. When we start 
a new class for a point x, we assign it to a chain which does not contain any of the 
four classes - two above x and two below x in the *-order. 

To obtain the general result when the width of P is n > 3, start by constructing 
a greedy chain C; just as before. Then define a partial order * on X — C, so that 
* is an extension of the original order on X — C, and the width of (X¥ — C,, *) is 
n—1. When the new point x enters X — C, we set x * y if either (1) or (2) holds. 
However, we also set x  y if: 

(1’) there exists u € X — C, so that x <u in P and uw * y, or 

(2') there exists v € X — C; so that x xv and v < y in P. 

It is easy to see that this more general definition of * is necessary in order to 
insure that * is transitive. The definition of when y + x must be expanded analo- 
gously. With these observations, it is clear that the width of (X — C,, *) is at most 
n—1, for if A = {@),@2,...,4m} is an antichain in (X — C,,*), then (2, /(a;) 4 @. 
If x € C, and x || a; for i =1,2,...,mm, then {x} UA is an antichain in P. It follows 
that there is a strategy for partitioning X — C, into (5""! — 1)/4 subsets each of 
which is a chain in the *-order. Observe that a *-chain satisfies property (K), and 
the algorithm described for the width-2 case will then partition a *-chain into five 
chains in P. The theorem follows since (5" — 1)/4 =1+5(5""!-1)/4. O 


It is apparently a very difficult problem to determine just how good the upper 
bound in Theorem 3.2 actually is. For n = 2, Theorem 3.2 gives an upper bound 
of 6, but Felsner (1995) has just shown that the correct answer is 5. Saks observed 
that the techniques used to show that Theorem 3.1 is best possible can be dualized 
to produce a lower bound of the form n(m + 1)/2 for Theorem 3.2. This bound 
is probably a very weak result, and it would be interesting to determine whether 
there is an algorithm which will partition on-line a poset of width at most n into 
n° chains for some absolute constant c. 

Recently, Kierstead et al. (1994) have shown that for every radius-two tree T, 
there exists a function fy: N — N, so that if G is any graph which does not con- 
tain T as an induced subgraph and does not contain a complete subgraph on k 
vertices, then G can be colored on-line with f;(k) colors. The complement of a 
comparability graph does not contain the subdivision of K,3, so it follows that 
there exists a function g : N — N so that a comparability graph with independence 
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number x can be partitioned on-line into g(n) complete subgraphs. This result does 
not follow from a straightforward extension of the ideas presented in this section. 
The difficulty is that the argument presented here makes specific use of the order 
relation between points—not just the information as to which pairs of points are 
comparable. 


4, Sperner’s lemma and the cross cut conjecture 


A poset P is said to be ranked if every maximal chain in P has the same number 
of points. When P is ranked and x ¢_X, we let r(x) be the largest i so that there 
exists a chain of i points having x as its least clement. The value r(x) is called the 
rank of x, and the antichains A; = {x: r(x) = i} are called ranks. The poset P is 
said to be a Sperner poset if the width of P equals the maximum cardinality of its 
ranks. The following now classic result is due to Sperner (1928). 


Theorem 4.1. For each n > 1, 2” is a Sperner poset. In particular, the width of 2" 
is Carp (cf. chapter 24). 


Proof. Weconsidcr 2” as the set of all subsets of {1,2,...,} ordered by inclusion. 
It is easy to see that the maximum cardinality of a rank of 2” is the binomial 
coefficient (n/a) Also there are n! maximal chains in 2”. Now suppose the width 
of 2” is t, and let # = {A,, A2,...,A,} be a maximum antichain in 2”. If A € of and 
|A| =k, then there are k!(n — k)! maximal chains in 2” which contain A. It follows 
that )7)_, k:!(m — k;)! < n! where &; = |A;|. Thus (ina) < Wakil — ki) /nt < 
1, so thatr < Cary) as claimed. 


An enormous amount of rescarch has been done on generalizations of this el- 
ementary result, and we encourage the reader to consult chapter 24 or the book 
Anderson (1987) which concentrates on this subject. Also, chapter 32 contains an 
important result from Sperner theory. In view of our space limitations, we include 
here only an outline of the theorem of Canfield (1978) which asserts that suffi- 
ciently large partition lattices are not Sperner posets. The argument we give is 
patterned after the argument given subsequently by Shearer (1979). 

The partition lattice Tl, is the poset whose elements are the partitions (into 
equivalence classes) of the set {1,2,...,”}. In I, we set 7 < 7 <=> each class in 
m, is a subset of a class in 772. Partition lattices are natural combinatorial objects and 
have been studied extensively both in combinatorial mathematics and in related 
areas. For example, an important theorem in lattice theory due to Pudlak and Tuma 
(1980) asserts that every lattice is a sublattice of a partition lattice. The problem of 
investigating the Sperner propegty for partition lattices was popularized by Rota. 

The rank sizes of the partition lattice II, are the Stirling numbers of the sec- 
ond kind S(n,k) for k =0,1,2,...,1. These numbers form a unimodal sequence 
achieving maximum value when k = k,, ~ n/logn. If 7 € TI, and 7 has k; classes 
of size i for each i, then 7 is called a partition of type 1*12#234 ..- nk». 

The following lemma is an easy exercise. 
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Lemma 4.2. The number of partitions in 1, of type 1412” ...n& is 
nl (ANE (21) (nt) Rr ky thal kyl. 
Here is Canfield’s (1978) solution to Rota’s conjecture. 


Theorem 4.3. If n is sufficiently large, the partition lattice 11, is not a Sperner 
poset. 


Proof. We actually prove a slightly weaker result. We show that for certain large 
values of 7, the partition lattice II, is not a Sperner poset. It is a relatively straight- 
forward extension to obtain the general result. Let Y consist of all partitions of 
type m (2m)? (3m) where hy + hy +hy =k +1, and k =k, is chosen to maxi- 
mize the Stirling number S(n, k). Note that mh, + 2mh2 + 3mh; = n. Furthermore, 
hy, hy, and Ag satisfy: 


hy, = [5(k +1) — 2n/m — 6/3, 
hy = [n/m — (k +1) + 28)/3, 
hy = [n/m — (k +1) — 0]/3. 


The value @ is taken from {—1,0,+1} so that each A, is an integer. However, for 
the time being m is unspecified, although we assume that m divides n. 

Any partition into & classes which is comparable to a partition in ? must belong 
to one of the following six types: 


Type 1: m"-2(2m)"2*! (3m)? 
Type 2: mm"! (2m) (3m)! 
Type 3: m™~'(2m)""(3m)"" | (4m)! 
Type 4: mm" (2m) 7m) (4m)! 
Type 5: m’(2m)'""'(3m)""“'(Sm)! 
Type 6: m” (2m)'? (3m)? (6m)! 


We will show that for sufficiently large n, there are fewer than || partitions of 
these six types. The remaining partitions into k classes together with the partitions 
in ? will then form an antichain of more than S(n,k) elements which shows that 
I], is not a Sperner poset. 

By Lemma 4.2, the ratio of the number of partitions of Type 1 divided by the 
number of partitions in P is hy (hy — 1)/(h2 + 12"). Then it is an easy (although 
tedious) calculation to show that if we set m = [1/1.06k], then this ratio goes to 0 
as n tends to infinity. An analogous computation shows that for this value of m, the 
ratio of the number of partitions of Type i divided by the number of partitions in 
goes to 0 for each of the other five types. It may be shown that when n > 4 x 10°, 
all six ratios are sufficiently small that their sum is-less than 1 and the theorem 
follows. O 
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Some progress has been made in reducing the value of n for which it can be 
shown that II, is not a Sperner poset. Jichang and Kleitman (1984) have lowered 
the estimate to 3.4 x 10°. However, the enormity of these estimates and the width 
of the corresponding partition lattices are striking testimony to the adage well 
known to researchers in combinatorical mathematics: Woe be to those who make 
conclusions based on detailed examinations of small examples. Sometimes we are 
startled to learn just how large small can be. 


5, Linear extensions and correlation 


Let P be a poset, and Ict denote the set of all linear extensions of P. It is natural 
to consider the elements of € as equally likely outcomes in a finite probability 
space. When x and y are distinct points in P, we let Prob[x < y| denote the prob- 
ability of the event consisting of all linear extensions in which x < y. Observe that 
Prob[x < y! is the ratio of the number of linear extensions with x < y divided by 
||. Note that Prob[x < y] =1 <> x <y in P, and Prob[x < y] =O <> y <x in 


Similarly, if x, y, z are three distinct points in P, we let Prob[x < y|x < z] denote 
the conditional probability that in a random selection of a linear extension, the 
relation x < y holds given that x < z holds. In 1980, Rival and Sands made the 
following conjecture. which quickly became known as the xyz-conjecture. 


Conjecture 5.1. If x,y,z are distinct points in a poset P, then 


Prob[x < y] < Prob[x < y|x < 2]. 


It is easy to see that the xyz-conjecture holds except possibly when {x, y, z} 
is a three-element antichain. In this case, Rival and Sands conjectured that the 
inequality in Conjecture 5.1 was strict, and this stronger version became known as 
the strict xyz-conjecture. The original conjecture was settled in the affirmative by 
Shepp (1980, 1982) using the FKG-inequality from statistical mechanics. The strict 
xyz-conjecture was proved by Fishburn (1984) using an important generalization 
of the FKG-inequality proved by Ahlswede and Daykin (1978), which we call the 
AD-inequality. 

Although we do not include its proof here, the AD-inequality is a marvelous 
device with a growing list of applications in combinatorics. We encourage the 
reader to study its elementary proof carefully. 

Let P be a distributive lattice with mect and join denoted A and V respectively. 
When A and B are subsets of X, welet AA B = {aN b:a€ Aandb € B} andAV 
B={avb:a€ Aandb € BAWhen f : X + RandA CX, let f(A) = rca F(x). 
Let Ro denote the nonnegative real numbers. 

Here is the AD-inequality. 


Theorem 5.2. Let P be a distributive lattice and let a, B, y, and 6 be functions 
from X to Rg satisfying: 


Partially ordered sets 447 


(i) a(x)B(y) < y¥@Ay)d& V y) for all x,y € X. 

Then the following inequality holds for every A,B Cc X: 

(ii) a(A)B(B) < y(A A B)d(A v B). 
Corollary 5.3 (Fortuin ct al. 1971). Let P be a distributive lattice and let ~:X — 
[0,1] setisfy: 

(i) wlx)uely) < HO Ay) Vy), for all x,y € X. 

If f and g are monotonic functions from X to R, then the following inequality 
holds: 

(ii) (Fe) MN (ge)(X)] < [AON Gee) (X)). 


Proof. Assume first that f and g map X to Ro. For each x € X, define a(x) = 
f(x)u(x). B(x) = g(x)ul(x), v(x) = w(x), and 6(x) = f(x)g(x) u(x). Then for an 
arbitrary x,y, € X we have: 
a(x)B(y) = f(x)u(x)g(yu(y) 

= f(x)g)u(x)u(y) 

< f(x)gyur A y)ul(x Vv y) 

< w(x A y) f(x V yg(x Vv y) u(x V y) 

= yw Ay)d(x Vy). 


Therefore a(X)B(X} < y(X)8(X) as claimed. When the range of f or g includes 
negative reals, we increase both functions by some suitably large constant. O 


© 


A subset S of a poset P is called a down set (up set) if x € S andy <x in P 
(x < y in P) always implies y € S. 


Corollary 5.4. Let U, and U, be up sets in a distributive lattice P. Then |U,||U2| < 
JU, U,||X|. 


Proof. Set a(x) = B(x) = y(x) = &(x) =1 for every x € X. Observe that U, Vv 
U,=U,N U2, and U,AU,CX. O 


The special case of Corollary 5.4 when P = 2" was first proved by Kleitman 
(1966) and in dual form by Seymour (1973). 
We close this section by outlining Shepp’s (1982) proof of the xyz-conjecture. 


Theorem 5.5. Let P be a poset and let x, y, and z be three distinct points in X. 
Then 


Prob[x < y] < Prob[x < y|x < z]. 
Proof. We assume {x,y,z} is a three-element antichain in P. Let k be a positive 


integer, and let Y, denote the set of all order preserving functions from P to 
k. Define a partial order P, on Y, by setting f < g in P, <= f(x) 2 g(x) and 
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f(u) — f(x) < g(u) — g(x) for every u € X. It is straightforward to verify that the 
poset (Y;,P,) is in fact a distributive lattice. 


Now let Uy(k) = {f € Yq: f(x) < f(y)} and U2(k) = {f € Y,: f(x) < f(z)}. 
Then U,(k) and U>(k) are up sets in the distributive lattice (Y,, P,). Therefore 


|U, (&)}|U2(k)] < [Ui (A) Uae, 
so that: 
i(k) [Ui(k) 0 Ua(k)I/ Ved 
IYx |Ur(k)/|¥ «I 
However, it is easy to see that as k tends to oo, the left-hand side of this inequality 
approaches Prob[x < y] while the right-hand side approaches Prob[x < y|x < z]. 
QO 
The reader should note that the truly clever part of this proof is the nonstandard 
definition of the partial order P, so that (Y,,P;,) is a distributive lattice having 
U,(k) and U2(k) as up sets. Fishburn’s (1984) proof of the strict xyz-inequality 


requires two applications of the AD-inequality. Winkler’s (1986) survey article is 
a good starting point for an overview of work on correlation. 


6. Balancing pairs and the 4-3 conjecture 


The following conjecture is due to Kislitysn (1968), although it was also made 
independently by Fredman (1979) and Linial (1984): 


Conjecture 6.1 (The 5 42 eOUIeclne)- If P is not a chain, then P contains distinct 
points x and y for which 4 & Prob[x > y] < 5 


If true, this conjecture is best possible as is evidenced by the three-point poset 
2+1. Given a poset P = (X,P) which is not a chain, let 5(P) denote the largest 
positive number for which there exists a pair x,y € X with 6(P) < Prob[x > y] < 
1 — 6(P). Using this terminology, we can restate the }—3 conjecture as follows. 


Conjecture 6.2. If P is not a chain, then 6(P) 2 }. 


The original motivation for studying balancing pairs in posets was the connection 
with sorting. The problem was to answer whether it is always possible to determine 
an unknown linear extension of a poset P with O(logr) rounds (questions) where 
t is the number of linear extensions of P. The answer would be “yes” if one could 
prove that there exists an absolute constant 4) so that 8(P) > 5) for any P which 
is not a chain. 

Linial (1984) has shown that the 1 412 conjecture holds for posets of width two. 
Fishburn et al. (1992) show that it holds for posets of height at most two. Although 
the conjecture remains open for general posets, we present a partial result, due 
to Kahn and Saks (1984), which is particularly appealing in view of its nontrivial 
application of the Alexandrov—Fenchel inequalities for mixed volumes. 
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Theorem 6.3. if P is not a chain, then X contains a distinct pair x,y so that 


3 < Prob[x > y| < 4. 


and thus 6(P) > 7: 


Proof. Clearly, we may assume that P does not have a least element. Let ¢ 
denote the set of all linear extensions of P and let n= |X|. For each L € @, let 
h,: X >a be the order preserving injection determined by L. Let |¢|=12 and 
then define h: X — Rg by A(x) = (lies h, (x))/t. The value h(x) is the average 
height of x among the lincar extensions in @. Since no element satisfies h(x) = 0, it 
follows that there exist distinct elements x,y € X with 0 < A(y) — A(x) < 1. Note 
that such a pair must be incomparable. We will show that 2 < Prob[x > y] < 
+. The argument depends on a series of lemmas which hold for an arbitrary 
incomparable pair in P. , 

Fix an arbitrary incomparable pair x, y € X. For a positive (negative) integer i, 
let e; be the number of lincar extensions of P in which x is below (above) y by 
exactly i positions. 


Lemma 1. e; = e_1. 
Proof. Since x and y occur consecutively, they may be interchanged. 
Lemma 2. ¢2+¢€_2 <e) +e 4. 


Proof. Suppose L ¢ € and |h,(y) — Ay (x)| = 2. Let u be the unique point be- 
tween x and y. If u || x, exchange u and x. If u is comparable with x, then u || y. 
In this case, the cyclic permutation (uyx) converts L into an extension in which x 
and y are consecutive. The mapping is easily seen to be an injection. 0 


Lemma 3. /f |i| > 2, then e; < 6. | + ej). 


Proof. Without loss of generality i 2 2. Suppose the lemma is false and of all 
counterexamples, choose one for which |X| = is minimum. For this value of 7, 
we choose P so that the number of comparable pairs is maximum. We first show 
that for every incomparable pair u,v € X, one of u and v is greater than or equal 
to x and the other is less than or equal to y. 

Suppose to the contrary that «u || v and that the above statement does not hold. 
Let P’ be the poset obtained by adding the relation u < v to the partial order on 
X and taking the transitive closure. Also let P” be the poset obtained by adding 
u <u. In P’, we let e denote the number of linear extensions with x below y by 
oaey i] paswiens Also, let e” denote the corresponding number for P”. Then 
ej =e, +e; for cach j 21. Since P’ and P” have more ee pairs than P, 
we now that ege_, te, ande” <e” | +e" .,so that e, < e;_; +4; as claimed. 

We next claim that € MIN(P) and y €e MAX(P). For suppose u € MIN(P) and 
that wu <x. Ifu<y in P, then 2 must be comparable with every point in X, ie., 
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x, Veen = y 
¥3 
X Y2 
*; 
¥ 
x= Xo 
Figure 6.1. 


u is the least element of P. In this case, we consider the poset P’ = P — {u} and 
the numbers ¢; defined in the natural way for P’. Since e; = e/, we would conclude 
e; < e;_1 + €j41. The contradiction shows x € MIN(P). Dually, y ¢ MAX(P). 

These remarks show that X is the union of two chains x = xy < x) <x2<--- < 
x,and y; < yo <-+- < yy <ys4) = y. Furthermore, no y; is larger than any x, in P. 
(Sec fig. 6.1.) 

We now distinguish two cases, 

Case 1.i <n—1. Choose L € € with h;(y) —hye(x) =k. If x is not the least 
element in L, exchange x with the element immediately under it. If x is the least 
element in 1, then y is not the greatcst. Let z be the element immediately over 
x. If x || z, exchange them; otherwise exchange y with the clement immediately 
over it. This procedure is an injection which transforms L into a lincar extension 
in which x is below y by either i — 1 or i+ 1 positions. 

Case 2. i =n — 1, For each subposet Q Cc P, let e(Q) denote the number of linear 
extensions of Q. Let Z = P — {x,y}. Then e, ; = e(Z — {x,}) +e(Z — {y)}), ej = 
e(Z) and e;,, = 0. Now e(Z) = e(Z — {x,}) + e(Z — {y1}) so, to complete the argu- 
ment, we need only show that e(Z — {x,}) 2 e(Z — {x:}). However, this inequality 
follows immediately since the mapping x; — x;_; transforms a linear extension of 
Z — {x,} into a linear extension of Z — {x,}. This completes the proof of Lemma 
3. O 


The next lemma is a specia#case of the Alexandrov—Fenchel inequalities for 
mixed volumes. This method was pioneered by Stanley (1981). 


Lemma 4. Let Ko and K, be convex subsets of R". For each A withO< A <1, let 
K, = {(1 — A)vo + Avy: vo € Ko, vu) € Ki}. Let d be the dimension of the affine hull 
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of K, for 4 € (0,1). Then there exist unique numbers ao, a,,...,@q so that for all A, 
the d-dimensional volume of K) satisfies: 


d 
Vol(K,) = Ps @ ag(l =A) "a5, 


k=0 
Furthermore, the sequence @9,4),@2,...,@4 is logarithmically concave, i.e., 
2 
Gy, > ay-1Ag41, fork =1,2,...,d—1. 


Lemma 5. The sequences e),€2,€3,...,€n @nd €_n,@_nsi,-.-,e_, are logarithmi- 
cally concave. 


ne Let R? denote the vector space of all order preserving functions from P 
o R. Also let C(P) = {f € RP: 0 < f(x) < 1 for all x € P}. For A withO <A <1, 
2 K, = {f € C(P): f(y) — f(x) = A}. Note that K, = (1 - A)Ky + AK}. Also note 
that when 0 < A < 1, Ky has dimension n — 1. ‘ 
For each Le € with x <y in L, let A\(L)={f€ Ky: u<w in L= f(v) < 
f(w)}. Then K, is the union of the 4,(L)’s taken over all L with x < y in L. Since 
the A,(L)’s have disjoint interiors, we see that 


Vol(K,) = > Vol(A,(L)). 
. oan L 


Now consider some particular L € & with x < y in L. Suppose L orders X as up < 
vy < +++ <v,-y With x = v; and y = v;. Then A,(L) consists of those f € C(P) for 
which 0 < f(vo) < f(v1) < +++ < f(vn-1) < 1 and f(v;) = f(v;) + A. Now consider 
the volume preserving mapping defined by: 

F (UK) > Fug) — A when k > j; 

flux) ~ fy) —f(vi) whenj>2k>t, and 

f(uy) — f(vg) when i > k. 
Then the image of A,(Z) under this mapping is the set of f ¢ C(P) for which 


0-< fv) < fli) $+ < fli) < floja) < + < fUn) <1 
and 0 < f(u;) < f(vis:) <->» < f(v;) = A. However, this set is the product of two 
simplices and its volume is therefore 

(1—a)t-s+! pi-i-l 

(n-j+d! G-i- DY 
It follows that 

(Ps Ayes ~<A)" hal divin! 
ol( a) Qo ae (n—j+i! G=f=1) 
Osi<jgn--1 fe? 


Ay barst 
we tries 


(1 — a)jr- i AE! 
Gear @-i GopF 
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From Lemma 4, we conclude that a; =e;_,/(n—1)!, and thus the sequence 
€1,€2,€3,..- iS logarithmically concave. The argument for the other sequence is 
dual. O 


For the remainder of the proof, we assume that x and y are an incomparable 
pair with 0 < A(y) — A(x) < 1. For each i > 1, we let 6; =e;/t and a; = e_;/t. We 
then know that the following conditions hold: 

(1) a1,4@2,43,... and b,, bo, b3,... are sequences of nonnegative real numbers so 
that \oj51 4 + Dojo1 8; = Le 

(2) ay = by . 

(3) a7 + bz < ay + by. 

(4) Qi S Aj, + Qja1 and bj < Bj. + bist for all i 2 2. 

(5) a? > aj_1a;,, and b? > b;-,b;4; for all i > 2. 

(6) If a; = 0, then a@;,; = 0, for all i > 1. 

(7) If b; = 0, then 6;,, = 0, for all i > 1. 

It remains only to show that whenever these seven properties are satisfied, we 
always have 5°;,, ia; — }0;,,ibi 2 1 whenever 5°,., bi < ;;. To accomplish this, 
we fix a value b = bi = aj, and the value B = }7,,, b;. For the pair (b, B), we 
determine the unique sequences {a;: i > 1}, {b;: i > 1} satisfying these seven con- 
ditions which minimize >> ia; — >> ib;. 

The infinite geometric sequence obtained by setting « = b/B and b; = b(1 — 
e)'"! is easily seen to maximize 5* ib; among all sequences {b;: i > 1} satisfying 
b, = b, 35; = B, and b? > b;_1b;,, for all i > 2. The argument to minimize )> ia; 
is somewhat more complicated. We know a2 + bz < a, + b; = 2b so ay < 26 — by = 
b(1 +). Therefore a@;,; < (1 + &)a; for every i > 1. On the other hand, set A = 
1—B and for each j > 1, let s; = A—5*)_, aj. Then aj, <5;. Also aj. < aj + 
Qj42 [Gj + Six = Gj + 8; — Gjx1, SO that aj.) < (a; + 5;)/2. It is then easy to verify 
that 5° ia; is minimized when there is some k > 1 so that a; = b(1 + ¢)'~' for all 
i=1,2,...,k and either: 

Type 1: ay.) = 8, a; = 0 for all i > k +2; or 

Type 2: ays, = (sy + a,)/2 where 54 > ay, Gks2 = Shar = (Se — ay)/2, and a; =0 
for all i >k +3. 

Set a = a;,,/a, and B = ax,o/a,. We verify that >> ia; — 55 ib; > 1 for a Type 2 
sequence {a;: i > 1}. The reader may enjoy the challenge of handling the Type 1 
case —it is somewhat easier. Now for a Type 2 sequence, we know: 


1l+e2a>1, B=a-1, and (1) 


(1+ e)!-* 5 


ae+B= =B 


15 I/e. (2) 
Using the definitions of {a;: i 2 1} and {b;: i > 1}, we find that 


Se ee eA et 28) 


étet(atB+1) 
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The inequality }* ia; — }° ib; > 1 is then equivalent (using (1)) to 


(1+e)*! <1, 2h 
ke “Book 


This in turn is equivalent to 


(_d+e)7!. (3) 


(467 +5e4+2)(1+e)*-! Z 1 


(2k +1)e ae (4) 
Now p = TEPAeya Gat — 3: — 1 so the inequality 0 < B < e& converts to 
k-1 1 
(2e+1)(l+e)s' < B (5) 
and 3 
(267 +2e41)(1+e)*' > x. (6) 


We may assume « < 2/(2k — 1); otherwise k > 1/e+ 5 and 


(462 +5e4+2)(1+e)"! 
(2k + le 


(46? +5e4+2)(14+ ek! 
2+2¢ 


< (2e+1)(14 ce)! 
<1/B. 


< 


We may also assume & > 3/(3k + 1), for if e < 3/(3k + 1), inequality (5) implies 


Bi NES 8 6 11 
1+ —— ——_~ +~>—— +1 2 =>. Z 
(+57) laesie*aesi*? 3 7) 
However, inequality (6) is false for all k 2 1. 
To complete the proof, we observe that the left-hand side of inequality (3) is an 
increasing function of e« when « > 3/(3k +1). It suffices to test the validity of (3) 


at an upper bound for ¢. When k = 1, the trivial bound € < 1 works. For k 2 2, 
use the bound ¢ < 2/(2k — 1). This completes the proof. O 


In a certain sense, the approach taken by Kahn and Saks cannot be improved. 
In fig. 6.2, we show a poset P containing two points x, y satisfying A(y) — A(x) =1 
and Prob|x > y] = #4. 

Other proofs bounding 6(P) away from zero have been given. Khachiyan (1989) 
uses geometric techniques to show 5(P) > 1/e?. Kahn and Linial (1991) provide 
a short and elegant argument using the Brunn—Minkowski theorem to show that 
5(P) > 1/2e. Friedman (1993) also applies geometric techniques to obtain even 
better constants when the poset satisfies certain additional properties. Kahn and 
Saks (1984) conjectured that 6(P) approaches } as the width of P tends to infinity. 
Komlés (1990) provides support for this conjecture by showing that for every 
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Figure 6.2. 


eé > 0, there exists a function f,(”) = o(m) so that if P = (X,P) is a poset with 
|X| =n and at least f,(n) minimal points, then 5(P) > 5 — e. 

As Kahn and Saks (1984) point out, the value of the constant in Theorem 6.3 
could be improved if we could show that there exists a positive absolute constant 
y so that if P is not a chain, then it is always possible to find an ordered pair 
(x,y) with 0 < h(y) — h(x) < 1 — y. However, nobody has yet been able to settle 
whether such a y exists. If it does, then as shown by Saks (1985), it must satisfy 
y < 0.133. Even this value would not be enough to prove 6(P) 2 ; However, it 
is of interest to determine the maximum value of |k(y) — A(x)| which allows one 
to conclude that } < Prob[x > y] < 2 Felsner and Trotter (1993) discuss how to 
modify the Kahn-Saks proof technique to obtain the next result, which is clearly 
best possible. 


Theorem 6.4. Let (x,y) be distinct points in a poset P, and suppose that 0< 
h(y) — A(x) < 3. Then } < Prob[x > y] < 3. 


Felsner and Trotter (1993) obtain a slight improvement in the Kahn—Saks bound 
by considering subposets in which the points are relatively close in average height. 


Theorem 6.5. There exists an absolute constant € > 0 so that if P is a poset which 


is not a chain, then 5(P) > e+ 3. 


In developing this theorem, Felsner and Trotter made a correlation conjecture 
which is of independent interest. Let x, y and z be distinct points in a poset P = 
(X,P). For each i,j € Z, let p(i,j) denote the probabilty that 4; (y) — Az (x) =i 
and h;(z) — h,(y) =/ in a random linear extension L of P. Felsner and Trotter 
then made the following cross product conjecture. 


Conjecture 6.6. Let x <: y <4ina poset P = (X,P). Then for all i,j € N, 
pi, fpG+1,7+1) < pG,j+ Dp+1,/). 


Brightwell, Felsner and Trotter (1995) prove the cross product conjecture in the 
special case i = j = 1. They then prove the following lower bound. 
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Theorem 6.7. /f P is a finite poset which is not a chain then 


5-Vv5 
i0 ° 


8(P) > 


The results of this section have emphasized existence questions — disregarding 
the issue as to how one actually goes about finding an incomparable pair of points 
which are balanced. Brightwell and Winkler (1991) showed that the problem of 
computing the number of linear extensions of a poset is #P-complete. However, 
if one is willing to use randomized algorithms, then a good approximation to the 
volume of a polytope can be efficiently computed, so that the theorems presented 
here can form the basis for a sorting algorithm. 

On the other hand, if we limit our attention to deterministic algorithms, then an 
alternative approach is necessary. Kahn and Kim (1994) use a concept of entropy 
for posets and show the existence of a polynomial-time deterministic algorithm 
for sorting in O(logt) rounds. Their algorithm shows how to efficiently locate 
pairs to use in queries so that, regardless of the responses, the determination of 
the unknown linear extension is made in O(logr) rounds. However, at individual 
rounds, the pairs need not be balanced in the sense that for a given pair (x, y) 
used in the algorithm, Prob(x > y] may be arbitrarily close to zero. 


7. Dimension and posets of bounded degree 


The dimension of a poset P = (X, P) is the least t for which there exists a fam- 
ily R= {L,,L5,...,L,} of linear extensions of P so that P = L,;NL20---O Ly. 
In fact, the dimension of P is the least t for which there exists a family R = 
{L,,L2,...,£,} of linear orders (not necessarily linear extensions of P) on X so 
that whenever x < y in P, there exists at least one / with y < {zi x <z in P} in 
L;. The concept of dimension was introduced by Dushnik and Miller (1941). 

In section 2, we presented Dilworth’s decomposition theorem, but we did not 
give much of an explanation for the role this theorem plays in research on posets — 
other than to motivate the Greene-Kleitman theorem. However, Dilworth’s theo- 
rem plays a major role in dimension theory. For example, Hiraguchi (1951) used it 
to prove that the dimension of a poset never exceeds its width. Hiraguchi (1951) 
also proved that the dimension of a poset P = (X,P) is at most |X|/2, provided 
|X| > 4. On the other hand, it is still unknown whether every poset P with three 
or more points always contains a pair whose removal decreases the dimension at 
most one (see Kelly 1984, Reuter 1989, Kierstead and Trotter 1991). 

Dilworth’s theorem is critical to dimension-theoretic ineqalities appearing in 
Bogart and Trotter (1973), Trotter (1974b, 197Sa,c, 1976a). It also plays a major 
role in the variants of dimension investigated in Bogart and Trotter (1976a,b), 
Kierstead et al. (1987), Kierstead and Trotter (1985, 1989). For additional back- 
ground material on dimension, the reader is encouraged to consult the monograph 
Trotter (1992) and the survey articles Kelly and Trotter (1982) and Trotter (1982). 
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Also, discussions of open problems in dimension theory are given in Trotter (1989, 
1992, 1994). 

In the remainder of this section, we discuss one important problem for which 
there is an interesting partial solution — utilizing the probabilistic method on posets, 
Brightwell’s survey article (Brightwell 1993) highlights the recent work in this 
rapidly growing area of research. 

For integers n > 3, k > 0, define the crown s* as the poset of height 2 having 
n+k minimal elements a), 42,...,@,., and n+ k maximal elements 5,,b),..., Dnek 
with a; < bj_1, b;-2, b;-3,...,bj-n+1 for each i (cyclically). Trotter (1974a) showed 
that dim(S*) = [2(n + k)/(k + 2)]. As noted previously, Hiraguchi (1951) proved 
that the dimension of a poset on m points does not exceed m/2, when m > 4, 
so the crowns {S): n > 3} show that this inequality is best possible. In fact, the 
crown s? is called the standard example of an n-dimensional poset. However, it 
is of interest to investigate conditions which force the dimension of a poset to be 
small in comparison to the number of points. 

We now proceed to study one such condition which surfaced in the investigation 
of crowns. For a point x in a poset P, let deg(x) count the number of points 
comparable (but not equal) to x in P. Then let A(P) denote the maximum value of 
deg(x) for x € P. Rédl and Trotter proved that if A(P) < k, then dim(P) < 2k? +2. 
For each x € P, let U(x) = {y: x < y in P} and let w = max{|U(x)|: x € P}. Then 
u< A(P). 

We now present a strengthening of this result due to Fiiredi and Kahn (1986). 
Lemmas 7.1, 7.2, 7.5, Corollary 7.4 and Theorem 7.6 all come from that paper. 


Lemma 7.1. dim(P) < 2(u + 1)(log |X|) + 1. 


Proof. Let |X| =n and set t = [2(u+1)logn]. Let L,,L2,...,L, be linear orders 
of X chosen at random. Then for each i = 1,2,...,¢ and for each x € P, the prob- 
ability that x < y in L; for all y € U(x) is at least 1/(u + 1). Hence the probability 
that no L; satisfies x < y in L; for all y € U(x) is at most [1 — 1/( +1)! < 1/n?. 
Thus the probability that there exists a pair x,y © X with y € x in X but there is 
no L; with x < y in L; for all y € U(x) is less than 1. Thus dim(P) < t as claimed. 
| 


For integers n, k with 1<k <n, let dim(1,k;n) denote the dimension of 
the poset formed by the 1-element and k-element subsets of {1,2,...,m} or 
dered by inclusion. Then dim(1,k;) is the least ¢ for which there exist 1 lin- 
ear orders L;,L2,...,L, on {1,2,...,2} so that for each (k + 1)-element subset 
Sc {1,2,...,n} and for each x € S, there is at least one L; in which x is the least 
element of S. The following enim follows along the same lines as Lemma 7.1. 


Lemma 7.2. For all integers n, k with 1 <k <n, dim(1,k3n) < k?(1 + log(n/k)). 


The Fiiredi-Kahn argument also depends on the following result, which is known 
as the Lovasz local lemma (Erdés and Lovdsz 1973). Other applications of this 
lemma are given in chapter 33 by Spencer. 
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Lemma 7.3. Let G be a graph on {1,2,...,m)} and let k = A(G) denote the maxi- 
mum degree in G. Suppose A\,A2,...,Am are events in a probability space so that 
for each i =1,2,...,m: 

(1) Prob[A;] < 1/4k, and 

(2) A; is jointly cen of the events {A;: ij is not an edge in G}. 
Then Prob[A;A2--- Am] > 0. 


Corollary 7.4. Let 6 > 500, s = [b/logb], and v = [4.7 log b]. Let # be a hyper- 
graph whose edges are subsets of sizes at most b from a set X. Suppose further 
that no point of X belongs to more than b edges in X. Then there is a parti- 
tion X = X,UX2U-+UX, so that |H 1 X;| < v for every edge H € # and every 
i=1,2,...,5 


Proof. Let X = X,UX,U---UX, be a random partition of X. We denote by 

A(H, i) the event |H m X;| > v. Then let G be the graph whose vertex set consists 

of the pairs (H,i) where H is an edge in # and 1 <i < s. The edges of G are the 

pairs (H,i)(H',i’) for which HM H' # 0. Therefore A(G) < (1+ b(b — 1))s <b’. 
However, since |H| < b, 


Prob[A(H,i)] = Prob{{H 7.X;| > v| 


EQ) 
SOQ)" 
AC) ae) 


rss 
The conclusion then follows from the Lovasz local lemma. O 


< 


We also need the following lemma whose elementary proof we leave as an 
exercise, 


Lemma 7.5. Let a and b be positive integers. Let # be a hypergraph on the vertex 
set Y so that each edge in # has at most a elements and no vertex of Y belongs 
to more than b edges. Then there exists a partition Y = Y, UY2U---UY, with 
r=(a—1)b+1 so that |HY;| <1 for all edges H € # and alli with cicr. 


We are now ready to present the upper bound on dimension established by 
Fliredi and Kahn (1986). 


Theorem 7.6. Let P be any poset with A(P) < k. Then 
dim(P) < 50k(log k)’. 
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Proof. Let n= |X|. When k < 500, the result follows from the R6édl-Trotter 
inequality dim(P) < 2k? +2, so we assume k > 500. Let # be the hypergraph 
whose vertex set is X and whose edges are the up sets U(x) ={ye X:x<y 
in P}. By Corollary 7.4, we obtain a partition X = X, UX,U---UX, where 
s = [(K + 1)/log(k + 1)] so that |U(x) 1. X;| < v = [4.7 log(k + 1)] for every x € X 
and every i = 1,2,...,5. 

Then let #; be the hypergraph obtained by restricting # to X;. Each edge in 
3%; has size at most vu and no point of X; belongs to more than k +1 edges. By 
Lemma 7.5, we obtain a partition X; = X;, UX; U---UX;, where r = (v — 1)(k + 
1) +1 so that |U(x) 1 X;;| < 1 for all x, i, 7. Now we know by Lemma 7.2 that 


d= dim(1,u+lir) < (v+1) (1 + log eae dy. 

v+1 
Let M,, M2,...,My be a family of linear orders on {1,2,...,7} so that for each 
(v + 1)-element subset S Cc {1,2,...,7} and cach y € S, there is some i with y the 
least element of S in M;. For cach ij, let Ni; be an arbitrary linear order on Xj; 
and let Ni denote the dual of Nj;, 1e., uw < v in Ni; <= v <u in Nj. Finally, for 
each i= 1,2,...,s and f/=1,2,..., .d, we define two linear orders L;, and Li on 
X by 


Li = NiM,(1) < Niat,i2) Ss Nii <X- X;, and 
Li = Nimjay < Ninja <0 < Nim < X — Xi 


In both L;; and Lijs the ordering on X — X; is arbitrary. Furthermore, the subscripts 
are interpreted so that M, orders {1,2,...,r} by Mj(1) < Mj(2) <---< Mj(r). 

Now suppose x,y € P with y ¢ x in P. Choose @ so that x € Xjq. Set T = {j: 
U(y)n Xi; # O}. Since |7"| < v, there is some 6 for which a is the least element of 
TU {a} in Mg. If y ¢ Xiq, then x < y in both Lig and Lig. ly € Xia, then x <y 
in exactly one of Lig and Li,. 


This shows that dim(P) < sd < 50k(logk)? as claimed. O 


Until recently it was not known whether there exists an absolute constant c 
so that dim(P) < ck whenever A(P) < k. Note that for each k > 2, the crown 
Siu; Satisfies A(S?,,) =k and dim(S°,,) =k +1. However, Erdés et al. (1991) 
have substantially improved this lower bound by investigating the dimension of 
a height-two random poset. They define the sample space Q(n,p) as consisting 
of all height-two posets containing m minimal points a1,a@2,...,@, and m maximal 
points 6y,b2,...,b,. For a poset P € Qn, p), the probability that a; < b; in P is 
equal to p (which in general is a function of n). Events corresponding to distinct 
pairs of points in P are indeperAient. 

Erdés et al. (1991) develop upper and lower bounds on the expected value of 
the dimension of a random poset for values of p in the range 


(logn)!** 


<p<l—nil’, 
n P 
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However, taking the particular value p = 1/logn, their results imply that there 
exist absolute positive constants 6, and & so that the following inequalities almost 
surely hold for the random poset P: 


A(P) < dn/logn and dim(P) > &x. 


This shows that if we define the function f: N — N by “f(k) is the largest positive 
integer for which there exists a poset P = (X,P) with A(P) < k and dim(P) < 
f(k)”, then there exist absolute positive constants c; and c; so that 


ck(logk) < f(k) < ek(logk)?. 


It is probably a very difficult problem to determine the correct exponent on the 
logarithmic factor in the preceding inequality. Perhaps the answer will come with 
improved bounds on the expected value of the dimension of a random poset in the 
sparse case, i.e., for values of p satisfying p < (logn)!** /n. Other dimension-related 
questions for random posets are posed in Erdés et al. (1991); further applications 
of dimension-theoretic concepts for random posets are given in Brightwell and 
Trotter (1994b). 

Many other interesting dimension-theoretic questions center around families of 
subsets ordered by inclusion. For example, the proof of Theorem 7.6 requires an es- 
timate on the dimension of the poset formed by the 1-element and (log 7)-element 
subsets of an n-element set. Lemma 7.2 gives an upper bound of the form c(log7)*, 
and Kierstead (1995) has just shown a lower bound of the form clog’ n/ log logn. 
So some genuinely new idea is needed to determine more accurate estimates for 
f(k). 

Dushnik (1950) developed upper and lower bounds for dim(1, 437) which result 
in an exact formula when k > 2,/n, and Spencer (1972) gave asymptotic results 
when & is relatively small in comparison to n. Fiiredi et al. (1991) give the following 
asymptotic formula: 


dim(1,2: ”) = Iglgn+ (5 + o(t)) Ig lg lgn. 


Hurlbert et al. (1994) show that dim(2,n —2;n)=n—1, when n 25; and 
Brightwell et al. (1994) show that dim(s, s + k;n) = O(k? log n). 


8. Interval orders and semiorders 


Let # be a collection of closed intervals of R. Define a partial order P on # by 
I, < hin P => x < y in R for every x € Ij, and y € J). Posets obtained from this 
construction are called interval orders. The. following theorem of Fishburn (1970) 
provides a forbidden subposet characterization of interval orders. We leave the 
proof as an exercise. 


Theorem 8.1. A poset P is an interval order <= > P does not contain 2 +2. 
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A poset P is called a semiorder if there exists a function f: X — R so that x < y 
in P <> f(y) > f(x) +1. Evidently, a semiorder is an interval order having a rep- 
resentation in which all intervals have length 1. The following theorem due to Scott 
and Suppes (1958) provides a forbidden subposet characterization of semiorders. 
Again the proof is omitted. 


Theorem 8.2. A poset P is a semiorder <=> P does not contain either 2+2 or 
34+1. 


Rather than present the proofs of these two important, but by now well-known, 
theorems, we choose instead to discuss some recent work in which interval orders 
and semiorders surface in a surprising manner. 

Let # be a class of posets. We say that the on-line dimension of # is at most ¢ if 
there exists a strategy for constructing a realizer L;, L2,...,L, for any poset from 
¥ constructed in an on-line fashion. As was discussed in section 4, the poset and 
the realizer are to be constructed one point at a time. At each step, a new point 
is added to the poset. This point is then inserted into each of the existing linear 
extensions in a manner such that they remain a rcalizer. 

The reader may enjoy showing that if # is the set of posets of dimension at most 
2, then the on-line dimension of # is infinite, i.e., for each t, there is no algorithm 
which will construct on-line realizers of size ¢ for poscts from #. 

The members of family {S‘: k > 0} of 3-dimensional crowns are called 3-crowns. 
The following result is due to Kierstead et al. (1984). 


Theorem 8.3. Let ?,, denote the class of posets of width at most n which do not 
contain any 3-crowns. Then the on-line dimension of P, is at most ((S" — 1)/4)!. 


Proof. Set ¢ = (5” — 1)/4. Use Kierstead’s theorem (4.2) to construct an on-line 
partition of a poset P in ¥ into ¢ chains C),C2,...,C,. For each x € X, let C(x) 
denote the unique @ for which x € Cy. Let M be any of the ¢! linear orders on 
{1,2,...,t}. We construct on-line a linear extension Ly of a poset from #,. When 
the new point x enters P, let S; = {y € P: x <y in P} and S) = {y € P: x || y in 
P and C(x) < C(y) in M}. If S| US = 9, insert x at the top of Ly. If $; US: £9, 
and y is the lowest element of S, U S2 in Ly, insert x immediately under y. 

We now show that the set of all z! linear extensions of P determined by this pro- 
cedure form a realizer of P. To accomplish this, choose an arbitrary incomparable 
pair x || y from P. We show that there is at least one M for which x > y in Ly. In 
fact, we will show that x > y in some Ly so that C(y) is the least clement in M 
and C(x) is the greatest element in M. 

A fence F starting up at x and ending at y is a sequence x = xg,X),X2,-..,%; = 
y for which the only comparapilities between these points are xy <x) > x2 < 
x3; >--- Note that such a fence starts up from xj = x, but can end either up 
or down at x; = y. 

Let Y = {1,2,...,¢} — {C(x), C(y)}. Define a binary relation Q on Y by 
“aQB <=> there exist points u,v and a fence F starting up at x and ending at 
v so that a = C(u), B = C(v),u <y and {u,y} || F in P”. 
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Claim. /f aQB and yQ6, then either aQé or yQB. 


Proof. Choose points u,v and a fence F = {x = xo, x|,X2,...,x; =u} which wit- 
ness aQB. Similarly, choose points w,z and a fence G = {x = uo, u1,...,uj =z} 
which witness yQ6. Suppose that neither aQ6 nor yQB. Then u is less than one 
or more points in G. Choose the least m so that u < u,,. Then m is odd (and posi- 
tive). Similarly, let p be the least integer so that w < x». The p is odd and positive. 
Then the set H = {x,x1,X2,..-,Xp, U1, U2,...,Um} is connected and has Xp and uy, 
as distinct maximal elements. Let K be a minimum-size connected subposet of H 
containing x, and u,,. Then K U {y, u, w} is a 3-crown. The contradiction completes 
the proof of the claim. 0 


From its definition, it is clear that Q is irreflexive, i.e., we never have aQa for 
any a € Y,. By taking 6 = y in the claim, we conclude that either aQy,or BOB. 
Hence aQ6. This shows Q is transitive, and is therefore a strict partial order on 
Y. Furthermore, in view of Theorem 8.1, (Y,Q) is an interval order! 

Choose an interval representation of (Y, Q) in which all end points are distinct. 
Then let M’ be the linear extension of Q determined by the left end points in this 
representation. Form M from M’ by adding C(y) as least element and C(x) as 
greatest element. We claim that x > y in Ly. 

Suppose to the contrary that x <y in Ly. When the latter of these two 
points enters the poset, let x = ug < uy < U2 < +--+ < us <Uss) = y be the se- 
quence of points between x and y in L,,. Note that for all i =0,1,2,...,5, either 
(uj <Ujsy in P) or (u; || ua; and C(u;) < C(ujs1) in M). We call any sequence 
X = Ug, U4, V2,-.-,;UmsUmy) = y a blocking chain if 

(1) v9 < uy <6 <p < Uy. IN Lys 

(2) for / =0,1,...,m, either (vy; < via, In P) or (v; || vj; and C(v;) < C(vj41) in 
M). 

Of all blocking chains, we choose one, say {vg,V},--.,Um+t}, for which m1 is as 
small as possible. For each é = 1,2,...,m, let a = C(u;). The minimality of m 
implies that v; || vj and a >a; in M whenever 0 <i, j <m+1 and |j —i| 22. It 
follows that m is even and that for even i, vi < vj, in X. For odd i, y; || visi and 
ay < A+ in M. Thus Cly) < Gy) < Am < An-3 < An-2 So < hs < HH < 03 < 
a4 <a <a < C(x) in M. 

Next we observe that the set F = {vo, v1} is a fence starting up at x and ending 
at v1. Also {y, um} || F. It follows that a,, <a, in Q. Choose the largest integer 
i so that am <a; in Q. Suppose first that i is odd. Then @; < qs; in M, which 
implies that the left end point of the interval corresponding to a; is less than the 
left end of the interval corresponding to a;,;. However, the inequality a, < a; in 
Q implies that the interval for a, lies entirely to the left of the interval for aj. 
This in turn implies @,, < a; in Q, contradicting our choice of i. 

Now suppose that i is even. Choose points u€ Ca,, z€Co, and a fence 
F = {x = 20, 21, 22,---.%, = Z} Starting up at x and ending at z so that u < y and 
{u,y} || F. Let u’ = MAX{u, v,»,} and let G be a minimum-size connected subposet 
of F U {v;, u;.,} so that G contains both x and u;,,. Then G is a fence starting up 
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at x and ending at v,,;. Furthermore, {y,u’} || G which implies a, < a;,; in Q. 
This is also a contradiction. O 


It is mildly irritating that we do not know whether it is necessary to exclude all 
3-crowns from the posets in order to have finite on-line dimension. It is certainly 
necessary to exclude $3, but perhaps this is enough. 

We next discuss an extremal problem for interval orders with a surprising con- 
nection to hamiltonian circuit problems. It is well known that there exist posets 
whose cover graphs have large chromatic number [see Kriz and Neéetfil (1991), 
for example]. It is easy to see that such graphs exist as cover graphs of interval or- 
ders. In connection with this topic, Felsner and Trotter (1995) made the following 
conjecture. 


Conjecture 8.4. Let n 21, and let ¢=2". Then there exists a permutation 
Ai,..-,A, of the subsets of {1,...,2} so that: 

(1) A; = 9, and 

(2) For each i =1,2,...,¢—1, either Aj C Aj,: or Ajs, C Aj. Furthermore, 
|A;4A;,,| = 1. 

(3) For each i,j =1,2,...,¢, if Aj C Aj andi > j, theni=j+1. 


Here is a re-formulation of the preceding conjecture as an extremal problem. 


Conjecture 8.5. For each n > 1, let f(n) = s be the largest integer for which there 
exists a sequence B,, B>,...,B, of distinct subsets of {1,2,...,} so that: 

(1) B, = 0, and 

(2) for each i = 1,2,...,s —1, Bj, ¢ B;, and 

(3) for each i = 1,2,...,8 — 2, Bixg Z B; U Bis. 
Then f(n) = 2-1! + [(2+1)/2]. 


Trotter and Felsner show that f(m) < 2"-' + |(m+1)/2] and that equality holds 
if and only if Conjecture 8.4 is valid. These conjectures are related to the following 
(surprisingly difficult) problem. 


Conjecture 8.6. Let G denote the comparabilty graph of the poset formed by the 
k-element and (k + 1)-element subsets of a (2k + 1)-element set partially ordered 
by inclusion. Then G has a hamiltonian cycle. 


Although Conjecture 8.6 is known to be true for small values of k, most of 
the results thus far are negative, i., a hamiltonian cycle cannot be formed by 
combining certain types of matchings [see Duffus et al. (1988) and Kierstead and 
Trotter (1988), for example]. Ca the other hand, it is shown in Felsner and Trotter 
(1995) that there exists a cycle whose size is at least one-fourth of the total number 
of vertices. This fraction has subsequently been raised by C. Savage and P. Winkler 
(pers. comm). 

We next discuss an elementary extremal problem for posets. For integers n,k 
with 0 < k < (5), let O(n,k) denote the class of all posets having n points and 
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k comparable pairs. For each poset P € Q(n,k), let e(P) count the number of 
linear extensions of P. Then set e(n,k) = max{e(P): P € Q(n,k)}. Fishburn and 
Trotter (1992) then show that the extremal posets are semiorders. 


Theorem 8.7. Every poset P € O(n, k) with e(P) = e(n, k) is a semiorder. 


Proof. Let P be such a poset. We first show that P does not contain 2 + 2. Suppose 
to the contrary that the chains u < x and v < y are incomparable. Of all such pairs 
of chains, we choose two for which |U(x)| + |U(y)| is minimum. We may therefore 
assume that U(x) C U(y). Let P’ be the poset obtained by replacing the relations 
z<y by z <x for all z € D(y) — D(x). Then P’ € Q(n,k). We now show that 
e(P’) > e(P). 

Exchanging x and y maps €(P) — €(P’) to €(P’) — €(P). Although the map is 
1-1, it is not onto since any linear extension in which y < u < v <x is not in the 
image of the map. The contradiction shows P is an interval order. “ 

If P contains a 3-element chain x < y <z with all three points incompara- 
ble to w, form P” from P by replacing the relations y < z by w < z for all z € 
U(y) — U(w). As before, e(P”) > e(P). The contradiction shows P is a semiorder. 

O 


At first glance, Theorem 8.7 seems to be very helpful in determining e(n, k), 
and some progress is made in Fishburn and Trotter (1992). However, the general 
problem remains open. 

Recently, P. Winkler (pers. comm.) has proposed another extremal problem in- 
volving semiorders. For a poset P = (X, P) and a point x € X, let deg(x) = |{y € 
X:y<xinP, ory >x in P}|. Then define the flexibility of P, denoted flex(P), 
by 


flex(P) = 5 \(deg(x))’. 


x€X 


Now let n and k be fixed integers with 0 < k < (5). It is an easy exercise to show 
that among all poscts containing » points and k comparable pairs, any poset with 
maximum flexibility is a semiorder. 


Problem 8.8. For fixed n and k, find all semiorders with n points and k compa- 
rable pairs for which the flexibility is maximum. 


Problem 8.9. For a poset P = (X, P) with n points and k comparable pairs, let i 
be an integer with 0 <i < & and let a; denote the number of permutations (linear 
orders, not necessarily linear extensions) of the ground set X so that exactly i of 
the k comparable pairs are in the same order as in P. Then S>i.,@; =n!, and 
a; = ax_;, for each i = 0,1,...,4. Is the sequence ao, a),...,a, unimodal? 


Winkler noted that the sequence need not be log-concave, so the mixed volumes 
approach used by Stanley (see the discussion in section 6) will not apply. 

The next result is a recent theorem of Brightwell (1989) establishing the 4-3 
conjecture for semiorders. 
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Theorem 8.10. Let P = (X,P) be a semiorder which is not a chain. Then X con- 
tains a pair x,y of incomparable points with + < Prob[x < y] < 2. 


Proof. Suppose the result is false and choose a counterexample with |X| mini- 
mum. Define a linear extension L by x < y <=> Probjx < y] > 3, Let |X| =n and 
label the points in X so that xj < x2 <+-- <x, in L. 

Since P is a semiorder, there exists a function f: X — R so that x < y in P <=> 
f(y) > f(x) + 1. We now show that f(x,) < f(x2) < +++ < f(x). Suppose to the 
contrary that 1 <i <j <n, but that f(x,) < f(x,;). Then x; ¢ x; in P. However, 
x; <x;in L,sox, ¢ x; in P. Thus x; || x; in P. However, the inequality f(x;) < (xj) 
implies that « > x; in P whenever wu > x; in P. Dually v < x; in P whenever uv < x, 
in P. 

It follows that every L € €(P) with x; < x; in L can be transformed into a linear 
extension L’ with x; < x, in L’ just by interchanging these two points. The mapping 
is 1-1 which shows Prob(x; < x;] 2 4. Thus Prob[x; < x;] > 3 and x; < x; in L. The 
contradiction shows f(x,) < f(x2) <---< f(x») as claimed. 


We now show that x; || x:4: for all i= 1,2,...,2—1. Suppose to the contrary 
that x; < x;,, in P. Then every point of X’ = {x), x2,...,x;} is less than every point 
of X” = {xi41,Xi01,-.-,4tn}. At least one of P’ and P” is not a chain, and we can 


restrict our attention to that subposet to locate x and y. The contradiction shows 
X; {| Xia. for §=1,2,...,n — 1 as claimed. 

We say x; separates x; and x;,, from above if x; :> x; and x; || x;,;. Dually we 
say X; Separates x; and x;,, from below if x; <: xj4, and x; || xj. We say x; separates 
x; and x;,, if it either separates them from above or it separates them from below. 

If x; separates x; and x;,, from above, then x, < x; in P fork = 1,2,...,i. This 
implies that x, docs not separate x, and x,,; when 1 < k < i. Dually, if x; separates 
x; and x;,, from below, then x; does not separate x, and x,,, when i < k <n. So 
each x; scparates at most one pair from below and at most one pair from above. 

However, x; and x2 cannot separate pairs from above and x,_; and x, cannot 
separate any pair from below. If follows that there are at most 2(n — 4) +4 = 2n -4 
ordered pairs (i,j) so that x; separates x; and xj4;. Hence there is at Ieast one (in 
fact at least two) valucs of i with 1 < i < n for which there is at most one j so that 
x; separates x; and xj41. 

For such a value of i, partition €(P) into three sets by letting €, = {L € €(P): 
Xx; <Xi4; in L, and no element separating x; and x;,, is between them in L}; 
io) = {L € E(P): Xj; <Xj4; and L ¢ é€,}, and = {L € €(P): Xin) <X; in L}. Now 
let ¢ = |€(P)|. Then |@,| + |€2| > 27/3. 

If |&2| > /3, let j be the unique integer so that x; separates x; and x,,;. Then x; 
is between x; and x;,, in every L € €3. If j > i + 1, this implies Prob[x; < x;41] 2 4, 
and if j <i, it implies Prob[x;y< x;] > 4. Both of these implications are false, so 
we know || < ¢/3. Thus |@,| > 1/3. 

Now let L € €,, Form L' from L by interchanging x; and x;,,. This interchange 
is possible since any point between x; and x;,; in L is incomparable with both. 
This procedure determines a 1-1 map from €, to @3. However, no such map exists 
because |€3| < ¢/3. The contradiction completes the proof. © 
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The connection between semiorders and the }— ? conjecture is even more com- 
plex than suggested by the preceding result. Consider the infinite poset P = (X,P), 
where X = {x;: i € Z}. Furthermore, we define x; < x; in P if and only if i < j —1 
in Z. Then P is a width-two semiorder. Also observe that any incomparable pair 
in P is of the form (x;, x;,,), for some i € Z. For a positive integer n, let P,, denote 
the subposet of P determined by the points whose subscripts in absolute value are 
at most 7. Then it is an easy exercise to show that 


lim Prob[x9 > x;] = (S - V5)/10. 


Note that (5 — V5)/10 ~ 0.2764 < 4. So the 1-2 conjecture is false for infinite 
width-two semiorders, even though it is true for any finite poset which is either a 
width-two poset or is a semiorder! Also note that this example shows that the 
inequality in Theorem 6.7 is best possible when considering infinite posets of 
bounded width. Further results on infinite posets and balanced pairs are given 
in the recent papers Brightwell (1988, 1993). 

Rabinovitch (1978) showed that the dimension of a semiorder is at most 3, but 
that an interval order may have arbitrarily large dimension. Fiiredi et al. (1991) 
showed that if P = (X,P) is an interval order of height n, then 


dim(P) < Iglgn + G + o(1)) Ig Ig lgn. 


This inequality is best possible. For an integer n > 2, let 1, denote the canonical 
interval order consisting of all intervals with integer end points from {1,2,...,n}. 
Then Fiiredi et al. (1991) showed that 


dim(I,) = Iglgn+ G +0(1)) Ig Ig ign. 


This formula is closely related to the asymptotic formula given in section 7 for 
the dimension of the poset formed by the 1-element and 2-element subsets of 
{1,2,...,m}. 


9. Degrees of freedom 


Given a family ¥ of sets, a poset P is called an ¥-inclusion order if there exists 
a mapping which assigns to each x € X a set S(x) € # so that x <y in P <> 
S(x) C S(y). As an example, if ¥ is the collection of all closed intervals of R, then 
the ¥-inclusion orders are exactly the posets with dimension at most 2. 

Fishburn and Trotter (1990) studied the class of angle orders. These are the 
posets which arise when & is the set of angular regions in the Euclidean plane, 
ie., convex regions bounded by two rays emanating from a common point. They 
proved that every interval order is an angle order, as is every poset of dimension 
at most four. Both results admit elementary proofs; but while the first result may 
be mildly surprising, the second is certainly not. In a certain sense, to specify an 
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angle requires four coordinates—two to locate the corner point and one for each 
ray to specify the angle from [0,27r) at which it leaves the corner point. 

Fishburn and Trotter conjectured that not all 5-dimensional posets are angle or- 
ders, but were only able to prove the existence of a 7-dimensiona! poset which is not 
an angle order. R. Jamison (pers. comm.) settled this conjecture in the affirmative 
with an intricate ad hoc argument. However, Alon and Scheinerman (1988) have 
produced a much more general result using a powerful theorem of Warren (1968). 
We now outline their approach. 

For x ER, let sgn(x) = + if x > 0 and sgn(x) = — if x <0. For a vector x= 
(x1,X2,..-,%r), sgn(x) denotes the vector (sgn(x,), sgn(x2),...,sgn(x,)). The vector 
sgn(x) is called the sign pattern of x and t is the length of the pattern. We say ¥ 
has at most k degrees of freedom, and write deg(¥) < k, if the following conditions 
are satisfied: 

(1) there exists a mapping F which assigns to each Sc # a k-tuple F(S) = 
(S(1), S(2),...,5(k)) from R*; 

(2) there exists a finite set P,, P2,...,.P; of polynomials in 2k variables x;,x2,..., 
Xo; and 

(3) there exists a set J of sign patterns of length ¢ so that for every pair S,T of 
sets from ¥, S Cc T <> sgn(y(S,7)) € J where y(S, 7) is the vector of length ¢ 
whose jth coordinate is given by P;(S(1), 5(2),...,S(k), T(1), T(2),-.., T(k)). 

To illustrate this definition, let # denote the set of closed disks in R?. With 
each set (disk) S, we take (S(1),S(2)) as the coordinates of the center of S and 
S(3) as the radius. Take P, = x9 — x3 and Pz = (x6 — x3)* — (%5 — x2)? — (x4 — x1). 
Then take J = {(+,+)}. Let § and T be disks. Then S C T <=> the distance from 
the center of S$ to the center of T plus the radius of S is less than or equal to 
T <> sgn(y(S,7)) € J. This shows deg(#) < 3, i.e., ¥ has at most three degrees 
of freedom. As a second example, the set sf of angular regions in R? has at most 
four degrees of freedom. Here is Warren’s theorem (Warren 1968). 


Theorem 9.1. Let P,, P2,...,P, be polynomials in m variables and let d denote the 
maximum degree among these polynomials. Then there are at most (4edt/m)” sign 
patierns of the form sgn(P (x), P2(x),...,Pr(x)) where x = (x1,X2,...,%m) ranges 
over R’”, 


Let P(n,k) denote the number of labelled posets on 1 points having dimension 
at most k. Clearly P(n,k) < (n!)* < n*. Subsequent arguments will require the 
following lower bound on P(n, k) due to Alon and Scheinerman (1988). We leave 
the proof as an exercise. 


Theorem 9.2. The number a k) of labelled posets on n points having dimension 
at most k satisfies: 


P(n,k) 2 (n/ log ny"*-2?n/ log 


The preceding two results combine easily to prove the following striking result 
of Alon and Scheinerman (1988). 
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Theorem 9.3. Let ¥ be any family of sets having at most k degrees of freedom. 
Then there exists a poset P with dim(P) < k +1 which is not an ¥-order. 


Proof. Suppose deg(#) < k is witnessed by the set P;, P2,...,P, of polynomials 
and the set J of test patterns. Let d denote the maximum degree of these poly- 
nomials. We show that when n is sufficiently large, there is a labelled poset on n 
points having dimension at most k +1 which is not an ¥-order. 

When P is a labelled posct on the ground set X = {1,2,...,n} and P is 
an #-order, we designate the scts in -* corresponding to the points in X by 
(S,, S2,...,S,). Bach S; is associated with a vector x, = (x;),x;2,...,X;,). For each 
ordered pair (/,j) with 1 <i, f <n and ij, we knowi<j in PeSS,c 
S; <=> sgn(y(S;,5;)) € J. Recall that the ath coordinate of the vector y(S;, S;) is 
Po(xit, X2, very Miky XjlyXj2s--- )X jk): 

Concatenate in lexicographic order the n(m — 1) vectors y(S;,5;) into a single 
vector y of length n(n — 1)t. Then sgn(y) is the sign pattern of a vector whose 
entries are determined by a family of n(m — 1) polynomials in nk variables x;g 
where 1 <ignand1<B<k. Py 

If (7\,72,...,7,) is another n-tuple of sets from ¥ which yields the same 
sign pattern sgn(y) as (S;,52,...,5,,), then these two n-tuples correspond to the 
same labelled poset P. By Warren’s theorem, we conclude that there are at most 
(4edn(n — 1)t/nk)"* possible sign patterns. However, this number is clearly less 
than P(n,k +1) when n is sufficiently large. O 


There are a number of perplexing open problems involving the representation of 
posets as a family of sets ordered by inclusion. Here are two of the most appealing. 
Given a point x in d-dimensional Euclidean space and a positive number r, let 
By(x,r) denote the ball of radius r centered at x, i.e., the set of points at distance 
at most r from x. It is customary to call B,(x,7) a d-dimensional sphere. A poset 
P = (X, P) is called a d-dimensional sphere order if for each x € X, there exists a 
d-dimensional sphcre B, so that x < y in P if and only if B, C By, for all x,y € X. 


Problem 9.4. If P is a finite poset, does there always exist a positive integer d so 
that P is a d-dimensional sphere order? 


For historical reasons, a 2-dimensional sphere order is called a circle order, 
although it might be better to call it a disk order. Fishburn (1988) proved that 
every interval order is a circle order. Also, it is easy to see that every 2-dimensional 
poset is a circle order; in fact, we may require that the centers of the circles used 
in the representation be collinear. On the other hand, by Theorem 9.3, there exists 
a 4-dimensional poset which is not a circle order. 


Problem 9.5. If P is a finite poset and dim(P) < 3, is P a circle order? 


Problem 9.5 is intriguing because Scheinerman and Weirman (1989) showed that 
the countably infinite 3-dimensional poset Z* is not a circle order. On the other 
hand, it is a relatively easy exercise to show that if &, is the family of all regular 
n-gons in the plane (with bottom side horizontal) and P is any finite poset with 
dim(P) < 3, then P is a ¥,-inclusion order, for all m > 3. 
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10. Dimension and planarity 


A poset P is said to be planar if it has a planar Hasse diagram. The poset shown 
in fig. 10.1a is nonplanar, but the posets in figs. 10.1b and 10.ic are planar. Note 
that the diagram for the last example can be redrawn without edge crossings. 

As is well known, a planar poset P having a greatest and least element has 
dimension at most 2. Kelly and Rival (1975) provide a forbidden subposet charac- 
terization of nonplanar lattices by providing a minimum list ¥ of lattices so that 
a lattice P is planar if and only if P contains a lattice from ¥ as a subposet. One 
lattice from & is shown in fig. 10.1a. The lengthy argument for this theorem must 
be cleverly organized just so it can be written down on a finite number of pages. 
There are several other theorems in dimension theory for posets which exhibit 
these same characteristics: Kelly’s (1977) determination of all 3-irreducible posets, 
Trotter's (1981) determination of all 3-interval irreducible posets of height 2, and 
Kimble’s (1973) proof that if » > 4 and |X| <2n+1, then dim(P) <7 unless P 
contains the standard n-dimensional poset S2. In fact, Gallai’s (1967) forbidden 
subgraph characterization of comparability graphs belongs in this same grouping -— 
especially in view of its value in obtaining a list of all 3-irreducible posets (see 
Trotter 1992, Trotter and Moore 1976). 

Planar posets can have dimension exceeding 2: the planar posets in figs. 10.1b 
and 10.1c have dimensions 3 and 4 respectively. Trotter and Moore (1977) proved 
that a planar poset having either a greatest or least element has dimension at most 
3. Kelly (1981) then constructed planar posets of arbitrary dimension by the device 
of embedding Ss, in a planar poset. Kelly’s construction is illustrated in fig. 10.2. 

Three interesting problems gemain. Do there exist irreducible planar posets of 
arbitrarily large dimension? Provide a characterization of planar posets in terms 
of forbidden subdiagrams. Develop a fast algorithm which will produce a planar 
drawing of the Hasse diagram of a poset if such a drawing exists. 

Recently, Schnyder (1989) produced a striking theorem relating dimension 
and planarity in a different manner. Let G = (V,£) be an ordinary undirected 
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Figure 10.2. 


graph. We associate with G a poset P = P(G) of height 2. In P, MIN(P) = V and 
MAX(P) = E. Also vertex x is less than edge e in P <=> x is an end point of e. 
We call P the incidence poset of G. Here is Schnyder’s theorem. 


Theorem 10.1. Let G be a graph. Then G is planar <=> the dimension of its 
incidence poset is at most 3. 


Proof. Let P = P(G) be the incidence poset of G. Suppose first that dim(P) < 3 
We show that G is planar. Suppose to the contrary that G is nonplanar. [The 
argument we give for this part is patterned after a proof of Babai and Duffus 
(1981).] 

Choose an embedding of P in R? which associates with each y € V UE a vector 
y = (v1, ¥2,¥3) € R® so that usu in P = > uy; <u; in R for i=1,2,3. For each 
y € X UE, let m(y) be the orthogonal projection of y on the plane x, + x2 + x3 = 0 
in R?. Without loss of generality, all points in X U E project to distinct points on 
the plane x; + x2 + x; = 0, and these points are in general position. 

For each u € X and each e € E containing u as an end point, join m(u) and 
a(e) with a straight line segment. Since G is nonplanar, there exist distinct vertices 
u,v € V and distinct edges e, f € E so that u is an end point of e but not of f, uv 
is an end point of f but not of e, and the line segment 7(x)7(e) crosses the line 
segment m(v)7(f) at a point p interior to both. Let z be the point on the line 
segment ue in R? so that (z) = p. Also let w be the point on the line segment vf 
in R? so that a(w) = p. Then either z < w in R? or w <z in R°. However, z < W 
implies u < z< w <f, which is false since u is not an end point of f. Similarly 
w <z implies v < e which is also false. The contradiction shows that G is planar. 


470 WT. Trouer 


Figure 10.3. 


Now supposc that G is planar. We show that P has dimension at most 3. Without 
loss of generality, we assume that G is maximal planar. Choose a planar diagram 
of G using straight line segments for the edges. This diagram is a triangulation T 
of the plane. Each interior region is a triangle, and T has three exterior vertices 
which we label in clockwise order 2¢), i¢2, and w3. 

Now consider a function f which assigns to each angle of each interior triangle 
of T a color selected from {1,2,3}. The function f is called a normal coloring of 
i 

(1) all angles incident with exterior vertex u; are mapped by f to color i for 
PS 122.3% 

(2) at each interior vertex u of T, there is an angle mapped by f to color i for 
i=1,2,3; 

(3) at each interior vertex u of T, all angles mapped by f to color ¢ are consec- 
utive for i = 1,2,3; 

(4) at each interior vertex u of T the block of angles mapped by f to color 2 
appears immediately after the block of angles mapped by f to color 1; and 

(5) for each elementary triangle of T. f assigns the three angles to colors 1, 2, 
and 3 in clockwise order. 

We illustrate this definition in fig. 10.3 with a normal coloring of a triangulation. 

The following claim yields to a straightforward inductive argument and its proof 
is left as an exercise. 


Claim 1. Every planar iriangglation has a normal coloring. 


Let C be a cycle in a planar triangulation 7 which has been colored normally. 
A vertex x belonging to C is called a Type i vertex on C if all angles incident with 
x and interior to C are colored ?. When C is exterior triangle, u; is a Type i vertex 
on C. 
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Claim 2. [If C is a cycle in T, then C contains a Type i vertex, for each i = 1,2,3. 


Proof. Suppose the claim is false. Choose a counterexample C containing the 
minimum number of elementary triangles. Clearly C is not the boundary of an 
clementary triangle. Now suppose C does not have a Type 1 vertex. 

Suppose that C has two nonconsecutive vertices x and y which are adjacent via 
an edge e = xy interior to C. Then the region bounded by C can be partitioned 
into regions bounded by cycles C’ and C” having e as a common edge. Now C’ 
and C” both have a Type 1 vertex. If x is a Type 1 vertex for C’ and for C”, then 
x is a Type 1 vertex for C. An analogous statement holds for y. We conclude that 
one of x and y is a Type 1 vertex for C’ and the other is a Type 1 vertex for 
C". Consideration of the two elementary triangles sharing the edge shows this is 
impossible. 

Now let C = {x,X2,...,Xs} and let x; and x;,,; be any two consecutive vertices 
of C and let z; be the vertex so that x;x;,,z; 1s an elementary triangle interior to 
C. Let C; be the cycle obtained by deleting the edge x;x;,; and adding the edges 
x;Z; and z,x;,,;. Then C; has a Type 1 vertex because it contains fewer elementary 
triangles than C. Clearly z; cannot be a Type 1 vertex on C; because z; is an 
interior vertex of 7. 

It follows that one of x; and x;,, is a Type 1 vertex on C;. If x; is Type 1 on C,, 
then the angle of triangle x;x;,;z; incident with x; must be colored 3; else x; is Type 
1 on C. Thus the angle of x;x;,12; incident with x;,, is colored 1. This implies that 
X;., is not Type 1 for C;. Dually, if x;., is Type 1 for C;, then the angle of x;x;12; 
incident with x;,; is colored 2, the angle of x;x;,.,z; incident with x; is colored 1, 
and x; is not Type | for C;. 

If some vertex x;,; is Type 1 for both C; and C;,,, then x; is Type 1 for C. So 
either x; is Type | for C; fori = 1,2,...,s, or x;41 is Type 1 for C; for? = 1,2,...,5 
In the first case, there is no Type 2 vertex on C); in the second, there is no Type 
3 vertex on C2. The contradiction compictes the proof. O 


Claim 3. Let P; be the binary relation on the set V of vertices of G defined by 
xPiy <=> there exists an elementary triangle T having x and y as vertices in which 
the angle incident at y is colored i. Then the transitive closure Q; = P; is a partial 
order on X. 


Proof. It suffices to show Q; has no directed cycles. This follows from Claim 2 
since a directed cycle in Q; could not have cither a Type i+ 1 or a Type i+2 
vertex. O 


For each i = 1,2,3, let M; be a linear extension of Q;. Then let L; be any linear 
extension of P so that: 

(1) The restriction of L; to V is M;. 

(2) For each e € E, the M,-largest element of V which is less than e in M; is less 
than e in P. 

Alternatively, L; is obtained from M, by inserting the elements of E as low 
as possible. To complete the proof, it suffices to show that P = L,; 1 L2M L3. To 
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Figure 10.4. 


accomplish this, it is enough to show that for each edge e = xy and each vertex z 
not an end point of e, there exists some i so that z > e in L;. This means that we 
must find some M; in which z is above both x and y in M;. In fact we show that 
there is some i for which z > x and z > y in Q;. 

If z is an exterior vertex, say z = u;, then z is the largest element in Q;. Now 
suppose z is an interior vertex. Then for each i = 1,2, 3, there is a path S;(z) from 
z to the ith exterior vertex u;. The starting point of S;(z) is v9 = z. If vj has been 
determined, and vu; is an interior vertex, then v;,, is the unique vertex so that the 
angles at v; on either side of the edge u,v;,, are colored i+ 1 and i +2. 

The paths S;(z), S5(z), and S3(z) are pairwise disjoint and partition T into three 
regions R,,R2, and R3 as shown in fig. 10.5. 

If the edge e = xy joins two vertices in the region R;, then z is greater than both 
x and y in Q;. This completes the proof. 


It is well known that the problem of deciding whether a poset P satisfies 
dim(P) < 2 belongs to the class P of problems admitting a polynomial-time so- 
lution. For fixed t 23, Yannakakis (1982) proved that the problem of decid- 
ing whether a poset P satisfies dim(P) <7 is NP-complete. For these reasons, 
Schnyder’s theorem (10.3) is all the more striking since it equates a well-known 
polynomial-time problem, planarity testing, with an apparently NP-complete prob- 
lem, deciding whether a particular poset has dimension at most 3. However, the 
poset being tested has a special form. The maximal elements all have degree two 
in the comparability graph. Also, it is not known whether it is NP-complete to 
answer whether the dimension of a height-two poset is at most 3. The answer is 
“yes” for dimension 4 or more. 

Schnyder’s theorem has bee applied to find efficient algorithms for laying out 
a planar graph on a grid (see Kant 1992, Schnyder 1990, Schnyder and Trotter 
1995), Recently, Brightwell and Trotter (1994a) have extended Schnyder’s theorem 
to arbitrary planar maps. 


Theorem 10.2. Let G be a planar multigraph and let D be a drawing of G in 
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the plane so that no edges cross. Then let P(D) be the poset consisting of the 
vertices, edges and faces of the drawing D partially ordered by inclusion. Then 
dim(P(D)) < 4. 


The proof of Theorem 10.4 depends on the development of special graph- 
theoretic techniques applied to ordinary planar graphs satisfying a property some- 
what weaker than 3-connectedness. The argument is inductive and required 
Brightwell and Trotter (1993) to first establish the following theorem. 


Theorem 10.3. Let M be a convex polytope in R’, and let Py denote the poset 
consisting of the vertices, edges and faces of M partially ordered by inclusion. Then 
dim(Py,) = 4. 


In fact, the proof of Theorem 10.3 yields the even stronger conclusion that the 
subposet of Py determined by the vertices and the faces of M is 4-irreducible. 
Theorem 10.3 cannot be extended to yield a bound of the dimension of the face 
lattice of a convex polytope in R” for n > 4. This is due to the existence of cyclic 
polytopes (see the discussion in Brightwell and Trotter (1993)]. 

However, Theorem 10.3 can be extended to surfaces of higher genus since it 
iS easy 10 prove by induction on n the existence of a function f(m) so that the 
dimension of the poset of vertices, edges and faces of a multigraph drawn without 
edge crossings on a surface of genus n has order dimension at most f(n). The 
only difficulty encountered in establishing the existence of f(7) is the case n = 0. 
Here we know of no elementary proof of any finite bound, although of course 
Theorem 10.3 yields an upper bound of 4 in this case. 
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11. Regressions and monotone chains 


If P = (X, P) is a poset, we calla map f: X —+ X a regression if f(x) < x for every 
x € X. When C = {x) < x2 <--+ <x,} is a k-element chain in P, we say that a 
regression f is monotone on C if f(x1) < f(x2) <--- < f(x,). By convention, a 
regression is monotonic on any 1-element chain. 

For k > 1, there are several interesting conditions on a poset which guarantee 
that every regression is monotonic on some k-element chain. Here is an important 
example due to Rado (1971). 


Theorem 11.1. For every k > 1, there exists an integer no = no(k) so that ifn > no 
and f is a regression on the subset lattice 2", then f is monotonic on some k-element 
chain. 


An alternative proof of Theorem 11.1 has been provided by Harzheim (1982) and 
this argument extends to a wider class of posets. However, neither argument gives 
much information about how large no must be in terms of k. This is not surprising 
in view of the arguments’ dependence on Ramsey-theoretic tools emphasizing 
existence. 

By way of contrast, we present in this section a sharp result for posets of bounded 
width. The result is due to Peck et al. (1984). 


Theorem 11.2. Lez w and k be positive integers and let P = (X,P) be a poset of 
width at most w. If |X| > (w+1)*°! then every regression is monotonic on some 
k-element chain. 


Proof. We proceed by induction on k, noting that the case k = 1 is trivial. Now 
assume & > 2 and that the theorem holds for smaller values of k. Let P = (X, P) 
be any poset of width at most w and let f be any regression on P. We show f is 
monotonic on some k-element chain. 

For each x € X, let H(x) be the largest 1 for which there is a t-element chain 
Xy <X2 <+++ <x, = x on which f is monotonic. Without loss of generality H(x) < 
k —1 forallx € X. 

Then let Y = {x € X: H(x) <k—-1},F = {x € X: H(x) =k - 1, f(x) =x} and 
M={xe€X: H(x)=k—1,f(x) x}. Evidently, X = Y UF UM is a partition. 

Now suppose that F is not an antichain. Choose x,x’€ F with x < x’ in P. 
Then choose a (k — 1)-element chain x; < x2 < +--+ < x,_; =x on which f is mono- 
tonic. Then adding x’ to this chain yields a k-element chain on which f is mono- 
tonic since f(x,) < f(x2) <--+ < f(xy_1) = f(x) = x < x’! = f(x’). The contradic- 
tion shows that F is an antichajn and thus |F| < w. 

Next suppose that x € M and that H(f(x)) =k —1. Then we may choose a 
(k — t)-element chain x; < x2 <--- < x,_; = f(x) on which f is monotonic. Since 
f(x) <x, we may add x to this chain to obtain a k-element chain on which f is 
monotonic. The contradiction shows H(f(x)) < k —2 for every x € M,ie., k 23 
and f(M) c Y. 
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Now let y € Y. It is easy to see that f(y) also belongs to y. Thus the restriction 
of f to Y is a regression. Since this restriction is not monotonic on any chain of 
k —1 points, it follows from the inductive hypothesis that |Y| < (w + 1)*-?. 

Since |X| > (w +1)!” |Y| < (w+ 1)*-? and |F| < w, we conclude that 


|[M| = |X| -|Y|-|F| 
> (wel t—(w4 dk? —w 
= w({(w + 1)? = 1] 


2 wlY] 


Since the width of P is at most w, it follows that there is some yy € Y for which 
the inverse image f° '(yy) is not an antichain. We may then choose distinct points 
x, x’ € M for which f(x) = f(x’) = yo and x < x’ in P. As before, we choose a 
(k — 1)-element chain x; < x2 < +++ < x,_,; =x on which f is monotonic and add 
x' to form the desired k-element chain. 


The reader may enjoy the challenge of showing that the inequality |X| > (w+ 
1)* ! in Theorem 11.2 is best possible. The basic idea is to fix w and then construct 
a poset P, = (X,, P,) and a regression f on P, by induction on &. The poset P, 
is a w-clement antichain, and P, is constructed by placing w disjoint chains, each 
containing 1+(w+1)* ? points, on top of Py_,. We refer the reader to Peck et 
al. (1984) for further details. 

There appears to be some intrinsic connection between regressions and arith- 
metic progressions. Following Trotter and Winkler (1987), we define an arith- 
metic progression in a poset P = (X,P) as a chain x; <x. < +--+ <x; for which 
there is a constant d so that there are exactly d points in each of the intervals 
{y EX:x) Sy <x} fori =1,2,...,¢— 1. The following result is due to Trotter 
and Winkler (1987). 


Theorem 11.3. Let k and w be positive integers and let ¢ > 0. Then there exists 
a number ny = no(k, w, €) so that if P = (X,P) is a poset of width at most w and 
|X| 2 no, then for every subset S CX with |S| > e|X|, there is a k-element chain 


X1 <X_<+++ <x, contained in S which is also a k-term arithmetic progression in 
P. 


The proof of Theorem 11.3 proceeds by induction on w with the case w = 1 being 
a restatement of Szemerédi’s (1975) celebrated theorem on arithmetic progressions 
in subscts of N having positive upper density. It is reasonable to conjecture that 
for each k > 1, there is some m = no(k) so that if L = (X,P) is any distributive 
lattice with |X| > np, then every regression on L is monotonic on some k-element 
chain. This is supported by Theorems 11.1 and 11.2. 

Also, we believe that for every k > 1 and every « > 0, there issome no = no(k, €) 
so that if L = (X,P) is a distributive lattice with |X| > yg and S is any subset of 
X with {S| > e|X|, then S contains a k-term arithmetic progression. It is an easy 
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exercise to show that this conjecture holds in the case where L is a subset lattice 
of the form 2”. 

Some modest progress has been made on these conjectures. Alon et al. (1987) 
study regressions on up sets in n?, while Kahn and Saks (1988) show that for each 
e > 0, there exists an integer mo so that if L = (X,P) is a distributive lattice and 
|X| > no, then any antichain in L has less than ¢|X| points. 
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1. Introduction/ History 


Matroid theory dates from the 1930s when Whitney (1935) produced his seminal 
paper in which the term matroid first appeared. As the word suggests, Whitney 
conceived a matroid as an abstract generalisation of a matrix, and much of the 
language of the theory is based on that of linear algebra. However, Whitney’s 
approach was also motivated by his work in graph theory and as a result some of 
the matroid terminology has a distinct graphical flavour. Some time later Van der 
Waerden also used the concept of abstract dependence in his “Modern Algebra’’. 

Apart from isolated papers by Birkhoff (1935), Maclane (1936) and Dilworth 
(1941-1944), on the lattice theoretic and geometric aspects of matroid theory, 
and two important papers by Rado (1942, 1949) on the combinatorial applications 
of matroids and infinite matroids, respectively, the subject lay virtually dormant 
until the late fifties when Tutte (1958-1959), published his fundamental papers on 
matroids and graphs and Rado (1957) studied the representability problem for 
matroids. Since then interest in matroids and their applications in combinatorial 
theory has accelerated rapidly. Indeed it was realised that matroids have 
important applications in the field of combinatorial optimisation and also that 
they unify and simplify apparently diverse areas of pure combinatorics. 


2. Axiom systems 


A matroid has many different but equivalent definitions, several of which were 
described in Whitney’s original paper. Deciding which set of axioms to use is 
difficult. To some extent it depends on the background of the reader or user. 
However, if we take the concept of a vector space as one of the most basic in 
mathematics we can regard matroid theory as having exactly the same relationship 
to linear algebra as does point set topology to the theory of real variables. That is, 
it postulates certain sets to be “independent” (linearly independent) and develops 
a fruitful theory from certain axioms which it demands hold for this collection of 
independent sets. 

Accordingly we define a matroid M to be a finite set § and a collection # of 
subsets of S (called independent sets) such that (I1)-(I3) are satisfied. 

(11) ES. 

(12) If XE ¥ and YCX then YES. 

(13) If U, V are members of ¥ with |U| = |V| + 1 there exists x € U\V such that 
VUXES, 

A subset of S not belonging to # is called dependent. 


Example. Let S be a finite subset of a vector space V and let # be the collection 
of linearly independent subsets of vectors of S. Then (S, #) is a matroid. 


Following the analogy with vector spaces we make the following definitions. A 
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base of M is a maximal independent subset of S, the collection of bases is denoted 
by 8 or B(M). 
The rank function of a matroid is a function r: 2° Z defined by 


r(A) = max(|X|: XC A, XEF) (ACS). 


The rank of the matroid M, sometimes denoted by r(M), is the rank of the set S. 
A subset A C S is closed or a flat or a subspace of the matroid M if for allx ES\A 


r(A Ux) =r(A)+1. 


In other words A is closed if no element can be added to it without increasing its 
rank. If for x ES, ACS, r(A Ux) =r(A) we say that x depends on A. We define 
the closure operator of the matroid to be a function a: 2" 2, such that o(A) is 
the set of elements which depend on A. It is easy to prove from the axioms that 
a(A) is the smallest closed set containing A. A subset X is spanning in M if and 
only if it contains a base. 

All the above concepts are very familiar from vector space theory. 

To illustrate what can be done starting just with the basic axioms (I1)—(I3) we 
now obtain some elementary properties of independent sets and bases of a 
matroid M on S. 

It is clear that if A is independent there exists a base B such that ACB. 

The following stronger result is used extensively. 


Proposition 2.1 (The augmentation theorem). Suppose that X, Y are independent 
in M and that |X|<|Y|. Then there exists ZC Y\X such that |\X UZ|=|Y¥| and 
X UZ is independent in M. 


Proof. Let Z, be a set such that over all ZC Y\X such that XUZ is in- 
dependent, |X U Z,| is a maximum. If |X UZ,|<|Y| then ¥,CY, |¥o|=|XU 
Z,| +1, and since Y, is independent by (13), 3yE¥,\(X U Z,) such that XU 
Z, Uy is independent. The set Z, Uy contradicts the choice of Z). O 


An immediate consequence of this is the following result, which extends the 
well-known property of bases of a vector space. 


Corollary 2.2. Ail bases of a matroid on S have the same cardinality. 


It is fairly obvious that knowledge of the bases, or rank function, or closure 
operator is sufficient to uniquely determine the matroid. Hence it is not surprising 
that there exist axiom system?“for a matroid in terms of each of these concepts. 
We list some of these axioms in the following theorems. 


Proposition 2.3 (Base axioms). A non-empty collection B of subsets of S is the set 
of bases of a matroid on S if and only if it satisfies the following condition: 
(Bl) if B,, B, EB and x EB\B,, Ay EB\B, such that (B, Uy rE B. 
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Proposition 2.4 (Rank axioms). A function r:2°’—Z is the rank function of a 
matroid on § if and only if for X CS, y, zES; 

(R1) r(8) = 0; 

(R2) r(X)<r(X Uy) <=7(X) +1; 

(R3) of 7X Uy) =r(X Uz) = r(X) then r(X Uy Uz) =r(X). 


Proposition 2.5 (Alternative rank axioms). A function r:2°—-Z is the rank 
function of a matroid on S if and only if for any subsets X, Y of S 

(R1’) O<7(X) <|X|; 

(R2’) X CYS7(X) =r(Y); 

(R3‘) (XUY)+r(X NY) <r(X) +7(Y). 


The condition (R3’) is a statement of the important property that the rank 
function of a matroid is a submodular function. This submodularity is at the heart 
of this area of combinatorics. 

We can also easily obtain another axiomatisation of a matroid by its in- 
dependent sets. 


Proposition 2.6. A collection # of subsets of S is the collection of independent sets 
of a matroid on S if and only if J satisfies conditions (11), (12) and the following 
statement: 

(13’) if A is any subset of S all maximal subsets Y of A with YES have the 
same cardinality. 


An alternative and attractive approach is to axiomatise a matroid via its closure 
operator. It needs little more than definition chasing to prove that such a 
characterisation is as follows. 


Proposition 2.7. A function o on the subsets of S is the closure operator of a 
matroid on S ff it satisfies the following conditions for all A, B CS, and elements x, 
y of S: 

($1) ACa(A); 

(S2) AC B>a(A)Co(B); 

($3) a(a(A)) = 0(A); 

(S4) if ySa(A) but y Ga(AU {x}) then x Eo(AU {y}). 


The conditions (S1)—(S3) are the usual conditions satisfied by any set function 
with pretensions to being a closure function. But condition (S4) is special to 
matroids, It is sometimes referred to as the Steinitz-Maclane exchange axiom. 

The proofs of all the above assertions are fairly routine (see Welsh 1976). 


We shall call a set S with any of the above systems (%, r, 0, @) a matroid and 
will usually just denote it by @. Unless otherwise specified, S will always be the 
set supporting M and we sometimes signify this by writing S(M). Throughout r 
and o will denote a rank function and closure operator of a general matroid. 
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Two matroids M, and M, on 5S, and S, respectively are isomorphic if there is a 
bijection ¢: S,—>S, which preserves independence. It is clear that equivalently ¢ 
is an isomorphism if and only if it preserves the rank function, bases, and so on. 
We write M, = M, if M, and M, are isomorphic. 


3. Some examples 


The uniform matroid U * Let S be a set of cardinality x and let ¥ be all subsets of 
S of cardinality <k. This is a matroid on S, called the uniform matroid of rank k 
and denoted by UX. 

Its rank and closure functions are defined by 


lA] |Al<&, 
HA=ty alee, (ASS): 


A |Al<k, 
AA)=15 lalek. (ACS). 


The special matroid in which every set is independent is the free matroid. 

Vectorial matroids. Let S be any finite subset of a vector space V. Let a set 
X=({x,,...,4,}€ if and only if the vectors x,, x,,...,X, are linearly 
independent in V. Then ¥ is the collection of independent sets of a matroid M. 
The rank function is just the rank (or dimension) function of V restricted to the 
set S. Any matroid obtained this way is called vectorial, and a matroid is 
representable if it is isomorphic to some vectorial matroid. 

Cycle matroids of graphs. Let G be a graph, let S be its set of edges E(G) and 
let X € # if and only if X does not contain a circuit if G. Then # is the collection 
of independent sets of a matroid on S, called the cycle matroid or polygon matroid 
of the graph G and denoted by M(G). 

Algebraic matroids. Let F be a field and let K be an extension of F. 


Proposition 3.1. Let S be a finite subset of K and let X EF if and only if XCS 
and the elements of X are algebraically independent over F. Then S is the collection 
of independent sets of a matroid on S. 


The proof of this is a straightforward piece of field extension theory. Any 
matroid obtained this way is called algebraic. 

Affine dependence. Let S b#a finite set of points in d-dimensional Euclidean 
space R*. An element x € R% is affinely dependent on the set {x,,... ,X,} of R‘ if 
there exist real numbers A, (1 <i<r) such that 


rea AX» 
i=1 
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and 
> A, =1. 
i=! 


A subset X CR?° is affinely dependent if no element x © X is affinely dependent 
on X\y. It is easy to prove that if # is the collection of subsets of § which are 
affinely independent then # is a matroid on S. 

A Euclidean representation of matroids of low rank. Matroids of rank <3 have 
the following very useful geometric description in the plane. 

Let M be a matroid of rank 3 on the set $= {x,,...,x,}. Place the n points 
X,,--.,, on the plane and draw a line through each closed set A such that 
|A| #3, r(A) = 2. Then the bases of M are all subsets of S of cardinal 3 which are 
not collinear in this diagram. 


Example. Let M be the matroid on {x,,x,,...,x,} whose bases are all 3-sets 
except 


{x1,%3,X6} {X1,X4.Xs} > {X2,X5, Xp}, {X2,%3,X4} - 


Then a Euclidean representation of M is shown in fig. 3.1. 

Conversely it is easy to check that any diagram of points and lines in the plane 
in which a pair of lines meet in at most one point represents a matroid whose 
bases are those 3-sets of points which are not collinear in the diagram. 


Example. The diagram D of fig. 3.2 is a Euclidean representation of the matroid 
on {x,,...,X,} with bases all 3-sets except {x,,%.,% 6}, {%,,%4,X7}, {X%1,%3,%s5}; 


{Xp %35X4}, {X2,%5,X7}, {%3,% Xo}, {X4,X5, Xe}. : : 
The reader familiar with projective geometry will recognise this matroid as the 
well-known Fano plane which is vectorial over the field GF(2) under the map 


x,—(1, 0,0), x,—(0,1,0), x,;—(0,0,1), 
X,27(0,1,1), 2»*,7(1,0,1), x. (1,1,0), 
x,—(1,1,1). 


x 


Xs 


x4 2 - 
x 4 


Figure 3.1. Figure 3.2. The Fano matroid. 


X3 
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Figure 3.3. 


This matroid, called the Fano matroid, and denoted by F, is more well known 
as the projective geometry PG(2, 2) (see chapter 13). 


In both the above examples there are no two points x, y which form a 
dependent set. When such points are present they can be signified by ‘“‘double 
points’. 


Example. The following diagram represents a matroid of rank 2 on {1,... , 6} in 
which r({2, 3}) = r({4, 5, 6}) = 1. 


o___ __e_______e 
{1} {2, 3} {4, 5, 6} 


This idea can be extended to rank 4 matroids by using 3-dimensional configura- 
tions as shown in the following example. 


Example. Consider the tetrahedron in fig. 3.3. This is a matroid on {1,..., 7} in 
which the bases are the union of 1 with any 3 non-collinear points on the base. 


4. The polygon/cycle matroid of a graph, circuits, connectedness 


As mentioned earlier, Whitney introduced matroids to deal with graph properties 
in an abstract setting. The abstraction rest on the following easy observation. 


Proposition 4.1. The edge sets of the forests of a graph G are the independent sets 
of a matroid on E(G). 


This matroid is called the cycle or polygon matroid of G and is denoted by 
M(G). 


Example. Let G be the graph of fig. 4.1. The polygon matroid of G has as its 
independent sets all subsets of {a, b, c, d,e} which do not contain one of the sets 
{a,b,c}, {c,d,e}, {a,b,d,e}. 
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Figure 4.1. 


From very elementary graph theory we can list the following basic properties of 
M(G). 


4.2. If G is a connected graph the bases of M(G) are the spanning trees of G. 
4.3. If G is a disconnected graph the bases of M(G) are the maximum forests of G. 


Another elementary observation is that the polygon matroid M(G) can just as 
well be defined by listing the circuits of G. This clearly carries over to general 
matroids and continuing this analogy Whitney defined a circuit of an arbitrary 
matroid M to be a minimal dependent subset of S(M). 


Example. In the uniform matroid U* on S in which all subsets of S of k or fewer 
elements are independent, the circuits are all (K + 1)-subsets of S. 


Example. If G is a graph, the circuits of M(G) are just the edge sets which 
comprise the circuits of G. 


Clearly, knowledge of the set of circuits of a matroid completely specifies the 
matroid. It is not surprising therefore that a matroid can be axiomatised in terms 
of its circuits, and for those readers motivated by graph theory, this may well be 
the most useful approach. 

Obviously every dependent set contains a circuit and it is easy to see the 
following. 


4.7. If C is a circuit then r(C) =|C|-1, |C| <r(S) +1, and every proper subset of 
C is independent. 


Proposition 4.8. A collection € 4 {®} of sets is the set of circuits of a matroid if 
conditions (C1) and (C2) hold. 

(Cl) If C,, C, are distinct circuits then C, is not contained in Cy. 

(C2) If C,, C, are distinct circuits and z € C, OC, there exists a circuit C, such 
that C,C(C, UC,)\{z}. 


In fact Whitney used a stronger axiomatisation replacing (C2) by what is now 
called the strong circuit axiom (C3) given by: 
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(C3) If C,, C, are distinct members of € and y © C,\C, then for eachx EC, N 
C, there exists C,€©@ such that 


yEcC,C(C,UC,)\{}. 


Proving the equivalence of (C1)-(C2) with (C1) and (C3) is not trivial. 

Continuing the analogy with graphs we note that in a graph G an edge e is a 
loop iff {e} is a circuit of M(G), while distinct edges e, f, are parallel iff {e, f} isa 
circuit of M(G). Accordingly we make the following definitions for general 
matroids. 

An element x € S is a loop of M on S if and only if {x} is a circuit of M.A pair 
of distinct elements x, y of S are parallel in M iff {x, y} is a circuit of M. 
Trivially, we have 


xisaloop © r({x})=0, (4.9) 
x and y are parallel © r({x}) =7({y}) =r({x, y})=1. (4.10) 


For many purposes one can ‘“‘forget about’? loops and parallel elements in 
matroid theory in the same way as many authors in graph theory only consider 
graphs with no loops or parallel edges. However, as we will see, if we adopt this 
course the intrinsic simplicity and symmetry of duality theory (to be studied 
shortly) vanishes. 

In the same way as the adjective “simple’’ is used to describe a graph which has 
no loops or parallel edges we define a simple matroid to be a matroid with no 
loops or parallel elements. 

When G is a connected graph any spanning tree has |V(G)|—1 edges. Thus 
counting up the connected components we get the following. 


4.11. The rank of the matroid M(G) is |V(G)| — k(G) where k(G) is the number of 
connected components of G. 


More generally if A is any subset of edges of G let k(A) denote the number of 
components in the subgraph of G with the edge set A and vertex set those 
vertices of G incident with a member of A. 


4.12. For any subset AC E(G) the rank of A in M(G) is rg(A) =|V(A)| — k(A). 


It is clear from this how to characterise the closure operator and hence 
implicitly the flats of M(G). a 


4.13. Let A be a set of edges of G, let eC E(G)\A. Then e belongs to a(A), the 
closure of A in M(G), if and only if there is a circuit C of G withe EC CAU {e}. 


However, despite the power of matroid theory as a tool in the clarification of 
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certain graphical ideas we warn that many problems of graph theory cannot even 
be posed in matroid language. Basically this is because there is nothing in a 
matroid which corresponds exactly to a vertex in a graph. 

For example since any tree T on n edges has a cycle matroid which is 
isomorphic to that of any forest on the same number of edges, it is clear that 
there can be no concept in matroid theory corresponding to ordinary graphical 
connectedness. It is easy however to introduce a notion of separability which 
corresponds exactly to 2-connection in graph theory. 

A matroid M is connected or non-separable if for any pair of distinct elements 
e, f of S(M), there exists a circuit C containing e and f. The following 
observations are easy to check. 


4.14, A matroid M on S is connected if and only if for any proper subset A of S 
r(A) + 7(S\A) > 7(S). 


4.15. Write e~f if there is a circuit C containing both e and f. Then ~ is an 
equivalence relation on the groundset S and the equivalence classes are called the 
connected components of M. 


In the same way as M can be naturally broken up into its connected 
components, the reverse operation of taking the “direct sum” of the components 
resurrects the original matroid. 

The direct sum of matroids M, on S, and M, on S,, with S, NS, =@ is the 
matroid M,®M, on S,US, which has as its independent sets all sets (7, U J,: 
I, €.$(M,), 1,€ A(M,)}. Equivalently the set of circuits of M,®M, is just the 
union of the sets of circuits of M, and M,. 

Clearly 


7(M, ® M,) = r(M,) + 7(M,) 


and a disconnected matroid is just the direct sum of its connected components. 

Finally we address the problem of characterising those matroids isomorphic to 
the cycle matroid of some graph. Call such matroids graphic. It is easy to see that 
most matroids are not graphic. 


Example. The uniform matroid Uj is not graphic. (Try to draw a graph with 4 
edges such as its circuits are all edge sets of size 3.) 


A complete characterisation of graphic matroids was given by Tutte (1959) in 
one of the really deep theorems on the subject; for further details of this and 
more on the subject of higher connectivity and Seymour’s theory of splitters see 
Welsh (1982), Oxley (1992) and chapters 10 and 11. 
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5. Duality 


The concept of matroid duality (which we emphasise has nothing to do with 
vector space duality) is of fundamental importance in the applications of matroids 
to combinatorial theory. 

The basic result due to Whitney is easily proved. 


Theorem 5.1. /f {B;: iI} is the set of bases of a matroid M on S then {S\B,: 
i€ I} is the set of bases of a matroid on S, called the dual matroid of M. 


We denote the dual matroid of M by M*. Obviously M and M* are related by 
(M*)*¥ =M. (5.2) 


The following properties of M* are straightforward deductions from its 
definition. 


5.3. A subset X CS is independent in M* if and only if S\X is spanning in M. 
' §.4. An element x € S is a loop of M if and only if x belongs to every base of M*. 
5.5. The rank of M* is \S|—7r(M). 

This last result is a special case of the following more general fact. 


5.6. The rank functions r, r* of M, M* respectively are related by 
r*(A)=|A|-—1(S)+r(S\A) (ACS). 


The function r*: 2’ Z we call the corank function of M. A cobase of M is a 
base of M*: a cocircuit of M is a circuit of M*, if x is a loop of M* it is called a 
coloop of M and so on. 

Since M* determines M uniquely we have the obvious result that a matroid is 
uniquely determined by its cobases, cocircuits or corank function. Moreover, each 
of the earlier axiomatisations can be dualised to give an axiomatisation of a 
matroid in terms of these dual concepts. 

For example if we define a hyperplane of M to be a maximal proper flat it is 
routine to check the following. 


5.7. H is a hyperplane of M on S iff S\H is a circuit of M*. 


As a consequence, since the circuits of a matroid determine it uniquely, we 
have the following. 


5.8. A matroid is uniquely determined by its collection of hyperplanes. 
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In general there is a simple but useful duality principle — that to every statement 
about a matroid there is a dual statement. For example a dualisation of the 
statement 

(a) An element x is a loop of M if and only if x does not belong to any base of 
M. 
is the statement 

(a)* An element x is a coloop of M if and only if x does not belong to any 
cobase of M. 

As another example, because of 5.7 and 5.8 the reader can obtain axioms for a 
matroid in terms of its hyperplanes merely by complementing the circuit axioms 
(C1)-(C2) of section 4. 

More interesting is the light duality sheds on the concept of planarity in graph 
theory. To understand this first consider the following class of matroids. 

A cocircuit (minimum cut) of a graph G is a minimal set of edges whose 
deletion from G increases the number of connected components. A fundamental 
observation is the following. 


Proposition 5.9. The collection of cocircuits of a graph G is the set of circuits of a 
matroid M*(G) on the edge set E of G. 


This matroid is called the cocycle or bond matroid of G and as the notation 
suggests we have the following. 


Proposition 5.10. For any graph G, M*(G) and M(G) are dual matroids, that is 
M*(G)=(M(G))* . 


The reader familiar with graph theory will not find it difficult to verify 5.9 by 
showing that the cocircuits of a graph satisfy the circuit axioms (C1)—(C2) of a 
matroid. 

A consequence of 5.10 is that any statement about circuits in a graph must have 
an exact dual counterpart in terms of cocircuits. For example try and verify 
directly that the cocircuits of a graph satisfy the strong circuit axiom (C3). 

Another useful property of duality is the following. 


Proposition 5.11. M is a connected matroid if and only if M* is. 
Proof. Use criterion 4.14 in terms of the rank function. O 


As an application, we note that a graph is 2-connected iff any pair of edges 
belongs to a cocircuit. 

Finally we relate duality to planarity. A matroid M is cographic if it is 
isomorphic to the cocycle matroid of some graph G. Again it is easy to verify the 
following. 
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5.12. U% is the smallest matroid which is not cographic. 


The concept of abstract duality allowed Whitney to characterise planar graphs 
in the following way. 


Theorem 5.13, A graph is planar if and only if its cycle matroid is cographic. 
Equivalently, by dualising the above statement, we have the following. 
Corollary 5.14. A graph is planar if and only if its cocycle matroid is graphic. 


As in the case of graphic matroids Tutte’s theorem completely characterises 
cographic matroids. 
An alternative reinterpretation of Whitney’s planarity condition is as follows. 


Proposition 5.15. A matroid is both graphic and cographic if and only if it is the 
cycle matroid of a planar graph (equivalently the cocycle matroid of a planar 


graph). 


Thus the statement that M, and M, are dual matroids can be regarded as the 
natural generalisation of the statement that G and H are dual planar graphs. 


6. Submatroids and minors 


We now look at the different ways in which a matroid M on S induces “smaller” 
matroids which in some sense preserve some of the structure of M. The simplest 
operation is defined by the following. 


6.1. Let M be a matroid on S and Let 0<k<r(M). Then if ¥, ={X: XC F(M); 
|X|<k}, F, is the collection of independent sets of a matroid M, on S. 


We call M, the k-truncation of M. It is trivial to verify 6.1 and to note that M, 
has the rank function r, given by 7,(A) = min(k, r(A)) (A CS) and the bases of 
M, are the independent sets of M which have cardinality &. 


Example. The uniform matroid U* is the k-truncation of the free matroid on n 
elements. 


We next show how a matrc# M on S induces two matroids on a subset T of S 
which correspond in the natural way to subgraphs of a graph obtained by the 
operations of deleting and contracting of edges. 

If #(M) is the set of independent sets of M on S and TCS, let 


I(M|T)={X: X CT, XE F(M)} . 
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6.2. #(M|T) is the set of independent sets of a matroid on T. 


We denote this matroid by M|T and call it the restriction of M to T. 

The proof of 6.2 is again trivial. When M is the cycle matroid M(G) of a graph 
G and T C E(G), recall that G| T denotes the subgraph with edge set consisting 
of those edges of G not belonging to E(G)\T. Then it is easy to verify 


M(G|T)=M(G)|T. (6.3) 


Because of this analogy between M | T and the effect of removing the elements 
of S\T the matroid M|T is sometimes called the matroid obtained from M by 
deleting S\T. 

An alternative and commonly used notation is to write M\X to denote the 
matroid obtained from M by deleting the subset X of its groundset S(M). Thus 
M\X=M(|(S\X). 

Obviously the rank function of M|T is just the restriction of r to T. The 
following basic properties of M|T are easily checked. 


6.4. If X is dependent in M and X CT then X is dependent in M|T. 
6.5. M|T has as its circuits all circuits of M which are contained in T. 


Having carried out a restriction of M to the set 7 it is natural to examine the 
dual operation. This turns out to be the exact counterpart of the concept of 
contraction in graphs. There are several equivalent ways of defining it as can be 
seen from the following statements. 


Proposition 6.6. Let M be a matroid on S and let T CS. Define #(M - T) to be the 
matroid on T with rank function d given in terms of the rank function r of M by 


MA) =(AU(S\T))—r(S\T) (ACT). 


Then 
(M-T)*=M*|T, (6.7) 
(M|T)*=M*-T. (6.8) 
The circuits of M-T are the minimal non-empty sets of the form CNT, C 
a@ circuit of M . (6.9) 
X is independent in M-T if there exists a maximal independent set Y of 
M|\(S\T) such that X U Y is independent in M . (6.10) 


We call M -T the contraction of M to T because when M is the cycle matroid of 
a graph G and T C E(G), M -T is the cycle matroid of the graph G - T obtained 
from T by contracting out of G the edges in E(G)\T. 

For X ¢ S(M), we often us M/X to denote the matroid obtained from M by 
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Figure 6.1. Figure 6.2. 


contracting out X. Thus 
MIX =M-(S/X). 


Note by (6.7), (6.8) that deletion and contraction are dual concepts in matroid 
theory. 

Contraction also has an attractive geometric interpretation. Consider for 
example M on S= {a,b,c,d,e, f, g} where a, b, c, g are the vertices of a 
tetrahedron in R® and d, e, f are points in general position in the faces {g, b, c}, 
{a,b,c}, {a, b, g} as shown in fig. 6.1. Then the contraction of g out of M gives 
the matroid M-T of rank 3 obtained by just projecting down from g and 
represented geometrically by fig. 6.2. 

Any matroid N which can be obtained from a matroid M by some sequence of 
contractions and deletions is called a minor of M. Since the operations of 
contraction and deletion commute the following is equivalent. 


6.11. N is a minor of M if and only if M can be obtained from M by a restriction 
(contraction) followed by a contraction (restriction). 


Moreover, since the operations are dual the following holds. 
6.12. N is a minor of M if and only if N* is a minor of M*. 


The crucial importance of minors in matroid theory is because many of the 
interesting properties seem to be preserved under the taking of minors and a 
natural way to obtain a good characterisation of such a property 7 is to find some 
finite class of matroids {N,,...,N,} such that having property 7 is equivalent to 
not having one of the N, as a minor. In recent years, due mainly to the research 
efforts of Tutte and Seymour, many deep characterisations of this type have been 
obtained — we refer to chaptgr 10 for details. 


7. Geometric lattices 


A completely different approach to matroid theory was initiated by Birkhof! 
(1935). 
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Recall that a flat or closed set of a matroid M is just any subset A of E(M) such 
that r(A U {e}) =r(A) + 1 for any e € A. It corresponds exactly to a subspace in a 
vector space. Clearly the flats of M, ordered by inclusion form a partially ordered 
set (see chapter 8). However, more can be said. It is easy to prove that if A, B 
are any two flats then AMB is also a flat. This leads to the following easy 
observation. 


7.1. For any matroid M, the set of flats of M ordered by inclusion forms a lattice 
£(M). 


In particular the lattice &(M) has the following special properties. 


7.2. £(M) has a minimal element 0 and a maximal element I together with a height 
function h such that for any pair of elements x, y 


hQ@) +h(y)zh@ay)thevy). 


The proofs of 7.1 and 7.2 are easy. An element x of £(M) corresponds to a flat 
X of M. The zero element 0 is just the closure of the empty set, namely the set of 
loops of M. The height A(x) of x is then just the rank r(X) of the corresponding 
flat X and 7.2 follows from the submodularity of the rank function. 

From 7.2 it follows that £(M) is a semimodular lattice which has the additional 
property that any element of ¥(M) can be written as the join of atoms. But this is 
exactly the definition of what is known as a geometric lattice. 

It is natural therefore to consider the reverse proposition that every finite 
geometric lattice ¥ can be obtained by the above construction from some matroid 
M. This is the case. To prove it just construct a matroid M on the set A of atoms 
(elements of height 1) of ¥ in which a subset X ={x,,...,x,} is called 
independent if and only if 


A{x,ve--vx at. 


Verifying that this construction gives a matroid M with the property that £(M) = 
£ is just a routine check. 


Example 1. If M is the free matroid in which every subset is independent, 2(M) 
is the distributive lattice of all subsets of S(M). 


Example 2. If M is the uniform matroid of rank 3 on S = {a, b, c, d}, £(M) is the 
lattice of fig. 7.1. 


Example 3. Semimodular lattices which are not the lattices of flats of a matroid 
are shown in fig. 7.2. In both cases the element g is not the join of atoms. 


We can therefore sum up the previous discussion by stating the following. 
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Figure 7.1. Figure 7.2. 


Theorem 7.3. A finite lattice is isomorphic to the lattice of flats of a matroid if and 
only if it is a geometric lattice. 


Thus in a sense matroid theory can be regarded as just a small subset of lattice 
theory. However, the flaw in this approach is that it does not seem to give any 
insight into duality and it is from duality that much of the richness of the theory 
comes. As a test of this observation consider the following problem. Given a 
geometric lattice £ construct the geometric lattice of the matroid dual to that of 
£. There seems to be no natural lattice theoretic answer to this. 

Another point in connection with Theorem 7.3 is that the correspondence 
between matroids and geometric lattices is only a bijection between the class of 
simple matroids and geometric lattices. This is because if M, can be got from M 
by deleting loops or elements in parallel then the geometric lattices Y(M) and 
£(M,) are indistinguishable. 

The relation between matroids and geometric lattices parallels the formation of 
projective spaces from vector spaces, 1-dimensional subspaces in the vector space 
become points of the projective space. 

However, provided we are concerned only with simple matroids, much can be 
gained by a geometric lattice approach and it is this approach which is taken in 
the influential work of Crapo and Rota (1970). (They use the term combinatorial 
geometry for what we call simple matroid and pregeometry for matroid.) 

Finally we remark that minors of M are clearly visible in the geometric lattice 
of M. That is to say, if we consider M on S with geometric lattice £(M) as shown, 
then if T is a flat of M, the minors M|T and M-(S\T) are the sublattices 
indicated in fig. 7.3. Similarly if ACB are any two flats the interval sublattice 
[A, B] is the geometric lattice of the minor M obtained by contracting S\B out 
and then restricting to S\A. 


a 
8. Pavings, transversals and linkages 


We observed in 5.8 that a matroid is uniquely determined by its hyperplanes. By 
dualising the circuit axioms and recalling that H is a hyperplane of M only if its 
complement is a circuit we have the following. 
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Figure 7.3. 


Proposition 8.1. A collection # of subsets of S is the set of hyperplanes of a 
matroid on S if and only if 

(H1) No member of # properly contains another. 

(H2) if H,, H, are distinct members of H and x ZH, UH, there exists H, © H 
such that (HW, H,)Ux CH. 


From this basic result it is easy to recognise the following large class of 
matroids. 

A d-paving of S is a collection of subsets of S each of cardinality =d, no one 
of which is contained in another and with the property that each d-subset of S is 
contained in a unique member of . A trivial consequence of this definition and 
8.1 is the following. 


Proposition 8.2. [f P is a d-paving of S then it is the set of hyperplanes of a 
matroid on S of rank (d + 1). 


Any matroid isomorphic to one obtained in this way we call a paving matroid. 
Although they have a very simple structure (every set of cardinality <d is 
independent and every circuit has cardinality d + 1 or d + 2), they often occur as 
counterexamples, possibly because they are far away from the linear case. 

A particularly regular class of paving matroid is when all the hyperplanes have 
the same cardinality. We then have the very special structure, well known in the 
theory of block designs as Steiner systems see chapter 14. Explicitly we have the 
following. 


Corollary 8.3. The blocks of a Steiner system are the hyperplanes of a matroid. 


Despite their rather limited structure there dre surprisingly many paving 
matroids. For example if f(z), p(t) denote respectively the number of matroids 
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and paving matroids with groundset of cardinality n then we have 


22" -(3/2)log(a) + O(log log 1) gn-logn + O(log log 42) 


< p(n) = f(a) <2 (8.4) 

Apart from the ways in which we have already met matroids in connections 
with graphs and Steiner systems they seem to have a natural place in the theory of 
linkages. 

The simplest way to see this is to consider the following class of matroids 
occurring in transversal theory. Let of =(A,: i€ J) be a finite family of subsets of 
S. A partial transversal of sf is a set X = {x,,x,,...,%,} such that there is an 
injection w{1,...,¢}—J with the property that x,;E Ay, (1<i<2). 

The following observation, first made by Edmonds and Fulkerson (1965) is 
fundamental. 


Proposition 8.5. The collection P(#) of partial transversals of a finite family 9 of 
subsets of S is the set of independent sets of a matroid on S. 


Any matroid which is isomorphic to one which is obtainable in this way is called 
a transversal matroid. 

In a sense this is just a special case of the following more general construction. 
Take an arbitrary graph G and a distinguished subset A of vertices of G. We say 
that a set {x,,...,x,} of vertices of G can be linked into A if there is a sequence 
P,,...,P, of paths of G which have the following properties: 

(a) P,, P; are vertex disjoint if i #j, 

(b) the endpoints of P, are x, and a, where a;€ A. 

An obvious extension of 8.5 is the following. 


Proposition 8.6. For any graph G and any distinguished subset A of vertices of G 
the collection P(A) of sets of vertices which can be linked into A form the 
collection of independent sets of a matroid on A. 


Any matroid which can be obtained in this way is called a gammoid. 

Clearly every transversal matroid is a gammoid obtained by taking G to be a 
bipartite graph. 

A closed related class of matroids occurs by taking a set X of vertices of a graph 
G to be independent if there exists some matching of G which contains X among 
its endpoints. This defines the class of matching matroids. 


Induced matroids 


A natural extension of thes? matroids obtained from linkages is the idea of 
inducing matroids. 

For any graph G let M be a matroid with groundset § a subset of the vertex set 
V(G). If TCV, G induces a matroid G(M) on T by defining X CT to be 
independent in G(M) if and only if there are vertex disjoint paths in G linking X 
to a subset Y of V(G) which is independent in M. 
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A particular case of this occurs when f is a function on S— T. By taking G to 
be the obvious associated bipartite graph with vertex set § U T and edges (s, f(s)) 
we may define f(M) to be the induced matroid G(M). This construction was first 
introduced by Nash-Williams (1966), the extensions to arbitrary linkages was 
carried out some years later, for details see Mirsky (1971), Welsh (1976) or Oxley 
(1992). 


9. Submodular set functions 


A function 4: S>R is submodular if 
H(X) + WY) 2 (XUV) + w(XNY) (X,YCS). 


There is an intrinsic relationship between matroids and submodular set functions; 
not only because of the submodularity of the rank function, but because of the 
following result. 


Theorem 9.1. Let « be an integer valued, submodular, non-negative set function 
on S with u(9)=0. Then yp induces on S the matroid M(j) which has as its 
independent sets the collection of sets X, such that for all ACS 


p(A)2=|XN Al. 
The rank function r of M(,2) is given by 
r(X) =inf{(A) + |X\Al): ACS}. 


Obviously any matroid can be obtained by choosing y to be the “right” 
function. What is interesting is to use the above theorem in a most natural way so 
that the class of matroid drops out. 


Example. As a very simple example, let (A,: i € /) be a family of subsets of S and 
let w: 1->Z* be defined by 


BKJ)=|AY)| Jct 


and where A(J) = U {A;: i= J}. 
It is easily checked that » is submodular and the resulting matroid has as its 
independent sets those J for which the family (A,: i € J) has a partial transversal. 
Interchanging the roles of points and sets in the above (sometimes rather 
grandly called point-set duality) gives the class of transversal matroids. 


More interesting is the following construction which has an easy interpretation 
in terms of submodular functions. The sum of two rank functions on the same set 
is a submodular function and leads to the following construction. 

Given two matroids M, on S, and M, on S,. Define M, v M,, the union of M, 
and M, to be the matroid on S, US, in which a set X is independent if it can be 
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expressed in the form X,UX, where X, is independent in M, and X, is 
independent in M,. 

Clearly when S,9S,=@ the union of M, and M, is just the direct sum 
M, ®M,,. In general, however, it is a difficult operation to unravel. For example 
the following question first posed in Welsh (1971) is still unresolved. 


Problem. Call a matroid M irreducible or prime if there do not exist non-trivial 
M,, M, such that M=M,vM,. Classify, or find necessary and sufficient 
conditions for a matroid to be irreducible. 


An almost immediate consequence of its definition is the following. 


Proposition 9.2. M is transversal if and only if M is the union of matroids of rank 
one. 


Much less obvious is the following fundamental calculation often called the 
matroid partition theorem (see also chapter 11). 


Theorem 9.3. If M=M,v-:::v M, with M, having rank function r, then M has 
rank r given by 


AS) =min (7,(A) +--+ ,(A) + |S\Al) (ACS). 


This follows immediately from Theorem 9.1 and the observation that since each 
r, is submodular the sum 7, +-::+ 1, is a submodular set function. 

From 9.3 we have what are often described as the two covering/packing 
theorems about matroids. 


Corollary 9.4. A matroid M has k pairwise disjoint bases iff for all ACS 
kr(A) + |S\A| = kr(S). 


Corollary 9.5. The ground set S of a matroid M can be covered by k or fewer 
independent sets if and only if 


kr(A)2|A| (ACS). 


In both cases 9.4 and 9.5 the conditions are easily shown to be necessary. They 
have many applications in different areas of combinatorics and versions of these 
results restricted to graphs or vector spaces or transversals were originally proved 
by quite complicated and ad hoc arguments which were relevant to the particular 
specialisations see for example chapter 2. The realisation that matroids gave a 
unified and simple treatment of this general area was a main impetus to interest in 
the subject. For more on this we refer to chapter 11. 

By taking the union of M, with the dual of Mf, we get an easy condition for M, 
and M, to have an independent set of prescribed size. More precisely, we get the 
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following result called the matroid intersection theorem due to Edmonds (1965) 
(for a proof and algorithms see chapter 11). 


Theorem 9.6. If M, and M, are matroids on S and k € Z* then M, and M, have a 
common independent set of size k if and only if for any subset A of S 


r,(A) +1,(S\A) 2k. 


A curiosity is that the problem of deciding whether three or more matroids 
have a common independent set of size k is NP-hard (see chapter 29) and appears 
to be hopelessly difficult. 

Both 9.3 and 9.6 can be easily deduced from, or lead to a very nice result of 
Rado (1942) which generalises in a very natural way the classic Hall theorem (see 
chapter 3). This can be stated as follows. 


Rado-Hall theorem 9.7. [f M is a matroid on S and A=(A,: i€ 1) is a family of 
subsets of S then A has a transversal which is independent in M if and only if for all 
Jol 


r(AV)) = [JI - 


For a proof of this, an account of its many applications in transversal theory 
and its relation with Theorems 9.3-9.7 we refer to the text by Mirsky (1971). 


10. Linear representability 


One of the oldest problems in matroid theory is the question of finding necessary 
and sufficient criteria for a given matroid to be embedded in a vector space over a 
given field. 

There certainly exist matroids which are not representable over any field. 

A very simple example of such a matroid is the following. 


Example. Consider the rank 3 matroid M on the 14 elements {a, b, c, d, e, f, 8, 
a’, b', c', d', e’, f', g’} which has a Euclidean representation of the form of fig. 
10.1. M is just F,@F; where F? is the Fano matroid “less a line’. F, is only 
representable over fields of characteristic 2, F) is only representable over fields 


a 


Figure 10.1. 


504 D.J.A. Welsh 


not of characteristic 2. As a result this direct sum is not representable over any 
field. 


Similar examples of any rank can be constructed by taking the direct sum of 
projective spaces, 


M = PG(n, gq) B® PG(m, q’), 


where g and q’ are powers of different primes. 

However, in one sense these are unconvincing examples since the lack of 
connectivity is crucial for the non-representability. For example suppose we say 
that a flat Z is modular in M if 


MHZ) +rYHXZUY)+7(ZNY) 


for all flats Y of M. 

Call a matroid modular if every flat is modular. 

Examples of modular matroids are the free matroids and the projective 
geometries, and any matroid obtained from these by taking direct sums. 

It turns out (and this is really a theorem about lattices, see Birkhoff (1967, pp. 
90-93) that this is in fact a complete characterisation: more precisely we have the 
following. 


Theorem 10.1. A connected simple matroid is modular if and only if it is either a 
free matroid or a projective space. 


Corollary 10.2. A connected modular matroid is representable. 


For more on these sort of questions we refer to chapter 13. 

The first ‘‘interesting” non trivial example of a non-representable matroid was 
found by Maclane (1936) who noticed that the nine element example whose 
Euclidean representation is given by fig. 10.2 is only representable over a field if 
the points a, b, c are collinear. That is {a, b, c} has rank 2 rather than rank 3 as in 
fig. 10.2. This is because the classic theorem of Pappus demands that in such a 
geometric configuration of points coordinatisable over a field these points of 
intersection a, b and ¢ are collinear. 

For some time the above “non-Pappus’” matroid was believed to be the 


Figure 10.2. The non-Pappus configuration. 
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smallest non-representable matroid. This turns out to be false as is shown by the 
following example of Vamos (1971). 


Example 10.3. V, is a self-dual matroid on 8 elements which is not representable 
over any field, see fig. 10.3. 


We now list the basic properties of representability, for proofs see Welsh (1976) 
or Oxley (1992). 


10.4. if M is representable over F then so is any minor of M. 
10.5. If M is representable over F so is its dual M*, moreover, if the representation 
of M is by the columns of the r by n matrix (r =r(M), n= S(M)) 
M=(I,, A] 
then M* is representable by the columns of the matrix 
M*=[-A',J,_,]. 
10.6. if M is representable over F then any matroid induced from M is represent- 


able over some extension of F. 


10.7. if M, and M, are representable over F then M, v M, is representable over 
any sufficiently large extension of F. (In other words it is the size of the extension 
not its algebraic structure which matters.) 


Turning now to the representability of classes of matroids, we first have the 
following. 


Proposition 10.8. Graphic matroids are representable over every field and hence by 
duality and (10.5) so are cographic matroids. 


Matroids which are representable over every field form an important subclass of 
matroids. They are known as regular matroids and a complete characterisation (or 
more precisely) synthesis of these matroids is contained in a remarkable theorem 
of Seymour (1980). This shows that any regular matroid is effectively obtained by 
composing graphic matroids, cographic matroids and one special 10-element 


Figure 10.3. 
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matroid. For more on this and a characterisation of regular matroids in terms of 
their forbidden minors we refer to chapter 10. 


10.9. Transversal matroids, being the union of matroids of rank one, are 
representable over all sufficiently large fields. 


As far as representability over specific fields goes only a few cases have been 
settled. 

A matroid is binary if it can be represented over the field GF(2). The smallest 
non-binary matroid is the uniform matroid Uj. There are many good characteri- 
sations of this property which we illustrate in the following statement. 


Theorem 10.10. The following statements about a matroid are equivalent. 
(a) M is binary. 
(b) M has no minor isomorphic to Uj. 
(c) The symmetric difference of any two circuits is the union of disjoint circuits. 


The only other field for which a complete characterisation of representability is 
known is the ternary case. This is a deep result (see chapter 10). 

Finally we briefly consider the characteristic set problem. Let P be the set of all 
primes together with 0. Given a matroid M let C(M) be the set of integers p € P 
such that M is representable over a field of characteristic p. C(M) is the 
characteristic set of M and K is a characteristic set if K = C(M) for some matroid 
M. The problem is to characterise the subsets K of P which are characteristic sets 
of a matroid. 

Clearly from our earlier results we know the following. 


10.11. @ and P are characteristic sets, as is {p} for any prime p € P. 


An early result of Rado (1957) subsequently extended by Vamos (1971) is the 
following. 


10.12. if 0€ C(M) then all sufficiently large primes belong to P and conversely if 
0 C(M) then C(M) is a finite set. 


In other words, the following holds. 


10.13. if K is a characteristic set then either 
(i) OE K, |P\K|<*, or , 
(ii) OF K, |K|<~. 


Reid (1970) showed that all sets satisfying (i) are characteristic sets. He also 
produced some exotic finite sets of primes which are characteristic sets, for 
example {1103, 2089}, for details see Brylawski and Kelly (1980). A definitive 
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solution to the characteristic set problem was completed when Kahn (1982) 
proved the following. 


10.14. All finite subsets of P\{0} are characteristic sets. 


Combining all these results we obtain the complete answer to the characteristic 
set problem. 


Theorem 10.15. K is a characteristic set if and only if it satisfies (i) or (ii) of 
(10.13). 


11. Algebraic matroids 


The idea of treating algebraic dependence over a field axiomatically is implicit in 
the treatment by Van der Waerden (1937) in his classic text “Moderne Algebra’’. 

First recall the definitions. Let K be an extension field of the field F. Elements 
€,,--+,@, Of K are algebraically dependent over F when there is a non-zero 
polynomial p(x,,...,x,) with coefficients from F such that p(e,,...,¢,) =90. 
The collection of algebraically independent subsets of a subset S of K are the 
independent sets of a matroid on S$ and any matroid isomorphic to one which is 
obtained in this way we say is algebraic over F. 

A matroid is algebraic if it is algebraic over some field, and the sort of questions 
which are interesting are similar to those asked about linear representability, 
namely: find conditions for M to be algebraic over a specific field or over some 
field. What is the “algebraic characteristic set” of a given matroid and the like. 

Some progress was made on these in the early seventies by Piff (1972), 
Ingleton, and Main (1975) and more recently by Lindstrom (1985-1988) and 
Dress and Lovasz (1987). 

First we clarify the relation with linear representability by stating some of the 
basic results. 


11.1. If M is linear over F then M is algebraic over F. 


The converse of this is not true; the non-Fano matroid is algebraic over GF(2) 
but is not linearly representable over a field of characteristic 2. 


11.2. If M is algebraic over F then any minor of M and any truncation of M is 
algebraic over some extension of F. 


11.3. If M on S is algebraic over F and f: S—>T then f(M) is algebraic over F. 
This leads to easy proofs of the following results 


11.4, If M,: 1<i<n, are algebraic over F then so is M,v---v M. 


at 
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11.5. Transversal matroids are algebraic over every field. 
A recent result of Lindstrém (1985b) is the following. 


11.6. If M is linearly representable over the rationals then M is algebraic over every 
field. 


From this the following can be shown. 


11.7. For any field F there is no finite set of excluded minors for algebraic 
representability over F. 


Another interesting contrast with the situation vis a vis linear representability as 
given is that algebraic characteristic sets not containing 0 are not forced to be 
finite. 


Example (Lindstrém 1986). The non-Pappus matroid is algebraic over GF(p”) 
for every prime p but it is not algebraic over any field of characteristic zero. 


A recent review paper by Lindstrém (1988) gives a very good account of the 
many new results obtained about possible algebraic characteristic sets and 
algebraic matroids. Whether or not any fixed finite set of primes can be algebraic 
characteristic set of a matroid seems to be a difficult problem. 

We close this brief account by restating a problem first posed in Welsh (1971) 
and on which there seems to have been only limited progress. 


Problem. [f M is algebraic is its dual M* also algebraic? 


A more specific form of this to which it may be easier to find a counterexample 
is as follows. 


Problem. If M is algebraic over F is M* algebraic over F? 


One of the difficulties in this sort of problem is that there are not “large stocks” 
of non-algebraic matroids. ¢ngleton and Main (1975) were the first to exhibit a 
non-algebraic matroid by showing that the rank 4 matroid V, of section 10 was not 
algebraic and Lindstré6m (1985a) shows that the non-Desargues matroid consisting 
of the standard Desargues configuration with one line missing is not algebraic. He 
has also constructed an infinite family of such non-algebraic matroids. For morn 
on this see Oxley (1992). 
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12. Structural properties 


There are a rich variety of constructions which give a deeper understanding of the 
underlying structure of a matroid. At this level of exposition it is impossible to do 
more than uncover the tip of an iceberg by mentioning a few of the more 
important constructions and problems. 


Strong maps 


The concept of a linear mapping is so important in linear algebra that it is natural 
to try and develop an analogue in a matroid setting. This is achieved by defining a 
strong map between two geometric lattices ¥, and ¥, to be a function f: £,— ¥, 
satisfying 

(a) 2, BEL, > fla v b) = f@) v f(b), 

(b) if x is an atom of ¥,, f(x) is either an atom or the zero element of &,, 

(©) f(0) =0. 

Although this definition can be formulated in terms of matroids it is less 
complicated to describe it lattice-theoretically as above. Geometric lattices and 
strong maps form a category and it seems clear that this appears to be the most 
natural category of maps to use. 

Some elegant factorisation theorems for strong maps exist and a fairly rich 
theory has been developed, see for example Kung (1986). However, it should be 
emphasised that if £(M,) is the image of %(M,) under a strong map very few 
properties of M, are shared by M,. This is because any simple matroid M on a set 
S is the image of the free matroid under the canonical map @¢ defined by 


dx)=x xES, 


$(A)=o(Ux:x€ A) ACS, 


where @ is the closure function of M. 


Extensions and lifts 


A matroid N is an extension of M on S if N restricted to S is M. It is a single 
element extension if N has ground set S Ue, eS. 

Two very trivial ways of extending M are by adding either a loop or an element 
in parallel with an existing element of M. Almost, but not quite, as trivial is to 
add a coloop, that is an element so that 7(N) = r(M) + 1. 

We call an extension non-trivial if it is not of any of these three types, and 
henceforth only consider such extensions. 

A lift of M is a matroid N such that N* is an extension of M*. 

Geometrically we think of M as being obtained from a single element lift N by 
projection. 
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Quotients 


If M and Q are matroids which share a common groundset, Q is a quotient of M 
if every flat of Q is also a flat of M. It is clear that the identity map on the 
groundset induces a strong map between M and Q. Conversely any strong map 
between lattices whose atoms are in one to one correspondence induces a 
quotient between the corresponding matroids. When r(Q)=r(M)—1, Q is an 
elementary quotient of M. 

As might be expected, elementary quotients are intimately related to single 
element extensions. 


12.1. If M, Q have groundset S, Q is an elementary quotient of M iff there exists a 
single element extension N of M on say T=S Ue such that N-S=Q. In other 
words we have 


N onSUe 
delete “/ \ so e 
MonS Qons 


More generally, we have the following. 


12.2. If r(M)=r(Q) +k and E is disjoint from S with |E| =k then Q is a quotient 
of M if and only if there exists an extension N of M by E such that E is independent 
in N and 


N|IS=M, N-S=Q. 


The Dilworth truncation of a matroid 


A more interesting construction is what is known as the Dilworth truncation 
D,(M) of M. This is the matroid on the set ¥* of k-flats of M and which has 
independent sets defined to be 


g={r(U F)=\sl+k-1wert, 
ies 


In other words, D,(M) is the matroid induced on the set ¥* by the submodular 
function 1: ¥*—>Z* defined by 
a 
yi)=r(UF: iet)~k+ i: 


Somewhat surprisingly the Dilworth truncation of a matroid inherits many of 
the properties of its underlying matroid. For example Mason (1977) shows the 
following. 
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12.3. M is representable over a field of characteristic p if and only if D,(M) is 
representable over the same characteristic. 


12.4. If M is transversal then so is D,(M). 


For a more detailed and comprehensive account of these and other construc- 
tions we refer to the article of Kung (1986). We close with a brief discussion of 
one of the most intriguing structural problems in matroid theory. 

The Whitney number W, of a matroid M is defined to be the number of k-flats 
of M. Possibly inspired by the intuition that geometric lattices are “eggshaped” 
when regarded as posets, a long standing conjecture about matroids is that for any 
matroid the sequence (W,: 1k <r(M)) is unimodal. This is certainly true for 
the cases where the numbers are known explicitly, projective and affine geomet- 
ries, boolean algebras and the like. Stronger forms of the unimodal conjecture are 
that the sequence W, is log concave, that is 


W2>W,_4.Wi4,, 2<k<r-1 


or even that the ratio W7,/W,_,W,,, is a minimum when the underlying matroid 
is free. For more on this for cases where the W, are known explicitly see chapter 
31. 

Not much progress has been made on any of these problems over the last 15 
years apart from a very beautiful paper by Dowling and Wilson (1975) who gave a 
very elegant proof of the result that for any simple M 


W,+W,+---+W.<W_ t+ +W_,+W_,, (12.5) 


where r is the rank of M and 1l<k<r-1. 


13. Colourings, flows and the critical problem 


The chromatic polynomial of a graph is a very familiar concept in graph theory. 
Indeed, it has become so familiar that one tends to overlook the significance of 
the fact that, if we calculate the chromatic polynomial of a given graph by the 
well-known technique of successively contracting and deleting edges, then the 
resulting polynomial is independent of the order in which we consider the edges. 

This basic observation is the key to the Tutte~Grothendieck theory developed 
by Brylawski (1972) and having applications in fields as diverse as percolation 
theory and codes. : 

Brylawski’s idea was to define a “Tutte-Grothendieck” invariant for matroids 
as follows. A matroid invariant is a function f from the set of matroids to a 
commutative ring, such that, if M is isomorphic to N, then f(M)=f(N). It is a 
Tutte-Grothendieck invariant if, in addition, it satisfies the following two 
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conditions: for any e which is not a loop or coloop, then 


(i) f(M)=f(M\e) + f(m/e) 

(ii) f(M, ®M,) = f(M, )f@,) - 
It is easy to check that examples of Tutte-Grothendieck invariants are the 
numbers of bases of M, the number of independent sets of M, and the number of 


spanning sets of M. 
Brylawski’s main result is the following. 


Theorem 13.1. Any Tutte-Grothendieck invariant is uniquely determined by its 
values on the two single-element matroids consisting of a loop or coloop. More 
precisely, there exists a polynomial T(M; x, y) in two variables x, y such that, if f is 
a Tutte-Grothendieck invariant, and if f{(M,) =x and f(Mj)=y where M, is the 
single-element matroid of rank 1, then f(M) = T(M; x, y). 


The polynomial 7(M; x, y) is known as the Tutte polynomial of the matroid M. 
It is most easily defined by its relation to the (Whitney) rank generating function 
R(M; x, y) which is given by 


T(M;x, y)=R(M;x—1,y—1), (13.2) 
where R is defined by 
R(M3x, y) = », grey let (13.3) 


Example. The reader is invited to verify that if M is the cycle matroid of the 
graph G of fig. 13.1 then by successively deleting and contracting the elements, 
the Tutte polynomial of M is given by 


T(M;x, y)=x°t+2x?+2xyty+xty. 


The remarkable thing is that, no matter in what order we carry out our 
contractions and deletions, we always get the same polynomial. Furthermore, any 
function which can be calculated by this contraction—deletion method must be an 
evaluation of this polynomial for suitable x and y. For example, b(M), the 
number of bases of M, is given by b(M) = T(M; 1, 1), since the value of b on a 
coloop and a loop is in both cases 1. Similarly, i(M), the number of independent 
sets of M, is given by i(M) = T(M; 2, 1), since a coloop has two independent sets 
whereas a loop has only one. 

The prototype Tutte—Grothendieck invariant is the chromatic polynomial of a 

a 


Figure 13.1. 
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graph, see chapter 1. If we define the chromatic or characteristic polynomial of a 
matroid M by 


P(M; A) = (-1)T(M, 1-4, 0), (13.4) 


where M is the cycle matroid of a graph G, the chromatic polynomial P(G; A) of 
the graph G is given by the equation 


A“ P(M(G); A) = P(G; A), (13.5) 


where k(G) is the number of connected components of G. Incidentally, this shows 
that two graphs with isomorphic cycle matroids must have the same chromatic 
number. 

Let G be a connected graph having no isthmuses and suppose that the edges of 
G are oriented, giving a directed graph G. For any finite abelian group H an 
H-flow on G is a map ¢ : E(G)—>H such that Kirchhoff’s conservation laws are 
obeyed at each vertex of G. Clearly whether or not ¢ is an H-flow depends on the 
orientation given to G. A nowhere-zero H-flow is one which assigns a non-zero 
element of H to each edge of G. 

A crucial observation is that for any abelian group H, if F(G; H) denotes the 
number of nowhere-zero H-flows on G then 


F(G; H) = F(G/le; H) — F(G\e; H) 


for any e€G which is not a loop or coloop. Thus F is essentially a Tutte— 
Grothendieck invariant of M(G) and we have the somewhat surprising conse- 
quence. 


Proposition 13.6. The number of nowhere-zero H-flows on G is given by 
T(M(G);0,1—A) where A is the order of the abelian group H. 


Thus we may sensibly speak of G possessing a nowhere-zero n-flow to mean 
that G has a nowhere-zero H-flow for some (and hence every) abelian group H of 
order n. 


Two longstanding conjectures of Tutte (1966) are the following. 
Conjecture 13.7 (Tutte’s 5-flow conjecture). Every bridgeless graph has a 5-flow. 


Conjecture 13.8 (Tutte’s 4-flow conjecture). A cubic bridgeless graph has a 4-flow 
if it has no subgraph contractible to the Petersen graph Py. 


Both of these conjectures have attracted a lot of attention over the years. The 
4-flow conjecture is intimately related to edge colourings, while Seymour (1981a) 
has improved an earlier result of Jaeger (1976) by showing that every bridgeless 
His possesses a 6-flow. For more on these problems we refer to chapters 1 and 


For a general matroid M which is representable over a finite field of order q the 
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chromatic polynomial has an intriguing interpretation in terms of a fundamental 
blocking problem of projective geometry. This goes back at least as far as Veblen 
1912). 

Cates the projective space PG(r — 1, q). For any positive integer t, a t-block 
in PG(r — 1, q) is a set X of points such that XM F #6 for each flat F of rank 
r—t, In particular, the 1-blocks are the sets which have non-empty intersection 
with every hyperplane of PG(r — 1, q). We call X a minimal t-block if X is a 
t-block but X\p is not a ¢-block for each p€ X. 

Standard vector space arguments can be used to show that the projective space 
PG(t, g) as a subspace of PG(7 — 1, g) is a minimal #-block for each integer t > 2 
and each prime power q. The following remarkable theorem gives the relationship 
between blocking sets and the chromatic polynomial of the underlying matroid. 


Theorem 13.9. Let M be a matroid of rank r on S which is embedded in 
PG(r-1,q). Then S is a t-block if and only if P(M;q')=90. More generally, 
P(M; q‘) enumerates the collection of hyperplanes whose intersection is an (r — t)- 
flat avoiding S. 


For a class of matroids which are representable over GF(q) Crapo and Rota 
(1970) define c(M; qg) to be the minimum f¢ such that P(M; g') > 0. Determining 
c(M; q) is called the critical problem of combinatorial geometry. 

Note also that Theorem 13.9 is a justification for the slight abuse of language in 
the statement ‘“‘M is a t-block”. What we mean is that any of the various vector 
representations of M in PG(r — 1, qg) is a ¢-block; in other words, it is not their 
coordinatisation which is important, but their geometrical structure. 

As an example, consider the Fano matroid F,. Since 


P(F;; A) = (A 1)(A—2)(A— 4), 
F, is a 2-block over GF(2) and a 1-block over GF(4). More generally we have the 


following result. 


Theorem 13.10. For any prime p, a t-block over GF(p) is a 1-block over GF(p') 
for each positive integer t. 


The converse of Theorem 13.10 is not always true since there exist 1-blocks over 
GF(p') which, are not representable over GF(p). However, if a 1-block over 
GF(p’) is representable over GF(p), then it is a t-block over GF(p). Similarly, 
since the uniform matroid U? has chromatic polynomial A” — nA + n — 1, we know 
that the (g¢ + 1)-point line is a }-block over the field GF(q) for each prime power 
q 


For complete graphs K,, we have 
P(M(K,); A) =(A-1)(A-2)---(A-n+1). 


It follows that for any prime power qg =p’, the cycle matroid of K,,, is a 
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1-block over GF(q), and is thus also a t-block over GF(p). Now the cycle matroid 
M(G) of any graph G is representable over every field, and so M(G) is a t-block 
over GF(q) if and only if G has chromatic number x(G) > q‘. Thus the graphic 
1-blocks over GF(2) are just the cycle matroids of non-bipartite graphs. 

When we come to 2-blocks over GF(2), however, the situation is much more 
complicated. Since K, is a minimal graph which is not 4-colourable we deduce 
that M(K,) is a minimal 2-block. However, there are many other minimal 
2-blocks, since if G is any edge-critical graph for which x(G) = 5, but y(G\e) =4 
for each edge e, then M(G) is a minimal 2-block. 

If we let P,, denote the Petersen graph then Tutte (1966) noticed that each of 
the matroids M(K,), F, and M*(P,,) has the additional property that no loop-free 
minor is also a 2-block. A 2-block with this property is called a tangential 2-block. 
In 1966, Tutte proved that these three matroids are the only tangential 2-blocks 
with rank at most 6, and Datta (1976) used a complicated geometrical argument 
to show that there is no tangential 2-block of rank 7. 

Tutte’s tangential 2-block conjecture, originally made in 1966 and still unset- 
tled, can be stated in the following form. 


Conjecture 13.11 (Tutte’s tangential block conjecture). The only tangential 2- 
blocks are M(K,), F, and M*(P,,). 


Seymour’s theory of splitters discussed in chapter 10 is a major step towards 
proving this conjecture. It is related to Hadwiger’s conjecture which, in its full 
form, reads as follows. 


Conjecture 13.12 (Hadwiger). If a loopless graph G is not -colourable it contains 
a subgraph contractible to K,,,. 


Dirac (1957) showed that Hadwiger’s conjecture is true for n =3, and Wagner 
(1964) showed that for 1 = 4 it was equivalent to the four-colour. conjecture, and 
thus it holds for 7 = 4. Thus we know that there can be no new tangential 2-block 
which is the cycle matroid of a graph. Seymour (1980) used his characterisation of 
regular matroids to prove the following striking result. 


Theorem 13.13. Any new tangential 2-block must be the cocycle matroid of a 
graph. 


Seymour’s result shows that Tutte’s tangential 2-block conjecture is exactly 
equivalent to the 4-flow conjecture 13.8. Thus he has reduced this seemingly 
intractable geometrical problem to the conceptually much simpler problem of 
characterising those graphs which have no 4-flow. More precisely, there exists a 
tangential 2-block other than M(K,}, F, and M*(P,,) if and only if there is a 
bridgeless graph G which does not contain a subgraph contractible to the Petersen 
graph and which has no 4-flow. 

There are many other applications of these polynomials to diverse fields 


516 D.J.A. Welsh 


including coding theory (Greene 1976), arrangements of hyperplanes in space 
(Zaslavsky 1975), percolation and reliability (Oxley and Welsh 1979) and more 
recently to knot polynomials (Jaeger et al. 1990). For a much fuller discussion we 
refer to the review article of Brylawski and Oxley (1992). 


14. Varieties and universal models 


Apart from Seymour’s theory of splitters discussed in chapter 10 one of the two 
major developments in the subject over the last decade stems from the paper of 
Kahn and Kung (1982) and can be summarised as follows. 

A hereditary class of geometries (simple matroids) is a collection of geometries 
which is closed under taking minors and direct sums. Thus they are the analogues 
of varieties in universal algebra. A sequence of universal models for a hereditary 
class # is a sequence (T,,) of geometries in # with r(T,) =n and satisfying the 
following property. 


If G is a geometry in # of rank n then G is a subgeometry (restriction 
minor) of T,, . (14.1) 


A variety of geometries is a hereditary class with a sequence of universal 
models. The achievement of Kahn and Kung was to give a complete characterisa- 
tion of the possible varieties. 

First some examples. 


Example 14.2. The collection of geometries coordinatisable over a fixed finite field 
is a variety. The universal models are the projective geometries PG(n, q). 


Example 14.3. The collection of graphic matroids is a variety, the universal models 
are the cycle matroids of the complete graphs. 


Example 14.4. The trivial variety consisting of the free geometries, has universal 
models the Boolean algebras. 


Two other simple varieties are the following: 

(i) Let M, be the rank one geometry and M, be the line with g + 1 points. 
Let M,,(q)=M,®:--®M, and M,,,,(q)=M,®-:-BM,@M,. The 
subgeometries of these geometries form a variety called the variety of matchstick 
geometries of order q. = 

(ii) If B, is the Boolean algebra on {1,...,#} and on each of the lines 12, 
23,...,(a@—1)n we add q- A points in general position let 0,(q) denote the 
resulting geometry. Then subgeometries of these geometries form a variety called 
the variety of origami geometries of order q. 

(iii) A partial partition of N = {1,.-..,n} is a collection of non-empty, disjoint 
subsets of N. Ordered by refinement the partial partitions of N form a geometric 
lattice which is isomorphic to the lattice of flats of the cycle matroid M(K,,) of the 
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complete graph. Now take finite group A and define a partial A-partition of N to 
be a partial partition 7 of N together with a labelling of the elements of each 
block of a with elements from A. The partial partitions can be equated in a 
“projective manner’’ and if the resulting equivalence classes are ordered by 
refinement we again get a geometric lattice of rank n and which is denoted by 
Q,,(A). These lattices were discovered by Dowling (1973), they give a large stock 
of non-representable matroids since he showed that Q,(A) is representable if and 
only if A is a subgroup of the group of units of a finite field. Moreover, 
Q,(A)=@,(A’) iff A and A’ are isomorphic. These lattices known as Dowling 
lattices were demystified to some extent in a series of papers by T. Zaslavsky on 
voltage-graphic matroids (see for example Whittle 1989). What is surprising about 
Dowling geometries is their prominence in Kahn and Kung’s major classification 
theorem. 


Theorem 14,5. Let # be a variety of geometries. Then 2 is one of the following 
collections: 

(1) the variety of free geometries; 

(2) the variety of matchstick geometries of order q; 

(3) the variety of origami geometries of order q; 

(4) the variety of geometries coordinatisable over GF(q); 

(5) the variety of subgeometries of the Dowling geometry Q,(A) for some fixed 
finite group A. 


One consequence of Theorem 14.5 is the following statement which seems hard 
to prove directly. 


Corollary 14.6. The only hereditary classes which are also closed under the taking 
of duals are the varieties of geometries coordinatisable over a fixed finite field. 


Theorem 14.5 has led to renewed interest in the voltage graphic geometries of 
Zaslavsky and their universal model, the Dowling geometries. As an example of 
another extremely beautiful and surprising theory prompted by the paper of Kahn 
and Kung, Whittle (1989) has completely extended the theory of the critical 
problem for projective spaces to the Dowling geometries Q,,(A) for any finite 
group A. 


15. Oriented matroids 


A feature of graphs which is difficult to capture in a matroid is the notion of 
direction. Minty (1966) seems to have been the first to attempt to extend the 
notion of orientability from graphs to matroids but it turned out that his notion of 
orientability was exactly the concept of regularity discussed in chapter 10. A more 
general and interesting concept of orientability which extended that of Minty was 
proposed by Rockafellar (1969). Rockafellar’s ideas were motivated more by 
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considerations from convexity and linear programming and have certainly had a 
strong influence on the theory of oriented matroids as it exists today. 

In what follows I will attempt to give some idea of the different approaches 
which led to the independent discovery of oriented matroids in the middle 
seventies by Las Vergnas (1975), Bland (1977), Folkman and Lawrence (1978). 

We start first with the graphically motivated approach proposed in Bland and 
Las Vergnas (1978). 

Given a matroid M on § an orientation of M is the partitioning of each circuit C 
into a positive part C” and a negative part C, and we say x is positive (negative) 
in C if x EC” (respectively C_). The sign function is defined for any circuit C 
and x EC by 


. _f+ xEC*, 
wesc |s TEE 


Given three circuits C,, C,, C, with C,C C, UC, we say C, satisfies the rule of 
signs with respect to C, and C, if the following conditions hold: 


xEC,N(C,\C,) > sign,,(x) = sign, (X), (a) 
xEC,U(C,AC,) > sign,,(x) = sign,,(x), (b) 
xEC,NC,NC, and sign, (x) =sign,,(x) 

> sign,,(x) = sign, (x) = sign, (x). (c) 


Now it is easy to see that given any graphic matroid all the rule of signs means 
is that when the graph’s edges are directed the resulting orientation satisfies the 
rule of signs. 

However, more than this is needed. An orientation of a matroid M is 
compatible with its structure if for any pair of circuits C, D, and x E CN D such 
that 


sign,.(x) = —sign,(x) 


and y € D\C then there exists a circuit E such that E obeys the rule of signs with 
respect to C and D and yEEC(CUD)\x. 

Finally we say M is orientable if there exists some orientation which is 
compatible with its structure. 

As should be expected from the origin of the definition we have the following 
basic result. 


a 
Proposition 15.1. Every graphic matroid is orientable. 
Proof. Use any orientation of the associated graph. O 


Less obvious is the following. 
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Proposition 15.2. A binary matroid is orientable if and only if it is regular. 
Proposition 15.3. If M is orientable so is its dual M* and so is any minor of M. 


These results 15.2 and 15.3 become more apparent if one takes the alternative 
approach to orientability proposed by Folkman and Lawrence (1978). This is 
motivated more by ideas from convexity. Their definition of an oriented matroid 
(which turns out to be cryptomorphic with that of Bland and Las Vergnas treated 
above) is as follows. 

An involution on a set E is a function *: EE such that (x*)* =x for each 
element x € E; it is fixed point free if x* #x for each x EE. 

An oriented matroid is a triple (E, @, *) where E is a finite set, @ is a collection 
of non-empty subsets of E and * is a fixed point free involution on E such that 

(a) @ is a clutter (that is no member properly contains another); 

(b) If AES then A*E€ and AN A* =9; 

(c) ff A, BE @ and x©& AN B* and A#B* then there exists CE € with 


CC(AUB)\{x, x*}. 


The members of € are called the circuits of the oriented matroid. 


Example. Suppose that E is a finite set of non-zero vectors in a vector space over 
an ordered field and E = —E. 

For x E let x* = —x. 

Let € be the minimal subsets C of E such that 

(i) CNC*t=9, 

(ii) there exist positive real numbers a, (v © C) such that 


> a,v=0. 


vec 


Then (E, €, *) is an oriented matroid. In other words C € € if and only if it is the 
vertex set of a simplex containing the origin in its interior and is not of the form 


{u, —u}. 


The connection between the two definitions of oriented matroid which we have 
given is seen from the following. 

Consider an oriented matroid defined by a triple (E, @, *). 

For each element x € FE and subset AC E define 


X= {x,x*}, A={x:x€A). 
Let € ={C: CE}. Then we have the following. 


Proposition 15.4. if (E,@,*) is an oriented matroid, then the collection € of 
subsets of E is the set of circuits of a matroid on E. 
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Conversely given a matroid M on S which is oriented it is an easy exercise to 
reverse the above process and construct a triple (E, €, *) satisfying the axioms 
@)-©. be 

Let E=S US’ where S’ is a disjoint copy of S. If C=C” UC’ is an oriented 
circuit of M the corresponding circuit of @ is in the obvious notation C* U(C7)’. 

A theory of duality and minors, closely analogous to the unoriented case exists 
for oriented matroids. However, as an indication of the diverse nature of 
orientability consider the following examples. 


Proposition 15.5. Any matroid which is linearly representable over an ordered field 
is orientable. 


Example 15.6. The matroid V, described in section 10 is a matroid which is not 
coordinatisable over any field but it is orientable. 


Proposition 15.7. There exists an infinite collection of non-orientabie matroids, 
none of which is a minor of the other. Hence there can be no finite excluded minor 
criterion for orientability. 


Because the structure of oriented matroids retains many properties of vector 
spaces over ordered fields they are of considerable interest in the study of abstract 
ideas of linear programming and convex polyhedra, for further details we refer to 
the seminal papers of Bland and Las Vergnas (1978), Folkman and Lawrence 
(1978), Las Vergnas (1980), and especially to the recent monograph of Bjorner et 
al. (1993). 


16. Extensions of matroids 


Over the last twenty years the ideas and techniques of matroid theory have been 
generalised in several directions. In this final section we can do no more than give 
very brief pointers to these extensions. 

Historically one of the earliest and certainly one of the most important is the 
theory of polymatroids introduced by Edmonds (1970). Polymatroids are convex 
polyhedra in R" associated in a very natural way with submodular set functions. 
They are treated in greater detail in chapter 11. 

An alternative generalisation of matroids introduced by Dunstan et al. (1972) 
are supermatroids which are defined as follows. 

Let P be a finite partial order with a zero and let # be a collection of elements 
of P satisfying a 

(S1) 0E ¥, 

(S2) if x ¥ and y <x in P then ye ¥, 

(S3) for any element a € P if P(a) = {x: x <a} then all maximal elements of 
P(a)M # have the same height in P. 

The condition (S3) means that rank is well defined and when P is a distributive 
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lattice it turns out to be a submodular function on the lattice. This leads to a 
generalisation of many combinatorial aspects of matroids, for example duality, a 
Rado~Hall theorem and the like. 

Supermatroids are closely connected with F-geometries introduced by Faigle 
(1980) and to the notion of a greedoid introduced by Korte and Lovdsz (1981, 
1983) and discussed below. For a discussion of their interrelationship and details 
of the first matroid intersection theorem for these classes we refer to Tardos 
(1990). 

A greedoid (S, G) is a set system such that 

(G1) EY, 

(G2) AEG, AA®DAxEA such that A\{x} EG, 

(G3) X, YE@, |X|>|Y|> 4x EX\Y such that YU {x} EG. 

This concept was introduced by Korte and Lovdsz (1981) as a characterisation 
of structures for which a greedy type algorithm “works” for a wide class of 
objective functions. As with matroids there are many different axiomatisations, 
the above is the most natural in the present setting but unlike matroids, greedoids 
have alternative formulations in terms of strings and sequences. As a class they 
strictly include matroids and can be characterised (in the obvious way) as 
set-systems having a rank function r satisfying 


r(@)=0, AA)S|Al, XCY > (X)<7(Y), 
together with the following substantially weakened form of submodularity 
r(X) = (XU {a}) = (XU {b}) D> (XU {a} U (b})=r(X). (16.1) 


A particularly important class of greedoid is that of interval greedoids which 
have the additional property 


If A, B, CES with AC BCC and xES is such that AU {x} and 
CU{x}€ G then BU {x} E. (IG) 


As an example of an interval greedoid which is not a matroid and which 
appears not to be in any of the earlier classes consider the following. 


Example (Directed branching greedoid). Let G be a directed graph with r € V(G) 
a distinguished vertex to be called the root. If G is the set of arborescences in G 
rooted at r then (E(G), Y) is a greedoid. 


This example highlights the difference between greedoids and matroids and at 
the same time gives the reader an understanding of the way greedoids might be 
defined in terms of ordered strings. For an account of the now quite well- 
developed theory of greedoids we refer to the monograph of Korte et al. (1991). 

Greedoids include matroids and what are known as antimatroids. These are 
interval greedoids (S, ) which also satisfy the condition that for all A, BE @ 
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with BCA and all x E S\A, 
BU{xsEG D> AUX EG. 


Antimatroids were introduced by Edelman and Jamison (1985) as abstract 
models of convexity. The classic example of an antimatroid is the following. 


Example. Let S be a finite subset of R” and let X = {x,,...,x,}€@ if x; does 
not belong to the convex hull of S\{x,,...,x,} for l1<i<k. Then (S, Y) is an 
antimatroid. 


The above example motivates us to call the complements of members of @ a 
convex set in the antimatroid (S, 4). Various combinatorial properties of convex 
sets can be studied in this more abstract framework. For example, define the 
Helly number of an antimatroid as the smallest integer h with the property (H): 


If # is a collection of convex sets such that every #' C # with |X’|<h 
has a non-empty intersection then the members of # have a non-empty 
intersection, (H) 


The following theorem characterises the Helly-number of an antimatroid. 


Theorem 16.2. The Helly number of an antimatroid (S, G) is the maximum size of 
a set X CS such that every subset of X is convex. 


For more details on this and related topics we refer to Edelman and Jamison 
(1985) or Korte et al. (1991). 

Lastly we remark that throughout this article we have assumed the underlying 
groundset of M is finite. There are many different plausible extensions of the 
finite theory to the infinite case, we refer the reader to Oxley (1978) for an 
indication of possible approaches. 


17. Conclusion 


Over the last twenty years or sa there has been a substantial increase in the 
applications and theory of matroids. In this brief account it has been impossible to 
do more than cover basic concepts and highlight some topics. Others will be 
treated in chapters 10 and 11, but a much fuller picture can be obtained from a 
wide range of texts devoted primarily to the subject, namely Crapo and Rota 
(1970), Tutte (1971), Welsh (1976), Bryant and Perfect (1980), Recski (1989), 
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Truemper (1992), the volumes edited by White (1986, 1987 and 1992) and 
especially the monograph of Oxley (1992) where a comprehensive treatment of 
the fundamentals of the subject can be found. 
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1. Introduction 


A flourishing branch of matroid theory concerns results of the type “for any 
matroid M, either M contains an object of type T, or M has the structure 3, and 
not both”. In this chapter, we survey some such results. 

Although what we mean here by “structure 2” can vary widely, the “object of 
type T” is generally a minor. We begin then with a discussion of minors. Let M be 
a matroid on a set S. For X CS, M\X denotes the matroid N on § — X, in which 
Y CS — X is independent if and only if it is independent in M. We define M/X to 
be (M*\X)* where M* denotes the dual of M. If e€ S, we abbreviate M\{e} to 
M\e, etc. Any matroid expressible in form M\X/Y for some X, YCS with 
XN Y=© is a minor of M. Since (M\X)/Y = (M/Y)\X, the following holds. 


Theorem 1.1. (i) Any minor of a minor of M is a minor of M. 
(ii) If N is a minor of M then N* is a minor of M*. 


The prime example of the kind of theorem we are concerned with is the 
following result of Tutte (1958). 


Theorem 1.2. A matroid is binary if and only if it has no U2? minor. 


(M is representable over a field F if M is isomorphic to the linear independence 
matroid of the columns of some matrix with entries in F, and M is binary if it is 
representable over GF(2). The matroid U* has n elements, any k of which form a 
base. In general, an N minor of M means a minor of M isomorphic to N.) 

We shall sketch a proof of Theorem 1.2 due essentially to Truemper. Let M be 
a matroid on S, and for each base B let A,(B) be the (0, 1)-matrix (@,,: 
e& B, f €S— B) with rows and columns indexed by B and S — B respectively, 
where for e € B and f € S ~ B, a,, = 1 if and only if e belongs to the unique circuit 
included in BU {f}. Now a knowledge of A,B) for one base B does not 
determine M uniquely; but if M,, M, are both matroids on S, and every base B of 
M, is also a base of M, and A 1B) = Ay,(B), then M, = M,. (For, if possible, 
choose bases B, B’ of M, such that B is a base of M,, and B' is not, with 
|B’- B|=1. Let Ay (B)=(a,: e€ B, fES—B), and let B'- B={f}, B- 
B’ = {e}. If a, =1 then B' is a base of M,, a contradiction; while if a,,=0 then 
B’ is not a ‘base of M,, since Ay (B)=A,y,(B), again a contradiction. ) 
Consequently we have the following lemma. 


Lemma 1.3. Let M,, M, be distinct matroids on S, and let B be a base of both 
matroids with Ay (B)=A,,(B). Then there. are bases B’, B” of both matroids 
such that |B” ~ B’ | = 1, Ay (B )=Ay,(B’), and Ay (B")# Ay,(B"); and M,, 
M, are not both binary. 


Proof. Choose a base B’ of both matroids with A,, (B’) = Ay,(B’), such that for 
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some base B” of M, with |B”— B’|=1, either B” is not a base of M, or 
Ay (B") 4 Ay,(B"). The first is impossible since A, (B’)=Ay(B'), and the 
second is what we want. M, and M, are not both binary because a binary matroid 
M is uniquely determined by a knowledge of A,,(B) for one base B (for a 
representation of M may be obtained by adding an identity matrix to 
A,(B)). O 


Proof of Theorem 1.2. Since U? is not binary and minors of binary matroids are 
binary, the “‘only if” part is clear. To prove the “if” part, let M@, be a non-binary 
matroid on S. For each base B of M,, let Ay, (B) be the matrix with rows indexed 
by B and columns indexed by B and columns indexed by S, consisting of A,, (B) 
augmented by a B x B identity matrix. Now choose some base B, and let M, be 
the binary matroid represented by Ay, (B) over GF(2). Since M, is not binary it 
follows that M,#M,, and Ay (B)=Ay (B). Choose B’, B” as in Lemma 1.3. 
Since M, is binary it is represented by both Ay,(B’) and Ay,(B"). Let B’— B= 
{e,}, BY— B' = {f,}, and let A,, (B"), Ay,(B") differ in the (e,, f,)-entry, where 
e,&B", fES—B". Let Z= {e,,f,,€2,f,}, let X=S—(B'UB"\)Z) and let 
Y = (B' UB’) — Z. Now B' NZ, B" NZ are both bases of M,\X/Y and M,\X/Y, 
and they are related as in Lemma 1.3, that is |(B’UZ)—(B'UZ)|=1, 
Ay x/y(B' UZ)= Ay, .xiy(B’ UZ), Ay, \xiy(B" UZ)# Ay, xiy(B" UZ), (for 
these matrices are submatrices of Ay (B’), Ay,(B’), Ay,(B"), Am,(B”)), and 
in particular M,\X/Y # M,\X/Y; and so one of M,\X/Y, M,\X/Y is not binary. 
Since M, is binary, it must be the former. We have shown than that M, has a 
non-binary minor with at most four elements, and now the result follows 
easily. O 


A characterization of the form of Theorem 1.2 can exist for binary matroids 
since any minor of a binary matroid is also binary. On the other hand, it is 
surprising that the characterization is so simple. For one might expect there to be 
many, even infinitely many, minor-minimal non-binary matroids, yet Theorem 1.2 
asserts that there is, up to isomorphism, only one. In general, it happens quite 
frequently that if we calculate the minor-minimal matroids not possessing a 
certain natural structure, we obtain a more elegant answer than we have any right 
to expect, a small, interesting set of matroids. This is the main motivation behind 
the theory of matroid minors; that the exclusion of simple sets of matroids often 
corresponds quite closely with the imposition of natural structures. Of course, it 
does not always work; for instance, there are infinitely many minor-minimal 
matroids not representable over the reals — see Ingleton (1971). 

As for the “‘structure 2”, this can vary widely, but generally what we have in 
mind is a way to describe the matroid with a description of length bounded by a 
polynomial in the number of elements. Most matroids have no such description, 
because there are too many matroids on a given set for them all to have “short” 
descriptions, and so finding such a description is usually very helpful. Our 
structures are all constructions of one kind or another — for instance, we could 
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take as the structure > the properties of being graphic, being representable over a 
given field, being a gammoid, being uniform, or being made by piecing together 
matroids with a structure 5’ in a prescribed way. 

Results of this sort can be divided naturally into two classes, because their 
motivation derives either from the object excluded or from the structure imposed. 
In other words, we might want to obtain a short description of the matroids 
without certain minors, or we might be interested in a certain structure and wish 
to find a characterization of the matroids with this structure by excluded minors. 
We begin with a survey of “structure-driven’’ results like Theorem 1.2. The 
following was first proved by R. Reid (unpublished), and then independently by 
Bixby (1979) and Seymour (1979a). For a short proof see Kahn and Seymour 
(1988). 


Theorem 1.4. A matroid is representable over GF(3) if and only if it has no U?, 
U3, F, or Fj minor. 


(F, is the matroid represented over GF(2) by the seven non-zero 3-tuples.) 

The excluded minors for representability over any other fields are still 
unknown, although results of Kahn (1988) give some hope for GF(4). The 
problem for larger fields is probably too difficult, however. Even G.C. Rota’s 
popular conjecture (Rota 1970) that there are only finitely many minor-minimal 
matroids not representable over each finite field is still open. The general problem 
“which matroids are representable over which fields?” is very complicated; see 
chapter 9 for further details. 

Next we look at matroids representable over all fields. A matrix of real 
numbers is totally unimodular if every square submatrix has determinant +1 or 0. 
(Such matrices, investigated by Hoffman and Kruskal (1956), are important in 
linear programming — see chapter 11.) A matroid is regular if it can be repre- 
sented over the reals by the columns of a totally unimodular matrix. As a 
corollary of Theorems 1.2 and 1.4 we have the following result of Tutte (1958), 
which was provided with an elegant proof by Gerards (1989). 


Theorem 1.5. For a matroid M, the following are equivalent: 
(i) M is regular, 
(ii) M can be represented over every field, 
(iii) M can be represented over GF(2) and GF(3), 
(iv) M has no Uj, F, or F} minor. 


Proof. The equivalence of (iii) and (iv) follows from Theorems 1.2 and 1.4. But 
(i) > (ii) (the same totally unimodular matrix represents M over any field) and 
(ii) > (iii) is trivial. Let us show (iii) > (i). Choose a representation /|.A of M 
over GF(3), and view A as a real matrix of 0s and +1s. By /| A we mean a matrix 
whose columns are partitioned into two sets, the first of which forms an identity 
submatrix. Any representation of a matroid can be converted to one in this 
so-called standard form by row operations.) Since M is binary it follows that | A 


$32 P.D. Seymour 


also represents M over GF(2) (setting —1= +1); and since there is a bijection 
between bases of M and non-singular square submatrices of A, it follows that for 
every square submatrix B of A, 


det(B) = 0 (mod 2) <> det(B) = 0 (mod 3) . 


Since it is easy to show (by induction on the size — pivot on some non-zero entry) 
that a minimal matrix of 0, +1s which is not totally unimodular has determinant 
+2, it follows that A is totally unimodular, as required. O 


Being regular is not exactly a structural condition, since it is not apparent how 
to construct regular matroids; we shall return to this later. 

There is a third major result in a similar vein, the following, due to Tutte 
(1959). 


Theorem 1.6. A matroid is graphic if and only if it has no U ra F,, F|, M*(Ks) or 
M*(K, 3) minor. 


(For a graph G, M(G) denotes its polygon matroid, on the set E(G) and with 
circuits the edge-sets of circuits of G. The dual matroid is denoted by M*(G). If 
M = M(G) for some graph G, M is said to be graphic. K, and K,, are the usual 
“Kuratowski” graphs.) 

Theorem 1.6 is connected with the following well-known theorem of Kuratow- 
ski (1930) and Wagner (1937). 


Theorem 1.7. A graph is planar if and only if no subgraph is contractable to K, or 
to K,. 


Proof. Theorem 1.6 implies that for a graph G, M*(G) is graphic if and only if 
M(G) has no M(K,) or M(K, ,) minor, that is, if and only if no subgraph of G is 
contractible to K; or to K; ,. But G is planar if and only if M*(G) is graphic; for 
if M*(G) = M(H), the “vertex stars” of H give us the face boundaries of an 
embedding of G in the plane. The result follows. O 


This completes the list of the main “structure-driven” excluded minor theorems 
in matroid theory. 


2. Connectivity a 


For most of the remainder of this chapter we study the oppositely motivated 
theorem. Characteristically, such results involved connectivity, and we begin by 
discussing that. 

A separation of a matroid M on a set S is a partition (A, B) of S. The order of 
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(A, B) is 
r(A) + r(B) —r(S) +1, 


where r(X) denotes the rank in M of X C S. Thus every separation has order >1. 
If k 2 1 is an integer, a k-separation of M is a separation (A, B) of order <k with 
|A|, |B| =k. We say M is k-connected if it has no k’-separation for 1<k'<k. (We 
usually abbreviate “‘2-connected”’ to “‘connected”’.) 

For example, we have the following. 


Theorem 2.1. For a connected graph G, 
(i) if |VG)|=2, M(G) is connected &G is 2-connected and loopless, 
(ii) if |V(G)| =3, M(G) is 3-connected @G is 3-connected and simple, 
(iii) for every separation (A,B) of M(G) of order <k one of A, B spans 
MG is k-connected. 


Connectivity may enter into our theorems in at least three ways, and this gives 
a convenient classification of the results. The simplest way is when the structure 
predicted is just a low-order separation. For example, from graph theory we have 
the following form of a theorem of Menger (1927) (see chapters 1, 2). 


Theorem 2.2. Let G be a graph, let X, Y CV(G), with |X| =|Y| =k. Then exactly 
one of the following holds: 

(i) there are k paths of G from X to Y, mutually vertex-disjoint, 

(ii) there is a separation (A,B) of G with XCV(A), YCV(B) and |V(A)N 
V(B)| <k. 


(A separation of a graph G is a pair (A, B) of subgraphs with V(A) UV(B) = 
V(G), E(A) U E(B) = E(G), and E(A) MN E(B) =2.) 

The following elementary result generalizes the k = 2 case of Theorem 2.2 to 
matroids. 


Theorem 2.3. Let x, y be distinct elements of a matroid M. Then either there is a 
circuit of M containing both x and y, or there is a 1-separation (A, B) withxE A 
and y © B, and not both. 


For given X, YC V(G) with |X| =|Y| = 2, we may add new edges x, y joining 
the elements of X and of Y respectively, and apply Theorem 2.3 to deduce 
Theorem 2.2 with k=2. A similar device due to Tutte (1965) allows us to 
reformulate Theorem 2.2 in matroid terms for general k, and thereby generalize 
Theorem 2.2 to matroids. Given X, Y as in Theorem 2.2, we add two trees to G, 
both with k — 1 edges, connecting together the vertices in X and in Y respectively. 
The paths of Theorem 2.2 (i) exist if and only if we-may contract a subgraph of G 
such that the vertex set of one tree is contracted onto the vertex set of the other, 
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the vertices of each tree remaining distinct from one another. Thus the following 
(Tutte 1965) generalizes Theorem (2.2). 


Theorem 2.4. Let M be a matroid ona set S, and let X, Y CS with |X|=|Y|=k- 
1 and XM Y=. Then exactly one of the following holds: 

(i) M has a minor N on X UY such that X and Y are both bases of N, 

(ii) there is a separation (A, B) of M of order <k with XC A and YCB. 


In spite of the fact that edges of a graph are ‘‘matroidal objects” while vertices 
are not, it is more difficult to find a matroid generalization of the version of 
Menger’s theorem for edge-disjoint paths. One partially successful way to do so is 
as follows. If M is a matroid on a set S and e€ 5S, let us say e is a free-flowing 
element of M if for all integers k =O and for all non-negative integer functions c 
on S — {e}, exactly one of the following holds: 

(i) there are k circuits of M, each containing e and at most c( f) containing any 
other element f 

(ii) there is a cocircuit D of M with e€ D and eer cf) <k. 

Then the edge-disjoint form of Menger’s theorem may be expressed in matroid 
language, as follows. 


Theorem 2.5. Every element of a graphic matroid is free-flowing. 
This was extended by Minty (1966) to the following. 
Theorem 2.6. Every element of a regular matroid is free-flowing. 


Unfortunately for our attempted generalization, not all elements of all elements 
of all matroids are free-flowing. The following was proved (with a long and 
cumbersome proof) by Seymour (19776). 


Theorem 2.7. An element e of a matroid M is free-flowing if and only if there is no 
Uj or F* minor of M using e. 


Another method of proving Theorem 2.7 was given by Truemper (1987), using 
a result of Tseng and Truemper (1986); the latter explicitly find all matroids not 
possessing these two minors. In the next section we shall sketch their proof, 
because it illustrates the use of a number of results that we wish to survey. 


3. Connectivity to ensure the apread of information 


Suppose that we wish to understand the structure of matroids which, for some 
special element e, do not have a certain minor containing e. If the minor we are 
excluding is connected, the absence of this minor imposes no restriction what- 
soever On components which do not contain e, and so we would only hope to get a 
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structural description of the component containing e. (A component of a matroid 
is a connected minor with maximal element set.) The same kind of thing happens 
with higher connectivity, as we shall see in this section. 

How then to prove Theorem 2.7? One half is easy; if e is free-flowing in M, 
then it is free-flowing in every minor N of M which contains e, and yet no element 
of U or F} is free-flowing. This proves the “only if” direction of Theorem 2.7. 

For the “‘if’’ direction, we wish to analyze the matroids which have an element 
not in any Uj or Fj minor. The first step is the following seminal result of Bixby 
(1974). 


Theorem 3.1. Let e be an element of a matroid M. There is a U’, minor of M using 
e if and only if the component of M containing e is non-binary. 


In other words, we have the following. 


Theorem 3.2. Let e be an element of a connected matroid M. If M has a U2 minor 
then it has such a minor using e. 


To prove Theorem 3.2, we first verify it for matroids with 5 elements. Then we 
let N be a U2? minor of M (which we may assume does not use e) and the result 
follows from the following easy lemma of Seymour (1977a). 


Lemma 3.3. Let N be a connected minor of a connected matroid M, and let e be an 
element of M which is not an element of N. There is a connected minor N' of M 
using e such that N= N'\e or N'/e. 


Oxley (1984) showed that the curious property of U? in Theorem 3.2 is not 
possessed by any other matroid except tiny ones; nevertheless, Theorem 3.2 does 
have certain generalizations. For instance, one can again use Lemma 3.3 to show 
the following 


Theorem 3.4. Let e be an element of a connected matroid M. If M has a U3, U3, 
F, or F} minor, then it has such a minor (i.e., one of the four) using e. 


Another relative of Theorem 3.2, due to Seymour (1985), is the following. 


Theorem 3.5. Let e, f be distinct elements of a 3-connected matroid M. If M has a 
Ui minor then it has such a minor using both e and f. 


There is a parallel relative of Lemma 3.3 which can be used to prove Theorem 
3.5, due to Bixby and Coullard (1987). 


Theorem 3.6. Let N be a 3-connected minor of a 3-connected matroid M, and let e 
be an element of M which is not an element of N. There is a 3-connected minor N' 
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of M which uses e, such that N is a minor of N' and N’' has at most four more 
elements than N. 


A similar result of Truemper (1984) is the following. 


Theorem 3.7. Let N be a 3-connected minor of a 3-connected matroid M#N. 
There is a 3-connected minor N' of M such that N' # N, N is a minor of N' and N' 
has at most three more elements than N. 


Let us return to our proof of Theorem 2.7. We are given an element e of a 
matroid M, not in any U2 or F} minor, and we wish to prove that e is 
free-flowing. By induction we may assume that M is connected, and then 
Theorem 3.2 implies that M is binary. Again by induction (although the details 
are a little awkward — see Truemper 1987) we may assume that M is 3-connected 
and there is no 2-separation (A, B) of M/e with |A|, |B| 23. But then we may 
apply the following (difficult) theorem of Tseng and Truemper (1986) (see also 
Bixby and Rajan 1989). 


Theorem 3.8. Let e¢ be an element of a 3-connected binary matroid M, such that 
there is no 2-separation (A, B) of M/e with |A|, |B| 23. Then one of the following 
holds: ; 
(i) there is a F} minor of M using e, 
(ii) M is regular, 
(iii) M= F,. 


By means of Theorem 3.8 we may assume that our matroid is regular; but by 
Theorem 2.6, every element of a regular matroid is free-flowing. This completes 
the proof of Theorem 2.7. 

Theorems 2.4 and 2.7 arose then as matroid generalizations of versions of 
Menger’s theorem (or the max-flow min-cut theorem). There is another theorem 
about disjoint paths in graphs which is capable of matroid generalization 
(Robertson and Seymour 1990, Seymour 1980b, Shiloach 1980, Thomassen 1980, 
and see also Jung 1970). 


Theorem 3.9. Lets,,5,, t,, t, be distinct vertices of a graph G such that there is no 
separation (A, B) of G with s,, 55, t,, ty €V(A), |E(B)| 24 and |V(A)NV(B)|< 
3. Then exactly one of the following holds: 

(i) there are paths P,, P, of G such that P, has ends s,, t; (i=1,2) and 
VP) NV(P,) =D, 

(ii) G is planar, and can be @rawn in the plane with s,, 82, t,, t, incident with the 
infinite region, occurring on its boundary in that order. 


The condition about there being no separation (A,B) in Theorem 3.9 is 
important, for if there were such a separation (A,B), we could obtain no 
information about what is inside B from assuming the falsity of (i). For even if B 
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is highly non-planar, the paths in (i) cannot utilize this non-planarity because they 
cannot both get into B and out again. This is another instance of high connectivity 
being needed to “‘ensure the spread of information”’. 

To obtain a formulation of Theorem 3.9 in matroid language, we use an idea of 
Chakravarti and Robertson (unpublished), as follows. Given G as in Theorem 
3.9, we form a new graph G* by adding three edges e, f, g to G, joining $18, Sot; 
and f,t, respectively. Then paths as in Theorem 3.9 (i) exist if and only if there is 
a circuit of M*(G*) containing e, f and g. We would therefore have a matroid 
generalization of Theorem 3.9 if we could answer in general when three given 
elements of a matroid are in a circuit. This is still open, but for binary matroids is 
solved by the following, due to Seymour (1986), for it is easy to reduce the 
general binary problem to the case when M is 3-connected and internally 
4-connected. (A matroid M is internally 4-connected if there is no 3-separation 
(A, B) with |A], |B] > 4.) 


Theorem 3.10. Let e, f, g be distinct elements of a 3-connected, internally 
4-connected binary matroid M. Then one of the following holds: 
(i) there is a circuit D of M with e, f, gED, 
(ii) M = M(G) for some graph G, and e, f, g are incident in G with a common 
vertex, 
(iii) {e, f, g} is a cocircuit of M. 


4. Connectivity to eliminate hybrid structure 


In the previous section, we studied the second way in which connectivity may 
enter into our results; there we were looking for minors with some elements 
prescribed, and as we might expect, the absence of such minors gives us 
information about the structure of the whole matroid provided that no part of the 
matroid is “insulated” from our prescribed elements by a low-order separation. In 
this section, we study the third way that connectivity may enter, which is rather 
more unexpected. In some situations when we exclude certain minors, there is 
more than one structure possible; a sufficiently connected matroid without our 
minors will have either this structure or that, while a matroid of low connectivity 
may have some hybrid structure. We begin with an easy example from graph 
theory. 

Let us say a graph H is a minor of a graph G if H can be obtained from a 
subgraph of G by contracting edges. (If H is a minor of G then M(H) is a minor 
of M(G), but the converse need not hold. In our applications, however, the 
converse will always hold, since H and G will be sufficiently connected, and so we 
hope no confusion will arise.) A graph is outerplanar if it can be drawn in the 
plane with all its vertices incident with the infinite region. Chartrand and Harary 
(1967) showed the following. 


Theorem 4.1. G has no K, or K,, minor if and only if G is outerplanar. 
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(This may be derived from Theorem 1.7 by adding a vertex adjacent to all 
others.) This result, and Theorems 4.2—4.6, are discussed more fully in chapter 5. 

If we just exclude K, we get quite a different structure. A graph is series— 
parallel if it can be constructed by starting with a forest, adding loops, and 
repeatedly replacing edges by pairs of edges, either “in series” or “in parallel”. 
Dirac (1952) and Duffin (1965) showed the following. 


Theorem 4.2. G has no K, minor if and only if G is series—parallel. 


On the other hand, if we just exclude K, , we get a structure not much different 
from outerplanarity. 


Theorem 4.3. G has no K,, minor if and only if every block of G is either 
outerplanar or has <4 vertices. 


(A block of G is a maximal 2-connected subgraph.) In other words, we have the 
following. 


Theorem 4.4. If G is 2-connected, then G has no K,,, minor if and only if either G 
is outerplanar or |V(G)| <4. 


Thus, if we exclude K, ,, we get two possible structures for 2-connected graphs, 
one non-trivial and one trivial. The same phenomenon happens quite frequently, 
although in most interesting cases we need to assume 3-connectedness instead of 
2-connectedness. For instance, Hall (1943) showed the following. 


Theorem 4.5. [f G is 3-connected, then G has no K,, minor if and only if either G 
is planar or |V(G)| <5. 


Similarly, Wagner (1937) showed the following. 


Theorem 4.6. if G is a simple 3-connected graph with no K, minor, then either G 
has no V, minor or G=V,. 


(V, is constructed from a circuit of length 8 by adding 4 edges joining the 
opposite pairs of vertices, and = denotes isomorphism.) 


Theorem 4.7. If M is a 3-connected binary matroid, then M has no F> minor if 
and only if either M is regular or M= F,. 


Theorem 4.8. If M is a 3-caynected regular matroid, then either M has no Ryo 
minor or M = Ryo. 


(Rig is the matroid represented over GF(2) by the ten S-tuples with three 1s 
and two Os.) 
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These four results are all corollaries of the following, due to Seymour (1980a). 


Theorem 4.9. Let ¥ be a class of matroids closed under isomorphism and under 
taking minors. Let N © ¥ be 3-connected with at least four elements, and such that 
there is no 3-connected M€&¥ with one more element than N and with N as a 
minor. Suppose also that if N is a wheel, ¥ contains no larger wheel, and if N is a 
whirl, ¥ contains no larger whirl. Then for every 3-connected M & §F, either M has 
no N minor or M=N. 


(A wheel is a polygon matroid of the graph obtained from a circuit by adding 
one vertex joined (by one edge) to all others. A whirl is obtained from a wheel by 
declaring the “rim’”’ circuit independent, while leaving the dependence of all other 
sets unchanged.) 

For instance, to derive Theorem 4.7 we let ¥ be the class of binary matroids 
with no F} minor, and we let N = F,. We verify that no 3-connected 8-element 
member of ¥ has N as a minor, and then Theorem 4.7 follows from Theorem 4.9, 
The other three results follow in the same way. 

There is an attractive formulation of Theorem 4.9 which was independently 
discovered (for graphs) by Negami (1982), as follows. 


Theorem 4.10. Let N be a 3-connected minor of a 3-connected matroid M with at 
least four elements. Suppose that if N is a wheel, no larger wheel is a minor of M, 
and similarly for whirls. Then either M = N or there is a 3-connected minor N' of 
M with an element e such that one of N’\e, N'/le is isomorphic to N. 


(It is important that we only ask that our matroid be isomorphic to N; if we ask 
that it zs N, the best we do is Theorem 3.7.) 

To sce the equivalence of Theorems 4.9 and 4.10, we proceed as follows. It is 
clear that Theorem 4.10 implies Theorem 4.9. For the converse, given M and N 
as in Theorem 4.10, let ¥ be the class of matroids isomorphic to minors of M. 
Then Theorem 4.10 follows by applying Theorem 4.9. 

Another corollary of Theorem 4.10 is the following “wheels and whirls” 
theorem of Tutte (1966a). 


Corollary 4.11. Let M be a 3-connected matroid with at least four elements. Then 
either M is a wheel or whirl, or for some element e one of M\e, Mie is 
3-connected. 


Proof. Certainly M has a 3-connected wheel or whirl minor with =4 elements (for 
example, from Theorem 1.6 and Theorem 4.2, because U Fi is a whirl and M(K,) is 
a wheel). Thus we may choose a 3-connected minor N of M such that N # M and 
N has at least four elements, since we may assume that M is not a wheel or whirl. 
Choose N with as many elements as possible. By Theorem 4.10 there is a 
3-connected minor N’ of M such that for some element e of N’, one of N'\e, N’/e 
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is isomorphic to N. From the maximality of N, N’=M, and the result 
follows. O 


Let us illustrate the use of Theorem 4.10 further by a more complicated 
application, the following theorem of Wagner (1960). 


Theorem 4.12. Let G be a 3-connected simple graph with |V(G)|>4, with no 
K,—e minor. Then either M(G) is a wheel or G=K,, or G=C,. 


(K,—e is the graph obtained from K, by deleting one edge. C, is the 
complement of a circuit of length 6.) 


Proof. We assume G is not isomorphic to K,, or C,. For any simple 3-connected 
graph H, if one of M(H)\e, M(H)/e is isomorphic to one of M(K;.;), M(C,) for 
some edge e, then H has a K,—e minor (as we see by checking cases). By 
Theorem 4.10 it follows that G has no K, , or C, minor. But it has a K, minor by 
Theorem 4.2, and so we may choose a minor H of G such that M(#) is a wheel, 
as large as possible. For any simple 3-connected graph J, if M(J)\e is a wheel for 
some e & E(J) then J has a K, — e minor; while if M(J)/e is a wheel then J has a 
K,, or C, minor. In either case no such J is a minor of G. By Theorem 4.10 we 
deduce that G = H, as required. O 


Two more difficult theorems of the same type were proved by Oxley (1987a,b). 


Theorem 4.13. Let M be a 3-connected binary matroid with at least four elements. 
Then M has no W, minor if and only if M is a minor of A, for some n=3. 


(W, is the “four-spoke wheel”. A, is the matroid represented over GF(2) by 
the n n-tuples with n — 1 Os and one 1, together with the n n-tuples with one 0 and 
n-1 1s.) 


Theorem 4.14. Let M be a 3-connected matroid representable over GF(3), with no 
M(K,) minor. Then either M is a whirl or M has at most twelve elements. 


(Indeed, there are only two maximal (non-whirl) such matroids, one with 8 
elements and the other with 12, related to the (5, 6, 12) Steiner system.) 

So far in this section the results have only used 3-connectedness. There are two 
results of the same kind which use varieties of 4-connectedness. The first is due to 
Wagner (1937). 


Theorem 4.15. Let G be axsmple 3-connected graph with no K, minor. Then 
either G is planar, or G=V,, or there are three vertices u, v, w of G such that 


G\{u, v, w} has at least three components. 


In particular, every 4-connected graph with no K, minor is planar. Wagner used 
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this to prove his “equivalence theorem”, the equivalence of the four-colour 
conjecture with the four-colouring case of Hadwiger’s conjecture. 
The second is the following, due to Seymour (1980a). 


Theorem 4.16. Let M be a 3-connected regular matroid. Then either M is graphic, 
or M is cographic, or M=R,o, or there is a 3-separation (A, B) of M with |Al, 
|B| > 6. 


(M is cographic if M* is graphic.) 

In particular every 4-connected regular matroid except Rj, is graphic or 
cographic. We shall give some applications of Theorem 4.16 in section 6. The 
following lemma is used to prove Theorem 4.16, and is sometimes of use in its 
own right. 


Lemma 4.17. Let M be a 3-connected regular matroid. Then M is either graphic or 
cographic if and only if M has no Ry or R,, minor. 


(R,, is the 3-sum of M*(K,,) and M(K, —e), using the 3-element circuit of 
K,—e through the three vertices of valency 4. ‘‘3-sum” is defined in the next 
section.) 


5. Decomposition 


While Theorem 4.4 applies only to 2-connected graphs, it is equivalent to 
Theorem 4.3 which applies to all graphs. In this section we describe a decomposi- 
tion method for matroids which allows us similarly to extend our results from 
highly-connected matroids to all matroids. 

If M, and M, are matroids on S,, S, respectively, where S, 1S, = @, we define 
M to be the matroid on S, US, in which a set is a circuit if and only if it is a 
circuit of one of M,, M,. If S, and S, are not empty, we call M the 1-sum (or 
direct sum or disjoint union) of M, and M,. The following is clear. 


Lemma 5.1. If M is the 1-sum of M, and M, then (S,,5,) is a 1-separation of M. 
Conversely, if (A, B) is a 1-separation of M, then M is the 1-sum of M\B and 
M\A. 


Now let M,, M,be matroids on S,, S, with S$, 1S, = {z}, where z is not a loop 
Or coloop of either M, or M,, and with |S,|, |S,|=3. Let M be the matroid on 
S=S, US, — {z}, in which X CS is a circuit if and only if either 

(i) X is a circuit of one of M,, M,, or 

(ii) (XN S,) U {z} is a circuit of M, (i= 1,2). 

M is indeed a matroid, called the 2-sum of M, and M,. The next two lemmas 
are due to Seymour (1980a). 
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Lemma 5.2. if M is the 2-sum of M, and M, then (S,—S,,S,—S,) is a 
2-separation of M. Conversely, if (A, B) is a 2-separation of M of order 2, and z is 
a new element, then there are matroids M,, M, on AU {z}, BU {z} respectively 
such that M is the 2-sum of M, and M,. 


Lemma 5.3. If M is the 1- or 2-sum of M, and M, then M,, M, are both 
isomorphic to proper minors of M. 


(A minor of M is proper if it is not M itself.) 
Armed with these three lemmas, let us return to the results of the last section. 
From Theorem 4.7, for instance, we deduce the following. 


Theorem 5.4, Every binary matroid with no F¥ minor may be constructed by 1- 
and 2-sums, starting from regular matroids and copies of F,. 


Proof. Let M be binary, with no F> minor. We proceed by induction on the 
number of elements of M. If M is 3-connected then it satisfies the theorem by 
Theorem 4.7. If not, then it is expressible as a 1- or 2-sum of two smaller matroids 
by Lemmas 5.1 and 5.2. By Lemma 5.3 these smaller matroids are isomorphic to 
minors of M and hence themselves have no F} minor. From our inductive 
hypothesis, these smaller matroids can be constructed in the required way, and 
hence so can M, as required. O 


We remark that Theorem 5.4 has an easy converse, that only binary matroids 
with no F} minor may be constructed as in Theorem 5.4. 

Similarly, Theorem 4.13 yields a construction for all binary matroids with no W, 
minor, and so on. To go further, we need some composition corresponding to 
3-separations. This is difficult in general (but see Truemper 1985), and we shall 
only attempt it for binary matroids. If M is a binary matroid on S, we say X CS is 
a cycle of M if X may be partitioned into circuits. Let M,, M, be binary matroids 
on S,, S,, with S, 1S, = Z, where |Z| = 3 and Z is a circuit of M, and of M,, and 
Z includes no cocircuit of either matroid, and |S,|, |S,|=7. Let M be the binary 
matroid on S=S, AS, (X AY means (X— Y)N(Y¥—X)), in which X CS is a 
cycle if and only if there are cycles X, of M, and X, of M, with X= X, AX). 
Then M is said to be the 3-sum of M, and M,. Seymour (1980a) showed the 
following. 


Theorem 5.5. if M is the 3-sum of binary matroids M, and M, then (S, — S,, S, ~ 
S,) is a 3-separation of M, and |S, — S,|, |S, — S,|2=4. Conversely, if (A, B) is a 
3-separation of a binary matroid M with |A|, |B|=4, then there are binary 
matroids M,, M, on sets S,+S, with S,-—S,= A, S,—$,=B such that M is the 
3-sum of M, and M,. 


Theorem 5.6. If M is 3-connected and is the 3-sum of M, and M, then M,, M, are 
both isomorphic to proper minors of M. 
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Using Theorems 5.5 and 5.6 we deduce the following from Theorem 4.16. 


Theorem 5.7. Every regular matroid may be constructed by 1-, 2- and 3-sums, 
starting from graphic and cographic matroids and copies of R4o. 


Brylawski (1975) proved the converse. 
Theorem 5.8. [f M is the 1-, 2- or 3-sum of two regular matroids then M is regular. 


A warning; our 3-sum decomposition is not invariant under duality (that is, if 
M is the 3-sum of M, and M, then M* need not be the 3-sum of Mf and 
M3 - indeed, it never is). Since by Theorem 5.5 we can express M as a 3-sum as 
soon as we find a 3-separation (A, B) with |A|, |B} =4, and (A, B) is also a 
3-separation of M*, it follows that we could equally well express M* as a 3-sum, 
and what we do is our choice. This choice makes a significant difference, as 
illustrated by the following, which is an extension of Theorem 4.15 to regular 
matroids, due to Seymour (1981a). 


Theorem 5.9. For a 3-connected regular matroid M, the following are equivalent: 
(i) M has no R,y, M(V,) or M(K,) minor, 
(ii) M is not isomorphic to R,, or M(V,), and M has no M(K,) minor, 
(iii) either M=M(K, ,), or M is constructible by 3-sums starting from cog- 
raphic matroids, 
(iv) M* is constructible by 3-sums starting from graphic matroids and copies of 
M*(K;,). 


Proof. (ii) > (i) follows from Theorem 4.9. That (iii), (iv) > (ii) we omit. Let us 
sce that (i)> (iii), (iv). To prove (i)> (iii) we proceed by induction on the 
number of elements of M. We may assume that M is not a 3-sum of two matroids 
M_,, M,; for if so then M,, M, are not isomorphic to M(K;, ,) since the latter has 
no 3-element circuits, and so by induction M,, M, and hence M are 3-sums of 
cographic matroids. By Theorem 5.7, M is graphic or cographic, and we assume 
that it is graphic, and M=M(G) say, where G is simple and 3-connected. By 
Theorem 4.15, either G is planar or G=V, or K;,,, or there is a 3-separation 
(A, B) of G with |E(A)|, |E(B)|=4. The last cannot occur because M is not 
expressible as a 3-sum; and certainly G is not isomorphic to V,; and so either M is 
cographic, or M= M(K,, 3), aS required. To see (i) > (iv), again we proceed by 
induction, and we may assume that M* is not a 3-sum. By Theorem 5.7, M* is 
graphic or cographic, and we may assume the latter, and M* = M*(G) say, where 
G is simple and 3-connected. As before, either G is planar or G ~ K, ,, and so 
either M* is graphic or M*(G) = M*(K,,;), as required. O 


Although (iii) and (iv) are equivalent structures by Theorem 5.9, they do not 
seem to be obviously equivalent. In particular, they dre not just reformulations of 
One another under duality. For instance, the 3-sum of two graphic matroids is 
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graphic, while the 3-sum of two cographic matroids need not be cographic. Also. 
K,, occurs in (iii) and (iv) in completely different ways. In (iii) it is just an 
exceptional case, while in (iv) it is crucial; without it all the matroids constructed 
would be graphic. The difference between (iii) and (iv) is the more remarkable 
because their derivations from (i) are so similar. 


6. Applications of the regular matroid decomposition 


Next, we sketch several applications of Theorems 4.16, 5.7 and 5.9. The first is to 
totally unimodular matrices, Theorem 5.7 yields a construction for them. To 
obtain this, we have to express the 1-, 2- and 3-sum constructions in matrix terms. 
They become basically ‘‘diagonal” constructions, as follows. (Our pictures in the 
following are not intended to imply any particular ordering of rows and columns. 
u, u, etc. will be column vectors, and vertical and horizontal lines represent 
partitions of the rows and columns.) 


Construction 6.1. /f A and B are totally unimodular matrices, then so is 
A 0 
0 Bo 
Construction 6.2. If A|u and B|v are totally unimodular, then so is 
A u-u" 
0 BT? 
where ' denotes transpose. 


Construction 6.3. If Alu,|u,|u, and B|v,|v.|v, are totally unimodular, ana 
u, tu, + uy =0 and v, + v, + v; = 0, and every column of C is one of +u,, +uy, 
+u,, 0, and every row of C is one of +v,,, tv}, +v;, 0, then 


A Cc 
0 Bt 
is totally unimodular. 


Construction 6.4. If x = +1, and 
A uy uy B | vy v, 
us 0 x vy 0 x 
are both totally baimbidilar. then so is 


T 

A U,'U2 
T 

vu, B 
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In each case we combine two totally unimodular matrices to produce a third. 
Let us define the size of an m Xn matrix to be m+n. If a matrix is produced 
from two other matrices of strictly smaller size by one of these four constructions, 
we say that the large matrix is the composition of the two smaller ones. 

These then will be our constructions; now we need the building blocks. Let T 
be a tree with its edges directed. A path P of T provides a path vector from T, a 
vector of 0, +1s indexed by E(T), where the coordinate corresponding to 
e € E(T) is +1, —1 or 0 depending on whether P uses e forwards, backwards, or 
not at all. A matrix is a tree matrix if there is some directed tree such that every 
column of the matrix is a path vector from 7. Tree matrices are totally 
unimodular, and so are their transposes. 

Let us say a matrix is sporadic if it is 5 x 5 and can be obtained from one of the 
matrices of fig. 1 by permuting rows and columns and scaling some rows and 
columns by —1. 


-1 i oO oO 1 11001 
1. -=1 1 0 60 01101 
0 1 -!1 1 0 00111 
0 oOo 1 -!I 1 1001 1 
1 0 0 1. = 1 111141 


Figure 1. Sporadic matrices. 


Sporadic matrices are also totally unimodular. Our structure theorem for totally 
unimodular matrices is the following. 


Theorem 6.5. Every totally unimodular matrix can be constructed by repeated 
composition, starting from tree matrices, transposes of tree matrices, and sporadic 
matrices. 


Proof. Let A be a totally unimodular matrix. We prove, by induction on the size 
of A, that it can be constructed as claimed in Theorem 6.5. Prefix A with an 
identity matrix, to obtain /| A, and let M be the matroid represented by the 
columns of /|.A. Then M is regular and so Theorem 4.16 applies. But it can be 
shown that 


M=R,, © Ais sporadic , 
M is graphic A is a tree matrix , 


M is cographic A‘ is a tree matrix , 
M has a 1-separation© A is a composition as in Construction 6.1 , 
M has a 2-separation © A is acomposition as in Construction 6.2 . 


Moreover, if none of these apply, then 
M has a 3-separation (X, Y) with |X|, |¥|>4 
< A is a composition as in Constructions 6.3 or 6.4. 
The result follows. O 
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The argument of Theorem 6.5 may be adapted to yield a polynomial algorithm 
to test if a matrix is totally unimodular, using an algorithm of Cunningham and 
Edmonds (1980) to test if M has a 1-, 2- or 3-separation as above. 

Our second application is due to Gerards et al. (1990). It concerns the structure 
of graphs not containing an ‘“‘odd-K,”. An odd-K, is a graph obtained from K, by 
replacing its edges by internally disjoint paths, in such a way that the parity of 
each circuit is unchanged. Some examples of graphs with no odd-K, subgraph are 

(i) graphs G such that G\v is bipartite for some vertex v (G\u is obtained 
from G by deleting v), 

(ii) planar graphs G which can be drawn so that at most two regions have an 
odd number of sides. 

Gerards et al. showed that every graph with no odd-XK, can be constructed from 
these two special kinds by piecing them together at small cutsets. The composition 
rules are quite complicated, however, and we restrict ourselves here to the 
adequately connected case. 


Theorem 6.6. Let G be a simple, 3-connected graph with no 3-separation (A, B) 
with |E(A)|, |E(B)| 25. Then G has no odd-K, if and only if G is of type (i) or 
(ii) above. 


Proof. Take a new element f, and let M be the binary matroid on E(G)U {f} 
such that M/f = M(G) and every circuit of M has even cardinality. Now it is easily 
shown that M has an F> minor using f if and only if G has an odd-K,. Thus we 
may apply Theorems 3.8 and 4.16. The result follows since 


M is graphic G is of type (i) , 
M is cographic © G is of type (ii) . 


(Actually, the paper gives a proof which bypasses the use of Theorem 3.8.) O 


Our third application is similar, and is due to Lovdsz-—see Gerards et al. 
(1990). It concerns the structure of graphs such that any two odd circuits have a 
common vertex. Some examples of such graphs are 

(i) graphs G such that G\v is bipartite for some vertex v, 

(ii) graphs G with three edges e, f, g forming a circuit, such that the graph 
obtained by deleting e, f, g is bipartite, 

(iii) simple graphs with <5 vertices, 

(iv) graphs which can be drawn in the projective plane such that every region 
has an even number of sides. 

Lovasz showed the following. 


Theorem 6.7. Let G be a simple, 3-connected, internally 4-connected graph. Then 
every two odd circuits of G have a common vertex if and only if G is of one of the 
four types (i),..., (iv) above. 
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Proof. Let M be the binary matroid on E(G), with cycles the cycles of M(G) of 
even cardinality. If M has an F, or Fj minor one can show that two odd circuits 
of G are disjoint. Thus by Theorem 1.5 we may assume that M is regular. Except 
in easy cases M is 3-connected and internally 4-connected. If M=R,, then 
G = K,, and if M is graphic some detailed analysis shows that G is of types (i) or 
(ii). Suppose then that M is cographic, and M = M*(H) for some graph H with 
E(H) = E(G). There are |V(H)| circuits of M corresponding to the vertex stars of 
H, and every edge is in exactly two of them. Now we know the rank of M 
(assuming that G is not bipartite), and hence |V(H)| = |E(G)| — |V(G)| + 1. Every 
circuit of M is a disjoint union of circuits of M(G), and so we obtain a 
“2-covering” of the edge-set of G by circuits of G, at least |E(G)|— |V(G)| + 1 of 
them. But any 2-covering may be viewed as the face-boundaries of an embedding 
of G in some pseudo-surface, and so Euler’s formula implies that the number of 
circuits in a 2-covering is at most |E(G)|—|V(G)| +2, with equality only if the 
pseudo-surface is a sphere, and missing equality by 1 only for the projective plane 
and for the sphere with two points identified. The result follows from this and the 
observation that the stars of H have even cardinality (since every circuit of M is 
even). DO 


Our text application is to the “sums of circuits” property. Let M be a matroid 
on a set S. With each circuit C of M let us associate a (0, 1)-function x°: S>R, 
where x“(e) =1 if and only if e € C (R is the set of real numbers). The cone of 
circuits of M is the set of all non-negative linear combinations of these functions 
x°. We say that M has the sums of circuits property if for every p: SR, p 
belongs to the cone of circuits if and only if 

(i) p(e) =0 for all eES, 

(ii) for every cocircuit D and f ED, p(f)< Ui(ple):e ED — {f}). 

(‘Only if” is always true.) The following was proved by Seymour (1979b). 


Theorem 6.8. Graphic matroids have the sums of circuits property. 


Which other matroids have this property? The answer was given by Seymour 
(1981a), as follows. 


Theorem 6.9. For a matroid M, the following are equivalent: 
(i) M has the sums of circuits property, 
(ii) M has no U2, F*, Ry or M*(K;) minor, 
(ili) M* is constructible by 1-, 2- and 3-sums, starting from cographic matroids 
and copies of F;, M(K,,) and M(V,). 


Proof. If M is the 1-, 2- or 3-sum of M, and M,, and M*, M} have the sums of 
circuits property, one can show that so does M*. Thus (iii)> (i), for graphic 
matroids have the property by Theorem 6.8 and one verifies directly that F,, 
M*(K,;,), M*(V,) have the property. (Incidentally, it is not true that if M is the 
3-sum of M, and M,, and M,, M, both have the property then so does M; this 
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explains the dual formulation of (iii).) Now (i) (ii) is easy, and it remains to 
show that (ii) > (iii). Let Mf satisfy (ii). Then M is binary, by Theorem 1.2, and 
we may assume by induction that M* is not expressible as a 1-, 2- or 3-sum. By 
Theorem 4.7, either M = F, or M is regular; and if the latter, then by Theorem 
5.9 applied to M*, either M* = M(V,) or M(K,,) or M* is cographic. The result 
follows. O 


There are many similar results in the same paper of Seymour (1981a), 
formulating a graph optimization theorem (particularly multicommodity flow 
theorems) in terms of matroids, and finding an excluded minor characterization of 
the matroids with the property by applying a structure theorem. We omit further 
details. 

Our final application is to a problem of Tutte. A cocycle in a binary matroid is a 
set which can be partitioned into cocircuits. Let us say that a binary matroid M is 
a tangential 2-block if M is minor-minimal with the property that it has no loops 
and no two cocycles have as union the element set of M. This is connected with 
the four-colour problem (see chapters 4 and 5). Tutte (1966b) proposed the 
following. 


Conjecture 6.10. The only tangential 2-blocks are F,, M(K,;), and M*(P), where 
P is the Petersen graph. 


Using the structure theorems, Seymour (1981b) showed the following. 
Theorem 6.11. Every tangential 2-block is cographic, except for F, and M(K,). 


Proof. Let M be a tangential 2-block, not isomorphic to F, or M(K,). Then it has 
no F, or M(K,) minor. But if M* is the 1-, 2- or 3-sum of Mj and M3, it is easy 
to see that since M,, M, both are covered by two cocycles, so is M, a 
contradiction. Thus M* is not a 1-, 2- or 3-sum, and so, by Theorem 5.4, either 
M = F7, or M is regular; and in the latter case, by Theorem 5.9, either M = Rios 
or M= M(V,), or M = M(K,,), or M is cographic. Now none of F7, Ri, M(V,) 
and M(K, ,) are tangential 2-blocks, and so M is cographic as required. O 


In view of Theorem 6.11, Conjecture 6.10 is equivalent to the conjecture that 
M*(P) is the only cographic tangential 2-block; or equivalently, 


Conjecture 6.12. If G is a graph with no isthmus and with no Petersen graph 
minor, then E(G) may be expressed as the union of two cycles of G. 


(A cycle of G is a subset ofx€(G) which may be partitioned into circuits.) 
It is shown in the Appendix to chapter 4 that Conjecture 6.12 is equivalent to 
Tutte’s well-known ‘‘nowhere-zero 4-flow” conjecture. 
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1. Introduction 


This chapter considers matroid theory from a constructive and algorithmic view- 
point. A substantial part of the developments in this direction have been motivated 
by optimization. Matroid theory has led to a unification of fundamental ideas of 
combinatorial optimization as well as to the solution of significant open problems 
in the subject. In addition to its influence on this larger subject, matroid optimiza- 
tion is itself a beautiful part of matroid theory. 

The most basic optimizational property of matroids is that for any subset ev- 
ery maximal independent set contained in it is maximum. Alternatively, a triv- 
ial algorithm maximizes any {0,1}-valued weight function over the independent 
sets. Most of matroid optimization consists of attempts to solve successive gen- 
eralizations of this problem. In one direction it is generalized to the problem of 
finding a largest common independent set of two matroids: the matroid intersec- 
tion problem. This problem includes the matching problem for bipartite graphs, 
and several other combinatorial problems. In Edmonds’ solution of it and the 
equivalent matroid partition problem, he introduced the notions of good charac- 
terization (intimately related to the NP class of problems) and matroid (oracle) 
algorithm. 

A second direction of generalization is to maximizing any weight function over 
the independent sets. Here a greedy algorithm also works: Consider the elements 
in order of non-increasing weight, at each step accepting an element if its weight is 
positive and it is independent of the set of previously-accepted elements. A most 
significant step was Edmonds’ recognition of a polyhedral interpretation of this 
fact. He used the fact that the greedy algorithm optimizes any linear function over 
the convex hull of characteristic vectors of independent sets to establish a linear- 
inequality description of that polyhedron. He then showed that a greedy algorithm 
also works for the much larger class of polymatroids: polyhedra that are defined 
from functions that, like matroid rank functions, are submodular. This polyhe- 
dral approach led to the solution of the weighted matroid intersection problem 
and extensive further generalizations, culminating in the optimal submodular flow 
problem. Other important related problems are that of minimizing an arbitrary 
submodular function and a polymatroid generalization of nonbipartite matching. 
Complete solutions of these problems remain to be discovered, but substantial 
progress has been made. 

There is a second aspect of matroid theory, other than the optimizational one, 
for which a constructive approach is desirable. This might be called the structural 
aspect. We seek constructive answers to such fundamental questions as the con- 
Nnectivity, graphicness, or linear representability of a given matroid. It is pleasant to 
realize how many of the classical structural results of Tutte and Seymour are essen- 
tially constructive in nature. In fact, the most important structural result to date, 
Scymour’s characterization of regular matroids, yields an algorithm to recognize 
regularity by reducing that problem to recognizing graphicness and connectivity, 
two problems to which Tutte contributed substantially. On the other hand many 
structural questions can be proved to be unsolvable by matroid algorithms. In- 
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deed, recognizing regularity turns out to be esscntially the only question on linear 
representability that is solvable. 

We use the matroid terminology introduced in chapter 9 by Welsh. The following 
additional notation is used. For a subset A of the set S, we use A to denote S\A. 
Where J is an independent set of matroid M = (S,¥), ande €J with J u {e} ¢ ¥, 
we use C(/,e) to denote the unique circuit (fundamental circuit) contained in 
JU {e}. Use of this notation will imply that J U {e} ¢ ¥. 


2. Matroid optimization 


Two of the most natural optimization problems for a matroid M = (S, 4) with 
weight vector c € f° are to find an independent set of maximum weight, and to 
find a circuit of minimum weight. These generalize standard optimization prob- 
lems on graphs. We show in this section that the classical greedy algorithm solves 
the independent set problem. While this problem is easy, it leads to very useful 
polyhedral methods. Considering the greedy algorithm requires discussion of the 
efficiency of matroid algorithms. It turns out that, with respect to the resulting 
notion of algorithmic solvability, the circuit problem above is intractable. 


2.1. The optimal independent set problem 


If we are asked to find a maximum-cardinality independent set, we know from the 
independent-set axioms (chapter 9 by Welsh) that any maximal independent set is 
a solution. Hence the following trivial algorithm works: Where S = {e),e2,...,€n}, 
start with J = @ and treat the e; sequentially, adding e; to J if and only if J U {e;} 
is independent. The maximum-weight independent set problem 


max(c(/): J € ¥) (2.1) 


includes the above problem as a special case (take each cj = 1). Moreover, (2.1) is 
solved by making two simple modifications to the above algorithm. First, we treat 
the elements of S in order of non-increasing weight, and second we do not add 
to J any negative-weight elements. The resulting method is the greedy algorithm 
(GA). 


Greedy algorithm for the maximum-weight independent set problem. 


Order S = {e),€2,...,@n} So that Ce, 2 Ce, Bos 2 Cy, FOP Cony Bi S Ce, 
J:=9@ 
For i= 1 to m do 

IfJ U{e;)} € # then J :=JU {e;} 


Theorem 2.2. For any matroiaM = (S,9) and any c € B®, GA solves (2.1). 


Proof. Suppose that J = {j1,...,j¢} is found by GA, but Q = {q,...,@¢} has 
larger weight. Assume that the j; are in the order in which GA added them, and 
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that Cg, 2 Cg, 2 -** S Cq,- There is a least index i such that cg, > c;, OF cg, > O and 
i>k. Then {j),.--,j;-1} is a basis of A= {j\,...,j:-1,9,---,q:}, for otherwise 
GA would choose one of q),...,q; aS j;. But {q1,...,9;} is a larger independent 
subset of A, a contradiction. 


A pair (S,F#) satisfying only that § #0 and A C B € J implies A € ¥, is some- 
times called an independence system. Both the problem (2.1) and the greedy algo- 
rithm can be stated for any independence system, and one wonders whether other 
independence systems are similarly nice. However, the matroid axioms say that 
(S,#) is a matroid if and only if GA solves (2.1) for every c € {0,1}5. In view of 
this observation, Theorem 2.2 can be restated as follows. 


Theorem 2.3. An independence system (S, ¥) is a matroid if and only if GA solves 
(2.1) for every cE R®. 


A second proof of Theorem 2.2, almost as short and much more useful, leads 
us to the polyhedral method. 


Proof of Theorem 2.2. Let x be the characteristic vector of the set J produced 
by GA, and let x be the characteristic vector of any independent set J’. Then 
cUJ') = SceXe, =¢-x. Let T; = {e),...,e} forO <i <n. Notice that x(7;) 2 x(7;) 
for 1 <i < m, because J NT; is a maximal independent subset of 7;. Then 


m n 
eon) Ce, Xe, + 5 Ce, Xe; 
i=l 


i=m+l 


m n 

= See (x(Ti) — (Ti) + SS Cee 
ve2{ f=m+1 
m1 x 


= So, = Ce,,, ) X(T) + Cmx (Tm) + ». Ce, Xe; 


i=\ i=m+1 
m-1 n 
‘Cie! ao om: - 
< Soe, = Ce, (Ti) + CmX (Tim) + Se CeXe;- 
f= i=—m+1 


But the last line is ¢- x, since the inequality holds with equality forx=x. O 


Notice that the only properties of x, used in the second proof of Theorem 2.2 
were that x > 0 and x(7;) < ¥(7;)(=r(T;)), 1 <i<m. So GA actually solves the 
following linear programming problem, since £(7;) = r(T;) implies x(T;) < x(7j): 

maximize c - x 
subject to x(A)<r(A), ACS; (2.4) 
x20. 


This observation implies the following Matroid Polytope Theorem of Edmonds 
(1970). 
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Theorem 2.5. For any matroid M = (S,¥%), the extreme points of the polytope 
P(M) = {x € RS: x(A) <r(A) for all A C S} are precisely the characteristic vectors 
of independent sets of M. 


Proof. It is easy to see that the characteristic vector of any independent set is an 
extreme point of P(M). Now let x’ be an extreme point of P(M). Then there is c € 
R° such that x’ is the unique optimal solution of max(c-x: x € P(M)). Applying 
GA to M and this c. we obtain x, the characteristic vector of an independent set, 
and x solves the same linear programming problem. Hence x’ = x, as required. 

Oo 


The dual linear program to (2.4) is: 


minimize }>(r(A)y,:A CS) 
subjectto Silva: jE A) 2c, forjeS; (2.6) 
ya 20, for ACS. 


Analysis of the second proof of Theorem 2.2 shows that the following formula, 
sometimes called the dual greedy algorithm, gives an optimal solution y’ to (2.6): 


Vp = te Ces STS MM 


U . . 
Yr, = Cems 


y, =O for allother ACS. 


It follows that (2.6) has an optimal solution that is integer-valued whenever c is 
integer-valued, that is, that the constraint system of (2.4) is totally dual integral. 
(This important notion is developed in chapter 30 by Schrijver. One fundamental 
fact that we do mention, is that if Ax < b is totally dual integral, b is integer-valued, 
and max(cx : Ax < 6) has an optimal solution, then it has one that is integer- 
valued.) One can also use y’ and the linear-programming optimality conditions to 
prove the following converse to Theorem 2.2. (The point here is that there can be 
many choices of the ordering of 5S, because of equal weights or zcro weights.) 


Theorem 2.7, J € ¥ has maximum c-weight if and only if it can be found by GA. 


2.2. Efficient matroid algorithms 


How efficient is GA? Because sorting can be done in polynomial time, GA runs 
in polynomial time if and only if there is a polynomial-time algorithm to answer 
the question: “Is J € #?” (In fact, we necd to answer at most n such questions.) 
It is usual for matroids to be represented, for purposes of algorithms, by such an 
(independence-testing) oracle. an abstract matroid algorithm is govd if both the 
number of calls to the oracle ana the amount of additional computation is bounded 
by a polynomial in n and the size of any additional input (such as the weights). 
Hence GA is a good matroid algorithm. For concrete classes, such as matroids 
represented by matrices, for which there exists a subroutine for independence- 
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testing that is polynomial-time in the usual sense (that is, relative to input size), a 
good matroid algorithm does yield a polynomial-time algorithm. 

Having defined what is meant by “good matroid algorithm” and having described 
an important example, we digress briefly on some related issues. First, let us explain 
why the oracle representation is necessary. A natural alternative would be to have a 
general encoding for matroids, as we do, say, for graphs, and to measure algorithm 
efficiency relative to the encoding-size of the input matroid. The difficulty is that 
the number of matroids on an n-element set (see chapter 9 by Welsh) is so large 
that any general encoding scheme would require space exponential in n, which 
would affect the meaning of polynomial time. 

Since we are not using the usual model of computation, the theory of NP- 
completeness does not play the same role, and it is natural to ask whether there 
is an analogous theory of difficult matroid problems. We call a matroid problem 
intractable if there is no good matroid algorithm solving the problem. In contrast 
to the situation in ordinary complexity theory, we can prove that certain natural 
problems are intractable. To illustrate a typical proof method, we consider the 
problem of finding the girth (minimum size of a circuit) of a matroid. Let |S| = 
n = 2m, let M be the uniform matroid of rank m on S, and, for each A C S with 
|A| =m, let M, denote the matroid obtained from M by making A a circuit. If 
an algorithm concludes that the girth of M is m+1 after making fewer than (27) 
oracle calls, then there is some A such that the same (incorrect) conclusion would 
have been reached if the input had been M,. (Notice that the argument uses the 
fact that all calls other than A receive the same answer for both M and M,.) 
Hence a correct algorithm requires at least Cy calls, which is not bounded by a 
polynomial in n, and so computing the girth is intractable. (We mention that it 
is an open problem whether there is a polynomial-time algorithm to compute the 
girth of the matroid of a given binary matrix; on the other hand, finding a smallest 
circuit containing a fixed element in such a matroid is known to be NP-hard.) 
Other examples of intractable problems are described in later sections. Proofs can 
be more intricate than this example, but use similar “adversary” arguments. 

Finally, we discuss why independence is an appropriate choice of oracle on which 
to base a theory of matroid algorithms. Consider the following alternatives: (i) 
“What is r(B)?”; (ii) “Is B a circuit?”; (iii) “What is the minimum size of a circuit 
contained in B?” It is easy to see that (i) is equivalent to the independence-testing 
oracle, since there is a good matroid algorithm for each with respect to the other. 
On the other hand (ii) is weaker than independence-testing: |B|+1 calls to the 
independence-testing oracle will be enough to determine whether B is a circuit, 
but there can be no polynomial-time simulation of independence-testing by an 
oracle for (ii). (To see this, use an adversary approach, considering matroids on S$ 
having at most one circuit.) So (ii) would lead to a theory of matroid algorithms 
with respect to which independence-testing’is intractable—-in contrast to what the 
standard classes of matroids would lead us to expect. On the other hand, we have 
seen that (iii) would lead to a theory in which the existence of a good matroid 
algorithm would not imply the existence of a polynomial-time algorithm for the 
corresponding matrix problem. 
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3. Matroid intersection 


We are given matroids M,,M2 on the same set S. We want to find a maximum 
weight (or maximum cardinality) common independent set. Obviously this problem 
generalizes the optimal independent set problem (2.1), taking M, = M2. Some 
examples of applications are the following: 


Problem 3.1. finding a maximum-weight matching in a bipartite graph; 


Problem 3.2. finding a maximum-weight branching (forest in which each node 
has indegree at most 1) in a digraph; 


Problem 3.3. finding a maximum-weight subgraph that is the union of k forests 
in a graph; 


Problem 3.4. given bases B,,B, of a matroid M and X, C B,, finding X2 C B, 
such that (B,\X)) U X2, (Bo\X2) UX are both bases. 


Problem (3.1) is a direct special case of weighted matroid intersection. Where 
{V., V2} is a bipartition of G = (V,E), take S = E and 9 = {J C S: each ve V; 
is incident with at most one element of /}. Problem (3.2) is also a special case; 
(3.3) and (3.4) will be treated later. We begin the discussion with the maximum- 
cardinality case. 


3.1. Matroid Intersection Theorem 
It is obvious, for any J € 4, 1.9% and any A CS, that 
J] = OA] +E NA] <r (A) 4 (A). 


So if we can find J and A satisfying this relationship with equality, then we know 
that this J is maximum. In fact, such a pair (J, A) always exists; this is the content of 
the Matroid Intersection Theorem of Edmonds (1970). We remark that applying 
this theorem to maximum-cardinality bipartite matching as above immediately 
yields K6nig’s Theorem (chapter 3 by Pulleyblank). While the theorem follows 
from the algorithm described later, we first give a short non-constructive proof. 


Theorem 3.5. For matroids M,,M> on S, 
max(|J|: J € 4, 9 £2) = min(r,(A) +1r2(A): A C S$). 


Proof. The proof is by induction on |S|. Let k be the minimum of r)(A) + r2(A), 
and choose e € S with {e} € ¥%, 9 J. (If none exists, k = 0, and we arc finished.) 
If the minimum of r|(A) + 72(S’\X) over subsets A of S’ = S\{e} is k, then we are 
finished, by induction. If M/ denotes M;/{e} and the minimum of 7} (B) +73(S'\B) 
over subsets B of S’ is at least kK — 1, then induction gives a common independent 
set of M!,M, of size k — 1; adding e to it gives the desired J. We conclude that, if 


Matroid optimization and algorithms 559 


there is no common independent set of size k, then there exist subsets A, B of S’ 
such that 


r(A) +7(S'\A) <k—1 
and 
ni(BUu {e}) —l+m((S'\B)u {e}) -1 <k -2. 
Adding and applying submodularity, we have 
r(AUBU {e}) +r (ANB) +r (S\(ANB)) +r (S\(AUBU {e})) 
< 2k -1, 
it follows that the sum of the middle two terms, or the sum of the other two terms, 


is at most K—1, acontradiction. QO 


(Notice that this proof gives no hint of how to find the desired subsets J and A 
efficiently.) 


3.2. The Matroid Intersection Algorithm 


The Matroid Intersection Algorithm (MIA) uses an augmenting path approach that 
generalizes a common method for bipartite matching. It maintains J € $,M J, at 
each step either finding a larger such J, or finding A giving equality with J in 
the min-max formula, proving that J is maximum. Given J € %, M 4:, an auxiliary 
digraph G = G(M,, M2,/) having node-set S U {s,} is constructed. It contains: 


an arc (e,t) for every e € S\J such that J U {e} € 4); 
an arc (s,¢) for every e € S\J such that J U {e} € 4; 
an arc (e, f) for every ee S\J,f EJ such that UU {e})\{f} € A; 
an arc (f,e) for every e € S\J, f EJ such that (JU {e})\{f} € Fo. 


The following “augmenting path” theorem is the basis for the algorithm. 


Theorem 3.6. (a) /f there exists an (s,t)-dipath in G, then J is not maxi- 
mum; in fact, if 8,e1,fi,..-, ks fesCkatt is a chordless (s,t)-dipath, then J'= 
JU {e1,-. eet) \Ufise- fet € Fr OF. (b) Uf there exists no (s,t)-dipath in G, 
then J is maximum; in fact, if A C S and 6*(AU{s}) =9, then |J| =1(A) + 72(A). 


We remark that, while the restriction to chordless dipaths is not necessary for 
the special case of bipartite matching, it is needed in general. The proof of (b) is 
easy. Consider e € A\J. Since (e,f) is not an arc, J U {e} contains an M;-circuit C. 
Since there is no arc (e,f) with f € S\A, we have C C (ANJ)U {e}. Therefore, 
JOA My-spans A. Similarly, J 7A M2-spans A. Hence 


J] = OA] +|FN A| =7;(A) +7(A). 
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We can deduce (a) from the following result, whose proof is a straightforward 
induction. (Recall that o(A) denotes the closure of the subset A.) 


Lemma 3.7. Let M = (S,F£) be a matroid, let J € F and let x\,y\,...,X4, Vx bea 
sequence of distinct elements of S such that 

(a) x, ¢J,y,€J forl <ick; 

(b) y, ECU, x;)) for 1 <ick; 

(c) yw €CU, x) forl <i<j<k. 
Then where J' = (JU {x1,.0-.%% P\ Oi. Ve}, J’ € F and o(s’) = a). 


3.3. Efficiency of the Matroid Intersection Algorithm 


Since MIA will terminate after at most = |S| augmentations, since the dipath 
or set A as in Theorem 3.6 can be found with standard methods, and since each 
auxiliary digraph can be constructed with O(n’) independence tests, it follows that 
MIA is a good matroid algorithm. Here we mention the complexity for some re- 
finements and special cases. Just as MIA generalizes a basic bipartite matching 
algorithm, more efficient versions of MIA generalize some of the ideas used to 
speed up matching and network flow algorithms. (See chapter 2 by Frank.) The 
most important refinement is a natural one: At each step augment J using a shortest 
(s,t)-dipath of the auxiliary digraph. Of course, any such dipath will automatically 
be chordless. The next result implies that a large proportion of the resulting aug- 
mentations will be on very short augmenting paths. It is from Cunningham (1986), 
and part (a) is also due to Gabow and Stallmann (1985). 


Theorem 3.8. Jf MIA using shortest augmenting paths is applied to matroids 
M,,M) on S, |S| =n, then: 

(a) The length of a shortest augmenting path never decreases, and the number 
of different lengths is O(./n); 

(b) The sum of lengths of augmenting paths used is O(nlogn). 


It follows from Theorem 3.6(a) that the work of the algorithm can be divided 
into O(/n) stages; during each stage all augmenting paths have the same length. It 
is possible to find and perform all of the augmentations of a stage more efficiently 
than if the auxiliary digraph were reconstructed after each stage. This leads to 
a version of MIA (Cunningham 1986) that requires O(n?) independence tests 
rather than the O(n?) of the basic algorithm. How close is this bound to being best 
possible? One of the few results on this theme is that the greedy algorithm for 
finding a basis of a single matroid is optimal; it requires exactly n independence 
tests, and an easy argument shows that no algorithm uses fewer. For matroid 
intersection nothing more is known, that is, it is an open problem to find a nonlinear 
lower bound on the number of independence tests required. 

For concrete classes of matroids, one can usually obtain better bounds than 
arise from simply multiplying the number of oracle calls by the oracle complexity. 
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For example, for matroids arising from two matrices each having at most n rows, 
Theorem 3.8(b) can be used to show that the complexity of MIA is O(n lognz), 
assuming that arithmetic operations are counted as single steps. As another exam- 
ple, Gabow and Stallmann (1985) have given an O(p?>) time bound for MIA on 
the cycle matroids of two graphs, each having at most p nodes. 


3.4. Matroid partitioning 


Many of the applications of matroid intersection are most easily derived through 
the theory of matroid partitioning. In fact this theory is equivalent to that for 
matroid intersection and actually was discovered earlicr by Edmonds. 

The matroid partitioning problem is, given matroids M; = (S, ¥;), 1 <i <k, to 
find a maximum cardinality subset J C S that is partitionable, that is, J = UU; : 
1 <i<¢k) where J; € %, 1<i<k. Obviously, we may assume that the J; are 
disjoint. Moreover, the assumption that all M; have underlying set S$ is made only 
for convenience. The main result of the theory is the following Matroid Partition 
Theorem. It is implicit in Edmonds and Fulkerson (1965). The special case in which 
the M; are all equal is the subject of Edmonds (1965a). We mention that a theorem 
of Rado (1942) (see chapter 9 by Welsh) can be shown to be equivalent. 


Theorem 3.9. Let J be a maximum-cardinality partitionable subset with respect to 
M; = (S, $i), 1<i <k. Then \J| =minacs (hy ri(A) + \Al) 


As usual we can observe that for any such J and A, we have 


\J| =|F\A] + 2 Al] 
<|S\A| +2, NA] 
<|A|+%r;(A). 


The matroid partitioning problem is reduced to a matroid intersection problem 
as follows. [This construction, and the reverse one described later, are due to 
Edmonds (1970).] Make k disjoint copies S),52,...,5% of S, and imagine M; as 
being defined on S; rather than S. Let Nz be the direct sum of the M, and let 
N, be the matroid on S’ = US; in which a set is independent if and only if it 
contains at most one copy of e for cach e € S. It is easy to see that there is a 
correspondence between partitionable sets with respect to M;,...,M, and common 
independent sets of Na, Np. It is also easy to see that a set B C S’ that minimizes 
ra(B) + r,(S’\B) can be chosen to consist of all the copies of elements of A, for 
some A C S. It follows from the Matroid Intersection Theorem that the maximum 
size of a partitionable set is min(|A| + =7;(A): A C S), proving Theorem 3.9. 

There is an important strengthening of Theorem 3.9, namely that, in its state- 
ment, “maximum-cardinality” can be replaced by “maximal”. It follows from the 
observation that whenever a copy of e is deleted from the common independent 
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set of N,,N, by the intersection algorithm, it is replaced by another copy of e, so 
that no element is ever deleted from the partitionable set. Since the same argu- 
ment could be applied to maximal partitionable subsets of an arbitrary subset B, 
we conclude that every maximal partitionable subset of B has the same cardinality. 
A consequence is the following result, which appeared explicitly for the first time 
in Nash-Williams (1967). 


Theorem 3.10. The subsets of S partitionable with respect to the M;, form the 
independent sets of a matroid. Its rank function r is given by r(B) = min(|B\A|+ 
ir;(A): AC B). 


There is a neater description of the partitioning algorithm, obtained by identi- 
fying all copies of each element e of S in the auxiliary digraph for the intersection 
algorithm. The resulting digraph has node-set S U {s,t} and has 


an arc (s,e) for each e € S\/; 
an arc (e,t) for each e € S such that J; U {e} € 4; for some i; 


an arc (e, f) for each e, f € S such that f € C;(;, e) for some i. 


At termination of the algorithm, any set A C S such that 6*(A U {s}) = 0 has the 
property that J; A M,-spans A, and so |J| = |A| + &7;(A). It is worthwhile also to 
observe that every set A that minimizes |A| + %7;(A) must have this property and 
so must satisfy 5*(A U {s}) =. 

Now recall Problem 3.3 at the beginning of the section. By Theorem 3.10 the 
feasible solutions form the independent sets of a matroid. Hence Problem 3.3 can 
be solved by the greedy algorithm, with independence tests requiring applications 
of the matroid partition algorithm to k copies of the cycle matroid of the graph. The 
Matroid Partition Theorem applied to this example yields standard graph results 
of Nash-Williams and Tutte on the existence of disjoint spanning trees and the 
covering of edges by forests. (See chapter 2 by Frank.) There are also beautiful 
combinatorial applications in transversal theory; see chapter 9 (by Welsh) and 
Mirsky (1971). 

Finally, let us describe Edmonds’ reduction of intersection to partitioning. The 
proof is easy. 


Theorem 3.11. Let B be a maximal partitionable subset with respect to M, and 
Mj, and let J\,J2 be an associated partitioning. Extend Jz to a basis Bz of M}. Then 
B\Bz is a maximum-cardinality common independent set of M, and Mp. 


3.5. Basis exchange 


Recall Problem 3.4. If X2 has the required properties, then B)\X2, Xz provides a 
partitioning of B, with respect ‘b the matroids M, = M/X, and M, = M/(B,\X;). 
Thus X, can be found with the Matroid Partition Algorithm. Moreover, applying 
the Matroid Partition Theorem one gets, after a short calculation, the following 
result of Brylawski and Greene. 


Matroid optimization and algorithms $63 


Theorem 3.12. Let B,, Bz be bases of a matroid M and let X; C B,. Then there 
exists X2 G By such that (B\\.X\) UX. and (B2\X2) UX; are also bases. 


3.6. Solution of the Shannon game 


The Shannon game, proposed by Shannon and generalized to matroids and solved 
by Lehman (1964), is a game played on a matroid with a single distinguished 
element e. (See also chapter 43 by Guy.) The two players, Short and Cut, alternately 
choose elements of S\{e}, with elements chosen by Cut deleted from M and 
clements chosen by Short contracted. Short’s (Cut’s) objective is to reach a minor 
in which e is a loop (cotoop). 

A game (M,e) is called short (cut) if there is a winning strategy for Short (Cut) 
playing second (and hence also playing first). The game is neutral if it is neither cut 
nor short, that is, if the first player, whether Cut or Short, has a winning strategy. 
It is easy to see that (M,e) is a cut game if and only if (M*,e) is short, and that 
(Me) is neutral if and only if (M,e) is not short and (M’,e) is short, where M’ 
is obtained from M by adding an element parallel to e. Hence it is enough to 
characterize short games. 


Theorem 3.13. (M,e) is short if and only if there exist disjoint independent sets 
11,4 of M such thate € o(1,) =o(h) ande gl Uh. 


The “if” part of the theorem can be proved by checking that the following strat- 
egy, applied iteratively, works. If Cut plays f € J, then Short plays e € i, where 
(with respect to the current minor just before f is deleted) (7, U {e})\{f} is inde- 
pendent, and similarly for f € 4. (If Cut does not play an element of 7) Us, then 
this only makes life easier for Short.) The “only if” part can be proved (this is 
harder) by showing that /,,/. exist for one of (M,e),(M*,e),(M’',e). 

The condition of Theorem 3.13 is easily recognized by the partitioning algorithm 
applied with M,; = M, = M. (In fact, Lehman’s work was one of the motivations 
for Edmonds’ development of matroid partitioning.) We know that the minimiz- 
ers of g, where g(A) = |S\A|+2r(A), are precisely the sets A CS such that, in 
the auxiliary digraph at termination of the algorithm, 5*(A U {s}) = 0. There is 
a unique smallest such A (easily found); call it A’. If e € A’, then the minimum 
of g remains the same when ¢ is deleted, so there is a maximum partitionable set 
J =J, Uh withe ¢J. Then e € A! = oJ, NA’) = 0 (2 NA’), 80, NA NA’ are 
the required sets. On the other hand, we claim that if such sets J), /) exist, then 
necessarily e € A’. 

For suppose we start the partition algorithm with J; = 11, J. =. Then in the 
auxiliary digraph, we have 5*(o(/;)) =, so no augmenting path can use any el- 
ement of o(/,) and so 1) C4), b Ch, e ¢ J, UJ will be maintained throughout 
execution of the algorithm. At termination, 6*(A’U {s}) =@, and so e€ A’, as 
required. Finally, we point out that the sets /;,/,, can be found by applying the 
algorithm to M\e, M \e after first checking that ¢ € A’. 

Further analysis of the game and interesting extensions can be found in Edmonds 
(1965b), Bruno and Weinberg (1971), and Hamidoune and Las Vergnas (1986). 
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3.7. Weighted matroid intersection 


Just as r;(A) +r2(A) provides an upper bound for the size of a common indepen- 
dent set J, where c € R® we can define a simple upper bound for c(/). Namely let 
(c',c?) be a “weight-splitting”. that is, c! +c? =c. Then 


cJ) =cl(VJ)+ es) < maxc'(J:) +maxc"(J2). (3.14) 


Hence, if we find J € 4,4, and a weight-splitting (c!,c?) such that equality 
holds in (3.14), we know that J has maximum weight. In fact, this is always possible. 


Theorem 3.15. For matroids M,,M2 on S and ce RS, there exists a weight-splitting 
(c!,c*) and a set J such that J has maximum c'-weight among independent sets of 
Mi, for i=1 and 2. 


We shall also see that, if c is integer-valued, then there is an integral weight- 
splitting in Theorem 3.15. Actually, Theorem 3.15 is equivalent to Theorem 3.16, 
the Matroid Intersection Polytope Theorem of Edmonds (1970). [Notice that the 
converse of (3.16) is trivially true.] 


Theorem 3.16. For matroids M,,M2 on S, every extreme point of P(M,)M P(M2) 
is the characteristic vector of a common independent set. 


Proof of equivalence of Theorem 3.15 and 3.16. Suppose that Theorem 3.15 holds 
and let * be an extreme point of P(M,) P(M2). Choose ¢ € R® such that x is the 
uniquc optimal solution of max(c-x: x € P(M,)M P(M2)). Then by Theorem 3.15 
we have J € #, M F and (c!,c”) such that c(J) =cl(VJ) +e?) oc! -¥ +0? -¥=c-k. 
It follows that < is the incidence vector of J. 

Now suppose that every cxtreme point of P(M,)M P(M2) = {x € RS :x(A) < 
r\(A), x(A) < r2(A), for A CS} is the characteristic vector of a common inde- 
pendent set. Given c € R®, let J be a maximum-weight common independent set, 
and let x be its characteristic vector. Let (y',y*) be an optimal solution of the 
linear program dual to max(c- x: x € P(M;) P(M>)). Then the optimality condi- 
tions imply that |J 1 A] = r;(A) whenever y‘(A) > 0. For i = 1 and 2 andj € S, let 
c= 0'(A): j € A); then c <c! +c. Thus Z,y' satisfy the optimality conditions 
for max(c’ -x: x € P(M,)) and its dual, so J is c'-optimal in M,;, for i =1 and 2. 
Moreover, c(J) =c' (J) +e?(/), so c) +c; >; implies j ¢ J, so G can be lowered 
to c; —c} without affecting the c?-optimality of J. O 


Although Edmonds (1970) used the idea of weight-splitting in a non-constructive 
proof of Theorem 3.16, it was M4ank (1981) who showed how to use the optimality 
conditions based on Theorem 3.15 to simplify the weighted Matroid Intersection 
Algorithms of Edmonds (1979) and Lawler (1975). We describe here an algorithm 
based essentially on Frank’s, but with an additional simplification. 
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The basic idea is to generalize the unweighted MIA by using a weight-splitting 
to assign costs to the arcs of the auxiliary digraph. Where c, denotes max(ci,: e¢ 
J,J U{e} € 4), the arc costs wz are defined by: 


We shall require that (c',c?),/ satisfy the properties c? = 0, and w,, > 0 for each 
arc uv. If in addition, we have c} <0, then the conditions of Theorem 3.15 are 
satisfied (by Theorem 2.7) and we are finished. We can begin with c! =c, c? = 
0, J =9@. (Notice that, if we augment on an (s, ¢) dipath P to obtain J’ from J, 
then the cost of P is cj +c) + c(J) — c(J’). This motivates choosing P to have least 
cost. In fact, solving a least-cost dipath problem gives a way to update the weight- 
splitting too. This observation makes possible the following simpler presentation 
of Frank’s algorithm. We remark that Lawler also used a shortest-path calculation, 
but without weight-splitting.) 


Iteration of weighted MIA. 

If ci <0, stop; 

Form G = G(M,,M2,J,c!,c’); 

Compute a dipath from s to uv of least-cost d, for each uv € SU {s,t}; 

For each v € S, let a, = min(d,,d,,cl), and replace c} by c} — ay, c2 by 2 + ay; 
If c4 <0, stop; 

Augment J on a zero-cost (s,t) dipath having as few arcs as possible. 


Notice that the resulting algorithm has essentially the same complexity as its 
unweighted version, since the non-negative-cost shortest-path calculation can be 
done in time O(n”) (see chapter 2 by Frank). We outline a proof of validity of the 
algorithm. There are three things to check: (i) that the change in (c!,c”) preserves 
the properties required of it; (ii) that J remains common independent after an 
augmentation; (iii) that an augmentation does not violate the properties required of 
(c',c?). It is straightforward to check (i), using the fact that the d, satisfy d, + Ww 2 
d,. Notice that (ii) is not obvious, since the dipath may have chords (but not zero- 
cost ones). One actually shows that the subgraph induced by the zero-cost arcs 
is G(M;, Mj,J) for new matroids Mj, M} for which J is common independent, 
and every common independent set is also independent in both M, and M2. Then 
the result follows from the validity of the unweighted MIA. To define M. I let p > 
Pp, >-++ > p, be the distinct values of c! that occur in J. Let Ty = 0, let T; = {e € S: 
ch > pi} for 1 <i < k—1, and let 7, = o(J). Where N; = (M/T;)\Tin1,0 <i Sk, 
put M; = No ®---@ Nx. For (iii), it is easy to show that the change in J preserves 
the property that or = 0; to show that it also preserves w > 0, we use the fact that 
the latter condition is equivalent to J being c'-optimal of its cardinality in 4,, i =1 
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and 2. Consider i = 1, and let e,,,; be the second-from-last node of the augmenting 
path P. Then cl, =cj, soJ U {ém.1} is c'-optimal of cardinality |J| +1 in 4,. But 
c'(J') = c'(J U {emai }), since each element of J on P has the same c!-weight as 
the preceding node on P, and we are done. 

A good deal of work has been done on faster implementations of an MIA for 
special classes of matroids. See Gabow and Tarjan (1984), Brezovec et al. (1988), 
and Gabow and Xu (1989). 


4. Submodular functions and polymatroids 


Let f be a function defined on subsets of S, with values in R; f is submodular if 
f(A) +f(B) 2 f(AUB)+f(ANB) forall A, BCS. 

Some examples: 

Example 4.1. Let M be a matroid on S, and let f be the rank function of M. 


Example 4.2. Let G=(V,E) be a digraph, lets © V, let S=V\{s}, and let 
f(A) = |& (A)|. (Specifying s is not necessary here, but will be useful later.) 


Given a set function f on S we let P(f) denote the polyhedron {x € RS: x(A) < 
f(A) for all A CS}. A polymatroid is a polyhedron of the form P(f) where f is 
submodular and non-negative. It is said to be integral if f is integral. It follows 
from Theorem 2.5 that the polymatroid defined by f of Example 4.1 is the convex 
hull of characteristic vectors of independent sets of M@.In Example 4.2, if there 
exists in G a family of arc-disjoint directed paths, each beginning at s and ending 
in S, and we let x, denote the number of dipaths ending at v, then it is easy to 
see that x € P(f). It is a consequence of a form of Menger’s Theorem that every 
integer-valued element of P(f) arises in this way. 

The first result (Theorem 4.3) shows that our polymatroids do satisfy the central 
condition of the original geometric definition of Edmonds (1970): For any u € R¥, 
all maximal (with respect to component-wise order) vectors x € P with x < u, have 
the same component-sum. Another consequence of this result is a construction for 
matroids: The {0, 1}-valued vectors in an integral polymatroid are the characteristic 
vectors of the independent sets of a matroid, and Theorem 4.3 gives a formula for 
its rank function. In particular, the fact that r of Theorem 3.10 is a matroid rank 
function follows from applying this to f = 7;. 


Theorem 4.3. Let f be submodular on S, let uc RS, and let x be any maximal 
vector satisfying x < u,x € P(f). Then x(S) = mingcs(f(A) + u(A)). Moreover, if f 
and u are integer-valued and_x is required to be integer-valued, the conclusion is 
still satisfied. ‘ 


Proof. First we observe that, for any A C S and any x (maximal or not), we have 
x(S) = x(A)+x(A) < f(A) + u(A). Therefore, it will be enough to prove that there 
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is some A for which equality holds. Obviously, for each j € S, if x j #u,, then by 
the maximality of x, there is a set Aj C S with j € A; such that x(A;) = f(A;). Call 
such a set x-tight, or just tight. A fundamental fact is: 


Claim 4.4. The intersection and union of tight sets are also tight. 


Proof of Claim 4.4. If A,B are tight, we have 
xX(AUB)+X(ANB) < f(AUB)+f(ANB) 
< f(A) + f(B) 
= x(A) +x(B) 
=x(AUB)+x(ANB), 
so x(AUB) = f(AUB) and x(ANB)=f(ANB). O 


Now if we choose A to be the union of the A;, then A is tight and x(A) = u(A), 
so we are finished. The same proof applies to the integer-restricted version. 


It is a consequence of Theorem 4.3 that a greedy algorithm maximizes x(S) over 
P(f) (or more generally over {x: x € P(f), x <u}). The algorithm begins with 
x = 0, and for each j € S increases x; as much as possible subject to the restriction 
that x € P(f). Just as in the special case of matroid polytopes, we generalize to 
arbitrary weight-vectors by treating the elements in order of non-increasing weight. 


Polymatroid Greedy Algorithm (PGA). 
Order S = {e,...,€,} so that c,, 2--- See, POBCe,,, Bi BCe, 
x:=0 
For i=1 to mdo 
Choose x,, as large as possible so that x € P(f). 


Theorem 4.5. For any non-negative submodular function f on S and any c € RS, 
PGA optimizes c- x over P(f). Moreover, if f is integer-valued, the output of PGA 
is integer-valued. 


Where 7; = {e,,...,e;} and x is the output of PGA, one can apply Theorem 4.3 
to deduce that ¥(7;) = max(x(7;): x € P(f)). With this observation, Theorem 4.5 
can be proved in the same way as Theorem 2.2. Other results for matroid polytopes 
immediately generalize. These include the fact that the extreme points of P(f) are 
precisely the vectors that can be the output of PGA, the dual greedy algorithm, 
and the total dual integrality of the linear system for P(f). One also can prove a 
converse of Theorem 4.5 similar to Theorem 2.3, characterizing polymatroids as 
the compact subsets of R* that are closed below and for which a greedy algorithm 
always works. 

Call a function f a polymatroid function if f is submodular, normalized (f(@) = 
0), and monotone (f(A) > f(B) if A D B). For an arbitrary submodular function f, 
the function f’ defined by f’{A) = min(f{(B): B D A), is obviously monotone, and 
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is easily seen to be submodular. Notice that, for the f of Example 4.2, f(A) is the 
maximum number of edge-disjoint directed paths beginning at s and ending in A. 

More generally, applying Theorem 4.3 with u; —0,j ¢A and u; large, j € A, 
yields 


f'(A) = max(x(A): x € P(f)), for A #0. (4.6) 
It follows that P(f) C P(f’). But f’ < f by definition, so P(f) = P(f’). Notice also 


that if f is a polymatroid function, then f = f’ and PGA reduces to a formula 
(which works in the slightly more general case when f is non-negative and mono- 


tone); namely, the output x of PGA satisfies 


Xe = 4 F(T) -fTi-), 2Ki gm (4.7) 
0, i>m,. 


It is easy to derive from these observations the following result of Edmonds (1970). 


Theorem 4.8. Every polymatroid is determined by a (unique) polymatroid func- 
tion. 


4.1. Submodular function minimization 


We have seen from (4.7) that when f is non-negative and monotone, the greedy 
algorithm is especially simple. Its efficiency depends on the ease with which we 
can obtain function values, and the size (number of digits) of the values. There 
is no difficulty with the first of these, since we assume that the function is given 
via an evaluation oracle. On the other hand, the maximum size of function values 
must, like n = |S|, be treated as a measure of input size. (We must do the same 
for the element weights c;.) With these ground rules, the greedy algorithm can be 
regarded as a polynomial-time oracle algorithm, when f is monotone. 

On the other hand, if no additional assumption on f is made, computing compo- 
nent e of x in the greedy algorithm requires minimizing f(A) — x(A) over subsets 
A containing e, a problem easily seen to be equivalent to that of finding the min- 
imum of an arbitrary submodular function. This latter problem is fundamental; 
it includes as special cases both the minimum cut problem and, by Theorem 3.9, 
the problem of finding the maximum size of a partitionable set. Using some of 
the above results, we shall show that the minimum of a submodular function can 
be well-characterized. (The obverse problem of maximizing a submodular func- 
tion, on the other hand, includes NP-hard special cases, and can be easily proved 
intractable in the oracle context.) 

It is useful to reduce the general problem of submodular function minimization 
to that of minimizing a function of the form f(A) — u(A), where f is a polymatroid 
function and u € R$. Let g be a submodular function on S and let u; = g(S\{j}) - 
a(S), 7 ¢S. If u; <0, it is easy to see that no minimizer of g will include j, so 
the problem could be restricted to subsets of S\{j}. Hence we may assume that 
u > 0. The function f defined by f(A) = g(A) + u(A) — g(9) is easily shown to be 
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a polymatroid function. Hence minimizing g is equivalent to minimizing f(A) + 
u(A), since this differs from g by the constant u(S) — g(@). Then Theorem 43 
characterizes the minimum. That this is a useful characterization is not completely 
obvious, since the maximizing x must be certifiably in P(f). But x can be expressed 
as a convex combination of at most n +1 extreme points of P(f), by a standard 
result in polyhedral theory, and these can be generated by PGA, since f is a 
polymatroid function. 

In fact, a minimization algorithm can be based on the above ideas: Maintain 
x € P(f) with x < u explicitly as a convex combination of extreme points of P(f), 
and at each step either find A giving equality in Theorem 4.3, or find a new x with 
x(S) larger. This combinatorial approach was first developed (Cunningham 1984) 
for the special case in which f is a matroid rank function. The resulting algorithm, 
which can be viewed as a generalization of the matroid partition algorithm, runs 
in polynomial time. For general f, a finite algorithm occurs in Bixby et al. (1985), 
and it was modified to run in “pseudo-polynomial” time (Cunningham 1985). 

Gr6tschel et al. (1981) did find a polynomial-time algorithm for submodular 
function minimization. It is based on the equivalence, via the ellipsoid method, of 
the optimization and separation problems for polyhedra. (See chapter 30 by Schri- 
jver.) Since we wish to minimize f(A) — u(A), it is enough to be able to determine, 
given K CR, either a set A C S for which f(A) — u(A) < K, or the information 
that no such A exists. (For then one could search over K for a sufficiently small 
K for which A does exist.) The function fx defined by fx(B) = f(B) — K is sub- 
modular and monotone, and such A cxists if and only if u ¢ P(f,). But this is the 
separation problem for P(fx), and since fx is monotone, the optimization problem 
for P(fx) is solvable, and we are done. The resulting algorithm, while theoreti- 
cally acceptable, is not computationally useful. An important open question is the 
existence of a polynomial-time combinatorial minimization algorithm. 

A set function is symmetric if f(A) = f(A) for all A CS. It is easy to see that 
a symmetric submodular function is minimized by 9 and by S. However, it is 
an interesting problem to minimize such a function over {A:6C Sc V}. This 
problem includes the “global minimum cut” problem for undirected graphs. It 
is solved by a combinatorial algorithm requiring O(|S|*) function evaluations in 
Queyranne (1994). 


4.2. Polymatroid intersection 


The Matroid Intersection Theorem (3.5) is a special case of a result of Edmonds 
(1970) on polymatroids. 


Theorem 4.9. Let f\, f2 be polymatroid functions on S. Then 
max (x(5) : x € P(fi) 1 P(f2)) = min (fi(A) + fo(S\A))- 


Moreover, if f,\,f2 are integer-valued, then the maximizing x can be chosen to be 
integer-valued. 
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We remark that, if f;, f2 are not required to be monotone, then by monotoniza- 
tion arguments the same result holds, except that the right-hand side becomes 
min({|(A) + fo(B) : AUB =S), The inductive proof of Theorem 3.5 outlined in 
section 2 generalizes to a proof of the integral version of Theorem 4.9. (The in- 
duction is now on &(min(f, ({/}), f({7})): 7 € S), and the appropriate analogue of 
contracting e € S is to form the function ff by f(A) = min(f;(A), fi(A U {e}) — 1). 
The non-integral version can be deduced from the integral version in a straight- 
forward way. Later we shall see other proofs and algorithmic aspects of (general- 
izations of) Theorem 4.9. 

The following sandwich theorem of Frank (1982) is a useful and attractive re- 
statement of Theorem 4.9. Its resemblance to classical results on separation of 
convex and concave functions is evident; we shall see that the relationship is more 
than an analogy. A set function f is supermodular if —f is submodular, and is 
modutar if it is both sub- and supermodiular. It is easy to see that a function m is 
modular on subsets of S if and only if m(A) = x(A)+k for some x € RS,K ER. 


Theorem 4.10. Let g,h be defined on subsets of S such that g is supermodular, h is 
submodular, and g < h. Then there exists a modular function m satisfying g <m <h. 
Moreover, if f and g are integer-valued, then m may be chosen integer-valued. 


To derive Theorem 4.10 from Theorem 4.9 one proceeds as follows. Add a 
constant to g and # to make g(@) = 0. Lower h(@) to 0. Raise g(S) to A(S). Add a 
function p of the form p(A) = M |A| to make f and g monotone. Now take f, = kh, 
define fp by f2(A) = g(S) — g(S\A), apply Theorem 4.9 to find x € P(fi)N P(f) 
with x(S) = g(S), and define m by m(A) = x(A). A similar construction allows the 
derivation of Theorem 4.9 from Theorem 4.10. 


4.3. Optimization over the intersection of polymatroids 


The problem of optimizing a linear function over the intersection of two polyma- 
troids may be stated as a linear program: 


maximize c-x 

subject to x(A) <f\(A), ACS; 
(A) <f,(A), ACS; 
x,20, jeES. 


(4.11) 


The dual linear program is 
minimize J ~(f,(A)y), + fi(A)y4 2A CS) 
subject to Yun ty, :fE ACS) 3c; fES; (4.12) 
yaya 20, ACS. 


The main result (Edmonds 1970) on this topic may be stated as follows. 
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[Theorem 4.13, Jf f,, f: are integer-valued, then (4.11) has an optimal solution that 
s integer-valued. If ¢ is integer-valued, then (4.12) has an optimal solution that is 
nteger-valued. 


A number of important results are consequences of Theorem 4.13. For example, 
taking f\,f2 to be matroid rank functions, we can conclude that the intersection 
of two matroid polyhedra is a polyhedron with {0, 1}-valued extreme points, and 
thus derive the Matroid Intersection Polyhedron Theorem. A second consequence 
is the Polymatroid Intersection Theorem (4.9), obtained by taking each c; = 1 and 
observing that y', y2 can be required to take a very special form. 

A proof of Theorem 4.13 based on the theory of total dual integrality and total 
unimodularity can be found in chapter 30 by Schrijver. In the next section we treat 
a generalization of Theorem 4.13. 

It is worthwhile to identify the results that extend to the intersection of three 
(or more) polymatroids. It is possible to optimize any linear function over the 
intersection of three polymatroids, using the ellipsoid method, since the separation 
problem can be solved efficiently. However, the Integrality Theorem (4.13) does 
not generalize, and optimizing over the integral vectors in three polymatroids, even 
over common independent sets of three matroids, is difficult. Although we are not 
aware of a proof that this problem is unsolvable in the oracle context, it is well 
known to contain NP-hard problems. 


4.4. Some extensions 


It is frequently useful in applications to relax some of the assumptions in the def- 
inition of polymatroid. A first such variant is to drop non-negativity, considering 
QO(f) = {x € R®: x(A) < f(A) for all A C S}. For such submodular polyhedra The- 
orem 4.3 still holds with the same proof. In addition the greedy algorithm works 
(for any c € R*) with the same proof. Moreover, we do not need monotonicity for 
the formula (4.7) to be correct. 

This greedy algorithm for Q(f) motivates the definition (Lovdsz 1983) of an ex- 
tension of a set function. For c € RS let fc) denote max(c- x: x € Q(f)). It is easy 
to see (essentially from the proof of the greedy algorithm) that if f is submodular, 
f(c) can also be calculated as follows: c can be expressed (uniquely) as ear Aix" 
for Aj > O and 7; CS with T; > T, D+» D Tg. Then f(c) = *, Aif(T;). This can 
be taken as the definition of the extension of any set function, submodular or not, 
to a function on RS. Now we can make the connection between submodularity and 
convexity more explicit. 


Theorem 4,14, f is submodular on S if and only if f is convex on RS. 


This result, from Lovdsz (1983), has a straightforward proof. When combined 
with a standard result on separation of convex and concave functions, it implies 
the first part of Frank’s Theorem (4.10). 

Another useful extension is to allow a function to take value oo on some of 
the subsets of S. With this extension essentially all of the previous results still 


$72 R.E. Bixby and W.H. Cunningham 


obtain, with obvious exceptions caused by unboundedness of P(f). (It és true that 
if P(f) is bounded and non-empty, then it is a polymatroid.) This idea is often 
combined with another important extension, namely, requiring the submodular in- 
equality to hold only for certain pairs of sets. We say that f is intersecting (crossing) 
submodular if f(A) + f(B) > f(AUB)+ f(A NB) whenever ANB #AO(ANBFO 
and AUB ¢5S). (When considering such weaker notions, we sometimes refer to 
ordinary submodular functions as fully submodular.) Edmonds (1970) considered 
intersecting submodular functions and proved extensions of most of our earlier 
results. For example, the next result generalizes Theorem 4.3; the same idea un- 
derlies the proof. 


Theorem 4.15. Theorem 4.3 is true if f is intersecting submodular, with mingcs(f(A) 
+u(A)) replaced by min(Zf(A;) + u(UA;): @ 4 A; C S, A; pairwise disjoint). 


As for Theorem 4.3, we can conclude from Theorem 4.15 that if f is integer- 
valued and intersecting submodular, then the {0,1}-valued vectors in P(f) corre- 
spond to the independent sets of a matroid. A classical example of this construc- 
tion results in the forest matroid of a graph G(V,E). Here we take S = E and 
f(A) =|V(A)| —1 for A #0, with f(0) =0 

The following consequence of Theorem 4.15 is also useful. 


Theorem 4.16. Jf f is intersecting submodular on S and f' is defined by f'(A) = 
min(Xf(A;): A =UAi, 04 A; CS, A; pairwise disjoint), then f' is submodular on 
S, and Q(f’) = Q(f). 


There is also a construction that produces an intersecting submodular function 
beginning with a crossing submodular function; it is from Frank (1982) and also 
implicitly Fujishige (1984). 


Theorem 4.17. /f f is crossing submodular on S with f(S) finite, and f' is de- 
fined by f(A) = min(Zf(A;): A = UA,, A; C S, Aj pairwise disjoint), then f' is in- 
tersecting submodular; moreover, Q(f') MN {x € RS: x(S) = f(S)} = Q(f) N {x ERS: 
x(S) = f(S)}. 

Notice the essential difference between Theorems 4.16 and 4.17. If f is crossing 
submodular, it need not be true that Q(f) = Q(/”); in fact, O(f) need not be a 
submodular polyhedron. It is the base polyhedron B(f) = Q(f) N {x € RS: x(S) = 
F(S)} that is preserved. However, we can still construct a matroid from a crossing 
submodular function (Frank and Tardos 1984). 


Theorem 4.18. Lert f be integer-valued and crossing submodular on S and let 
keZ,. Then {B CS: |B) =k, x? € Q(f)}, if non-empty, is the basis family of 
a matroid. 


5. Submodular flows and other general models 


In this section we describe several more general models. We give considerable 
attention to submodular flows, a generalization of polymatroid intersection. In 
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particular, we describe the basic ideas behind solution algorithms. (This has not 
been done for polymatroid intersection.) We also report on the polymatroid match- 
ing problem, a submodular model that includes graph matching as a special case. 
Finally, we describe two of the many additional generalizations of matroids, delta- 
matroids and greedoids. 


5.1. Submodular flows: Models and applications 


The optimal submodular flow problem is a generalization of the problem of op- 
timizing over the intersection of two polymatroids, that keeps the important in- 
tegrality and algorithmic properties of the latter problem. In addition, it contains 
several other fundamental problems. There are a number of closely related mod- 
els, introduced under other names, and many of the important contributions have 
been made in these differing contexts. (Schrijver (1984) explains the connections 
between these models.] However, we define just two of the models, and state the 
main results in the language of one of them. 

First we describe the polymatroidal network flow model of Lawler and Martel 
(1982); it was introduced independently by Hassin (1982). Let G= (V, £) be a di- 
graph, let s,¢ be distinct elements of V, and for each u € V let f,, gy be polymatroid 
functions on 5 (v), &*(v). A feasible flow is a vector x = (x;: j € E) satisfying 


x(8" (v)) — x(6*(v)) =0, forv € V\{s,¢}; 
x(A) <f,(A), for allveV, all A C8 (); 
x(A) <g (A), for allu eV, all A C &*(v); 
x; 20, forall jeE. 


If we R® and we take f,(A) =u(A), gy(A) = u(A), then the feasible flows are 
the feasible flows of an ordinary (single-source, single-sink) flow network. If we 
take V = {s,t} and allow g,, f, to be arbitrary polymatroid functions, the feasible 
flows are the elements of P(g.) M P(f,). 

The model for which we shall present the main results is one introduced by 
Edmonds and Giles (1977). Let G = (V, E) be a directed graph, let b be a crossing 
submodular function on V, and let £,u,c € R&. We allow values of u and 6 to be 
oo, and we allow values of @ to be —oo. The optimal submodular flow problem is 


maximize Slejx;: J €£) 
subject to x(5°(A)) — x(6*(A)) < D(A), ACV; (5.1) 
bp SX QU, JCE. 
We shall use the term feasible flow to refer to a vector x satisfying the con- 
straints of (5.1). The special case where b is identically 0, is the well-known optimal 
circulation problem of network flow theory. Let us also show that the polymatroid 


intersection problem (4.11) can be cast in this form. For each j € S, let j, j. be 
copies of j, let A; denote {j;: j € A} for any A C S, and let G = (V,E) be defined 
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by V = S| US, and EF = S with j = (1, j2) for each j € S. Define £; = 0, u; = 00 for 
each j. Define b(A2) = fi(A) and b((S;\A1) U S2) = fa(A) for all A CS, b(A) = 00 
for all other A C V. The resulting (5.1) is exactly (4.11). We encourage the reader 
to check that 6 is crossing submodular. 

The main integrality result for (5.1) is due to Edmonds and Giles (1977). 


Theorem 5.2. If £,u,b are integer-valued and (5.1) has an optimal solution, then 
it has one that is integer-valued. If c is integer-valued and the dual of (5.1) has an 
optimal solution, then it has one that is integer-valued. 


The proof of Theorem 5.2 uses an idea similar to that of (4.13) in chapter 30 by 
Schrijver: the dual of (5.1) has an optimal solution whose non-zero components 
form an optimal solution to a linear program having a totally unimodular constraint 
matrix. Such an optimal solution is obtained from any optimal solution y by suc- 
cessive “uncrossings”, that is, given sets A,B with An B, A\B, B\A, V\(AUB) 
all non-empty and y,4,yg > 0, decreasing y4, yg by ¢ = min(y,, yg) and increasing 
Yaus, Yang by é. 

As yet another illustration of the power of the submodular flow model, we 
show how the Lucchesi-Younger dicut-covering result (chapter 2 by Frank) can 
be derived from Theorem 5.2. Given G=(V,E), we put 2; =0, cj =—1, and 
u; = oo for each j € E. For each A C V such that 6 (A) = 0 and 0 # A # V we put 
b(A) = —1, and b(A) = on for all other A. Then it is easy to check that an optimal 
integer-valued feasible flow is the characteristic vector of a minimum-cardinality 
cover of directed cuts, and an optimal integer-valued solution of the dual problem 
picks out a collection of arc-disjoint directed cuts. Hence the Lucchesi- Younger 
result follows from Theorem 5.2 and the linear programming duality theorem. 


5.2. Submodular flow algorithms 


Suppose we are given x = (x;: j € E) and want to determine whether «x is a feasible 
flow. Then it will be enough to be able 10 minimize g(A) = b(A) — x(8"(A)) + 
x(6*(A)) over A C V. Since g is a (crossing) submodular function, there exists a 
polynomial-time (ellipsoid) algorithm to minimize it. Hence by the equivalence of 
separation and optimization, there exists a polynomial-time algorithm for (5.1). The 
resulting algorithm uses the ellipsoid method on two different levels. The search 
for better algorithms has succeeded in decreasing this reliance on the ellipsoid 
method. We shall outline an efficient combinatorial algorithm for (5.1), assuming 
the availability of a subroutine for minimizing a submodular function. It follows 
that, if an efficient combinatorial algorithm for the latter problem is discovered, 
then (5.1) is also solved in a satisfactory way. In addition, there exist instances 
of (5.1) for which the submodular functions arising can be minimized by efficient 
combinatorial algorithms. An example is the problem of re-orienting the arcs of a ~ 
digraph at minimum cost so as to make it k-arc-diconnected (Frank 1982). 
Notice that x is a feasible flow if and only if €<x<wu and Bx € Qo(b) = 
Q(b) nm {z € RY: z(V) = 0}, where B is an appropriately defined matrix. (Recall 
that Q(b) = {x € RS: x(A) < b(A), A C S}.) The results (4.16) and (4.17) tell us 
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that Qo(b) = Qo(b’) for some fully submodular function b’, since there is no harm 
in assuming that b(V) = 0. Hence the feasible flows remain the same when 5 is re- 
placed by b’; this is a result of Fujishige (1984). In addition the submodular oracle 
that is used has the same output for b and for b’. Therefore, we may pretend that 
b is fully submodular even if it is not. (There are two exceptions to this statement; 
for purposes of this exposition we ignore them.) 


5.3. Submodular flow algorithms: Maximum flows and consistent BFS 


The maximum (submodular) flow problem is to find a feasible flow that maximizes 
x, for a fixed arc f € E. (Notice that it is an equivalent problem to minimize x,, 
since f’s direction could be reversed.) This is a special case of (5.1), and includes as 
special cases the ordinary network maximum-flow problem (chapter 2 by Frank), 
and the problem of finding a maximum component-sum vector in the intersection 
of two polymatroids. (The first is easy to see; the second requires modifying the 
previously-described submodular flow representation of polymatroid intersection.) 
The shortest augmenting path technique of maximum flow theory (see chapter 
2 by Frank) generalizes to this context, but we also need an important further 
refinement of breadth-first search. 

The algorithm for the maximum flow problem generalizes the cardinality Ma- 
troid Intersection Algorithm, as well as the usual network maximum-flow algo- 
rithm. To motivate the augmentation used, we first describe two special cases. 
Suppose that we find a circuit in G including f and such that x, < u, for all arcs 
having the same orientation as f and x, > &, for all arcs having opposite orientation 
to f. Then x, could be increased by sending flow around the circuit, increasing x. 
by « for arcs of the first kind and decreasing x, by ¢ for arcs of the second kind; ¢ 
must not exceed 4, — x, for any arc of the first kind, or x, — @ for any arc of the 
second kind. Next, suppose that we have a path with the same properties, say from 
q to p, and we attempt to send flow along the path. Now there is an additional 
limitation on «; for the new flow to satisfy the constraints of (5.1), ¢ cannot ex- 
ceed £,(p,q), defined to be min(b(A) — x(8" (A)) + x(8*(A)): p € A,g ¢ A). This 
limitation could be represented as an upper bound for the flow on a fictitious arc 
(p, q). An actual augmentation in the algorithm will consist of a sequence of aug- 
mentations on paths linked into a circuit by the addition of fictitious arcs, and is 
best described via an auxiliary digraph. 

Given a feasible flow x, let G’ = G'(G, b,£,u,x) have node-set V and: 


for each e = (p,q) € E with x, < u,, an arc (p,q) with capacity u, — Xe; 

for each e = (p,q) € E with x, > @,, an are (q,p) with capacity x, — &; 

for each p,q € V with e,(p,q) > 0, an arc (p,q) with capacity «, (p,q). 
We call the arcs forward, backward, and jumping, respectively. Suppose that f = 
(t,s). A dipath P in G’ from s to ¢ together with (t,s) yields a directed circuit C 


in G’. Let ¢ be the minimum capacity of its arcs. The augmentation corresponding 
to P increases x, by ¢ if a forward arc of C arises from e, and decreases x, by 
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if a backward arc of C arises from e. The next lemma occurs in Frank (1984a); 
similar results for other models can be found in Fujishige (1978), Hassin (1982), 
Lawler and Martel (1982), and Schénsleben (1980). 


Lemma 5.3. /f P is a chordless (s,t)-dipath in G’, then the augmentation corre- 
sponding to P results in a feasible flow. 


On the other hand, if there is no (s,1)-dipath in G’, then x; is maximum, and 
the algorithm may terminate. To see this we observe that in this case there is a 
set ACV with se A, t¢A such that x, =, for all e € 5*(A), x = & for all 
e € 5-(A)\{f}, and for every p€ A, q ¢ A there is a tight set containing p and 
not q. [A set is tight if its inequality in (5.1) holds with equality.) Since we may 
assume that b is fully submodular, it is easy to prove that the intersection and 
union of tight sets is tight. It follows that A is tight. Therefore, 


x = B(A) — £(8-(A)\{F}) + u(6"(A)). 


But obviously no feasible flow can have x; exceeding this, so x, is maximum. The 
resulting min-max theorem is the following. 


Theorem 5.4. [f there is a maximum flow and b is fully submodular, then 
max(x,: x a feasible flow) 
= min(u,;, min(b(A) — (8 (A)\{f}) + u(6*(A)): s € A C V\{e})). 


There is a more general version of Theorem 5.4 for crossing submodular func- 
tions. It can be derived from Theorem 5.4 by applying Theorems 4.16 and 4.17. 
In addition, feasibility can be tested by applying the maximum submodular flow 
algorithm to a certain auxiliary problem and so the following feasibility charac- 
terization is also a consequence. Again a more general version (Frank (1984a)) is 
available. 


Theorem 5.5. /f b is fully submodular then there exists a feasible flow if and only 
if, for all A CV, €(8*(A)) — u(6-(A)) < B(A). 


Now we discuss the efficiency of the algorithm. First, we observe that if b,u, @ and 
the initial x are integer-valued, then x remains integer-valued, and the algorithm 
terminates (assuming there exists a maximum flow). However, we would like to 
have a bound on the number of augmentations that is not so dependent on the 
size and form of the input numbers. A similar difficulty arises in ordinary network 
flows, where the classical solution is to find shortest augmenting paths, found by 
“breadth-first search”: scanning nodes in the order in which they are labelled. The 
analysis of that method is based on the following facts. For a flow x, let k(x) denote 
the length of a shortest augmenting path with respect to x, and let E(x) denote the 
set of arcs contained in some shortest augmenting path with respect to x. Suppose 
that an augmentation replaces flow x by flow x’. Then: 

(a) k(x’) 2 k(x); 
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(b) if K(x’) = k(x), then E(x’) Cc E(x). 

It follows from (a) that the computation is divided into at most n = |V| stages 
and it follows from (b) that each stage takes at most n? augmentations. In the more 
general situation of submodular flows, (a) still holds but (b) fails. However, careful 
examination of how it can fail, leads to a further refinement. In addition to scanning 
nodes in the order labelled, we label nodes (from a node being scanned) in an order 
consistent with a fixed ordering of V. The resulting path has a node-sequence that is 
lexicographically least among node-sequences of shortest augmenting paths. This 
important technique was introduced independently by Schénsleben (1980) and 
Lawler and Martel (1982), who used it in contexts similar to the present one. 
Cunningham (1984) used it in his algorithm for testing membership in matroid 
polyhedra. He also labelled the technique “consistent breadth-first search”, and 
described its essential properties in a context-free way. For the submodular flow 
model we are using, the following result is due to Frank (1984a). 


Theorem 5.6. Jf consistent breadth-first search is used, the maximum flow algo- 
rithm terminates after O(n*) augmentations. 


5.4. Submodular flow algorithms: Optimization 


Recall that the weighted MIA used the following three ideas: (i) simpler optimal- 
ity conditions (using weight-splitting) than those coming from a straightforward 
usc of linear programming duality; (ii) use of the same auxiliary digraph as the 
unweighted algorithm, except that arcs were assigned costs; (ili) use of a least-cost 
dipath computation to update the dual solution, followed by use of the unweighted 
algorithm on arcs of cost zero. These three ideas will be used in extending the work 
of the last section to an algorithm for the optimal submodular flow problem. 

The first important idea, suggested by Frank (1982), is that an optimal dual 
solution for (5.1) can be represented by a vector of potentials m,, v € V. Given 
such a vector 7 and an arc e = (p,q) € E, C, denotes Ce + tp — Ty. 


Theorem 5.7. Suppose that x is a feasible flow and 1,x satisfy 
(a) if ce >O then x, = ue, fore € E; 
(b) if <0 then x, = &, fore € E; 
(c) if t) > mq, then &,(p,q) =O, for p,q eV. 

Then x is optimal. 


This result is easily proved, by showing that linear-programming optimality con- 
ditions are satisfied by x and the dual solution (y,f,g) constructed from a as 
follows. Let 7 < a <---< m be the distinct values of 7, and let A; denote 
{ve V: m >a},1<i<k. Define y, to be a; — 7, if A = A; and to be 0 oth- 
erwise. Define f, to be max (¢.,0) and g, to be max (—¢,0),e¢€ E. (The dual 
variable f, corresponds to the constraint x. < up; ge corresponds to —x, < —é.) 
Notice that (y, f,g) is integer-valued if 7 is. 

The algorithm maintains a feasible flow x and a potential a satisfying Theorem 
5.7(c). Let f =(t,s) be an arc violating condition (a); the other case is similar. 
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It is convenient to assume that f is the only arc violating (a) or (b). This can 
be accomplished by temporarily changing the appropriate bound of any other 
offending arc e to x. Now we form the auxiliary digraph G’ of the last section and 
assign arc-costs as follows: A forward arc (p,q) has cost Wp, = —C.; a backward 
arc (q,p) has cost Wg, = Ce; a jumping arc (p,q) has cost 7, — ap. Now each arc 
of G' (except the forward arc (¢,s), which we delete) has non-negative cost, and 
indeed, this is equivalent to the condition that (a)-(c) are violated only by f. 

Next we use an O(n’) shortest-path algorithm to find a least-cost dipath in 
G' from s to uv for each v € V; let d, be its cost. For v € V we replace 7m, by 
m, —min(d,,C,). It is quite easy to check that the arc-costs remain non-negative. 
Now either ¢; = 0, in which case we have ended f’s violation of the optimality 
conditions, or ¢, > 0, in which case there exists in G’ an (s,r)-dipath consisting 
of arcs having weight 0. In this case we use consistent breadth-first search to find 
such a dipath and perform an augmentation. [It is possible that this dipath yields a 
directed circuit of G’ having no finite capacity; in this case (5.1) is unbounded, and 
the algorithm terminates.}] That such an augmentation preserves (a)-(c) is obvious. 
That it delivers a feasible flow can be proved as for the weighted MIA, by showing 
that it is an augmentation of the maximum submodular flow algorithm applied to 
a more restricted submodular flow problem; see Cunningham and Frank (1985). 
From this observation it follows also that there will be at most O(7°) augmentations 
before a changes again. The number of potential changes can be shown to be finite, 
and better bounds hold when c is nice. Refinements based on scaling c lead to a 
polynomial bound (Cunningham and Frank 1985) and to a bound polynomial in n 
alone (Fujishige et al. 1989) for the number of augmentations. 


5.5. Polymatroid matching 


An important common generalization of graph matching and matroid intersection 
may be formulated as a problem on 2-polymatroids: polymatroids whose polyma- 
troid function f is integer-valued and satisfies f({j}) < 2 for each j € S. A matching 
isaset J C § such that f(/) = 2 |J|. Ordinary matching on a graph G = (V, E) arises 
when we take S = E and f(A) =|V(A)| for A C S. Matroid intersection for ma- 
troids M,, Mz on S arises when we take f =r, +72. Historically the first common 
generalizations of these two important problems were equivalent models called 
matroid parity (Lawler), matchoids (Edmonds), and matroid matching. The latter 
is the case of polymatroid matching in which we are given a graph G = (V,£) 
and a matroid M on V and take S = E with f(A) = r(V(A)). However, the max- 
imum matching problem, even in this special case, is intractable from the oracle 
viewpoint (Lovdsz (1980b), Korte and Jensen 1982), and also contains NP-hard 
problems. 

Nevertheless, a great deal of progress has been made on the 2-polymatroid 
matching problem, mainly due to the work of Lovdsz. He has described (Lovdsz 
1980b) general reduction techniques for computing a maximum matching. For 
several important special classes these lead to efficient algorithms and min-max 
results. Among the applications are finding a maximum-cardinality forest in a 3- 
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uniform hypergraph, and finding a maximum family of openly disjoint A-paths in 
a graph. (See chapter 2 by Frank for more on the latter problem.) 

The most important special case is that of linear 2-polymatroids. Here S is a set of 
lines (equivalently, pairs of points) in a vector space, and f(A) is ar(J{e: e € A}), 
where ar denotes affine rank. Graph matching and matroid intersection for two 
matroids linearly represented over the same field are both special cases, and their 
min-max formulas are generalized by the next result from Lovdsz (1980a). 


Theorem 5.8. The maximum size of a matching in a linear 2-polymatroid is 


k 
minar(Ag) + ai Ay, 


i=} 


where the minimum is over sets Ap, A\,---,A, Of points of the space such that, for 
every e € S, either ar(Ayg Ue) < ar(Ag) + 1 on, for some i, ar(A; Ue) = ar(A,). 


Lovasz has also given a (complex) polynomial-time algorithm for linear 2- 
polymatroid matching. Gabow and Stallmann (1986) contains a different, much 
more efficient, algorithm. Its running time is O(n*), where n = |S}, surprisingly 
close to the best bound known for the special case of linear matroid intersection. 
An outstanding question is the solvability of the corresponding weighted problem, 
about which little is known. 


5.6. Delta-matroids and bisubmodular polyhedra 


Many of the optimizational properties of matroids are preserved in an interest- 
ing generalization introduced under various names by Dress and Havel (1986), 
Bouchet (1987), and Chandrasekaran and Kabadi (1988). A delta-matroid is a pair 
(S,¥) where S is a finite set and # is a family of subsets of S, called the feasible 
sets, satisfying the following symmetric exchange axiom: 


If Fi, Fo €F¥ anda c F/AFy, there exists 6 € Fy) AF) such that F; A{a,b} € #. 
(5.9) 


(Here A denotes symmetric difference.) Matroids, defined by their basis fami- 
lics, are precisely the delta-matroids in which the feasible sets all have the same 
cardinality. They form the fundamental examples, although the independent sets of 
a matroid also form the feasible sets of a delta-matroid. Other interesting examples 
are matching delta-matroids (S is the node-set of a graph and F CS is feasible if 
and only if it is the set of end-nodes of some matching), and linear delta-matroids 
(where Ass is a skew-symmetric (or symmetric) matrix over a field, F C S is fea- 
sible if and only if Ay, is non-singular). 

The maximum-weight feasible set problem can be solved by a type of greedy 
algorithm. This algorithm needs an oracle that, for disjoint subsets A,B of S, 
determines whether there is a feasible set F satisfying A C F C S\B. 
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Symmetric greedy algorithm for the maximum-weight feasible set problem. 
Order S = {e),€2,...,€n} so that |ce,| > |ce,| 2 +++ 2 |ce, 
For each i, let R; denote {¢;,.-.,én} 
J:=6 
For i = 1 ton do 
If c., 2 0 and there exists F € ¥ with JU {e,} CF CJUR; then J :=JU {e;} 
If c,, < 0 and there exists no F € ¥ withJ CF CJUR;,; then J :=JU {e;} 


Theorem 5.10. (5S, #) is a delta-matroid if and only if for any c € RS, the symmetric 
greedy algorithm finds a maximum-weight feasible set. 


Again the “if” part is an easy consequence of the definition. The other part 
can be proved by methods similar to the ones used for matroids. However, it is 
also possible to derive many of the results for delta-matroids from corresponding 
matroid facts. One useful technique is the following. (The reader should consider 
the effect of choosing N to be the set of negative-weight elements of S.) 


Proposition 5.11. (S,%) is a delta-matroid if and only if for every N CS, the 
maximal members of {F AN: F € ¥} are the bases of a matroid. 


We define a rank function f for a delta-matroid by f(A, B) = max(|F 0 A| — 
|F 1 B|: F € &) for subsets A, B of S. Clearly the characteristic vector of any 
feasible set, and therefore any convex combination of such vectors, satisfies the 
inequalities 


x(A) —x(B) <f(A,B),  A,BCS, ANB=%. (5.12) 


Theorem 5.13. The convex huil of characteristic vectors of feasible sets of a delta- 
matroid is precisely the set of solutions of (5.12). 


As for matroids, there is one property of the rank function that is the essential 
one for proving such results. Let f be a function defined on ordered pairs of disjoint 
subsets of S. We say that f is bisubmodular if it satisfies 


f(A, B) + f(A’, B’) > f(AN ABB’) 
+f((AUA)\(B UB’), (BUB)\(AUA)). (5.14) 


Kabadi and Chandrasekaran (1990), Nakamura (1988), and Qi (1988) have shown 

that for any bisubmodular f, the system (5.12) is totally dual integral. In particular, 

this implies that if f is also integer-valued, then every extreme point of the cor- 

responding bisubmodular polytope, that is, the solution set P(f) of (5.12), is also 

integer-valued. This result easily implies Theorem 5.13. The total dual integrality 
of (5.12) can again be proved by a greedy algorithm for maximizing c - x over the 

bisubmodular polyhedron, a generalization of the one for delta-matroids. In fact, 

this algorithm already appears in Dunstan and Welsh (1973). For P CRS, ACS, 

and x € R4, we say that xéP if there exists y ¢ P such that y; =x, for all j ¢ A. 
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Symmetric greedy algorithm for bisubmodular polyhedra. 
Order S = {e,€2,...,,} so that |ce,| > |ee,] > --- 2 |ce, | 


For each i, let R; denote {e;,...,&:} 

J:=0 

For i=1 ton do 
If ce, 2 0 choose x-, as large as possible so that (Xe,,...,%e, )EP 
If ce, < 0 choose x,, as small as possible so that (Xe,,...,%, )EP 


Dunstan and Welsh call a compact set P C RS greedy if the above version of 
the greedy algorithm maximizes c-x over P for every c € RS. To describe their 
characterization of greedy sets, we need to define some terms. For a point x € 
R° and a subset A CS, we denote by xAA the point obtained by replacing x; 
by its negative for each j € A. For a subset P of R°, we write PAA to denote 
{xAA: x € P}. Finally, the hereditary closure {y € RS: y < x for some x € P} of 
P is denoted by dn(P). The equivalence of (a) and (c) below is due to Dunstan 
and Welsh (1973); that of (a) and (b) is due to Kabadi and Chandrasekaran (1990) 
and Nakamura (1988). It is an interesting puzzle to understand why the recent 
interesting work on delta-matroids and bisubmodular polyhedra occurred so long 
after the initial work of Dunstan and Welsh. 


Theorem 5.15. For a polytope P CR’, the following statements are equivalent: 
(a) P is greedy; 
(b) P is a bisubmodular polyhedron; 
(c) for every A CS, dn(PAA) is a submodular polyhedron. 


The bad news about delta-matroids and bisubmodular polyhedra is the absence 
of an intersection theorem. In fact, the matroid matching problem is easily seen to 
be the intersection of a matroid with a matching delta-matroid, so the delta-matroid 
intersection problem is intractable. Frank (1984b) introduced a special class of 
bisubmodular polyhedra that is better behaved in this respect, but still includes 
polymatroids and base polyhedra. A generalized polymatroid is a bisubmodular 
polyhedron determined by a bisubmodular function f satisfying f(A, B) = g(A) — 
A(B), where g is submodular, / is supermodular, and they satisfy 


g(A) —h(B) <g(A\B)—h(B\A), A,BCS. (5.16) 


Actually, the definition of Frank is slightly more general. The paper of Frank and 
Tardos (1988) is an excellent reference, giving a wealth of results on generalized 
polymatroids and related topics. One result that helps to explain their good be- 
haviour is that every such polyhedron can be obtained by projection from a base 
polyhedron. The main integrality result, generalizing the Polymatroid Intersection 
Theorem 4.9, may be stated as follows. 


Theorem 5.17. The union of the defining systems of two generalized polymatroids 
in RS is totally dual integral. 
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5.7. Greedoids and independence systems 


A pair (S, #), where # is a family of subsets of S containing 9, is a matroid if it 
satisfies: 


ifACBe ¥,then Ae F; (5.18) 
ifA, B¢ ¥, |A| > |B], then there isa € A\B with BU{a}eF¥. (5.19) 


Requiring only one of (5.18), (5.19) yields two different generalizations. The first, 
of course, is independence systems. These objects seem to be too general to have 
an interesting theory. We do mention one result on the effectiveness of the greedy 
algorithm. For any A CS, let r*(A) denote max(|F|: A D F € ¥), and let r~(A) 
denote min(|F|: A> F €¥%, F maximal). Of course, an independence system 
is a matroid precisely when r*(A) =r (A) for all A CS. That is, the quantity 
min(r~(A)/r*(A): A CS, r*(A) > 0) =p(S, &) is 1 for matroids, and p(S, ¥) is 
a measure of how close (S, ¥) is to being a matroid. We know from Theorem 
2.2 that the maximum-weight independent set problem is solved optimally by the 
greedy algorithm when p(S, #) = 1. Jenkyns (1976), and independently Korte and 
Hausmann (1978), have proved more generally that the greedy algorithm works 
well if p(S, #) is not too small. 


Theorem 5.20. Let (S, ¥) be an independence system. For any c € RS the greedy 
algorithm delivers a feasible set of weight at least p(S, #)max(c(F): F € F). 


Now let us consider the objects obtained when we drop (5.18) and keep (5.19). 
The resulting structures arc called greedoids. They were introduced by Korte and 
Lovasz and, while very general, have a surprising amount of structure. (In fact, 
enough to justify a book; see Korte et al. (1991).) Here we mention a few of the 
connections with optimization. 

For a greedoid (S, #) a basis of (S, #) is a maximal feasible set. Given a weight 
function c such that c(F) € R for each F € ¥, we consider the problem of finding a 
basis F maximizing c(F’). We do nor assume that c is linear, that is, determined by 
c(F) = X(c;: j € F) for an element-weighting c € R°. We state a greedy algorithm 
for the maximum-weight basis problem. 


Greedy algorithm for a greedoid. 

J:=90 

While there exists e € S\J with J U {e} € ¥ do 
Choose such e with c(J U {e}) maximum 
J:=JU {e} 


Notice that this algorithm, for the case when c is linear and (S, ¥) is a matroid, - 
is equivalent to the greedy algorithm for finding a matroid basis of maximum 
weight. In that case the algorithm finds an optimal solution. The situation for 
greedoids is more complicated. We illustrate two of its aspects by considering a 
class of greedoids arising from a digraph G = (V, £) with a fixed node s € V. We 
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take § = E and take F to be the arc-sets of arborescences rooted at s. Notice that 
(5.19) is preserved under the truncation operation ¥* = {F € F: |F| <k}. Let 
T CV\{s} be a set of target nodes and assign (linear) weights of 1 to arcs having 
head in T and 0 to the others. By finding a maximum-weight basis of (S, ¥*), we 
can decide whether there is an s-rooted arborescence having at most k edges and 
including all the target nodes. This latter problem is a version of the “Steiner tree 
problem” and is NP-hard. So optimizing linear functions over greedoid bases is, 
in general, difficult. 

On the other hand there are interesting nonlinear functions that the greedy 
algorithm optimizes. For an example, consider again (S, ¥#) from above. Given 
d ERS and F € F, we let c(F) denote D evie(D jer, —dj) where P, is the arc-set 
of the unique dipath in F from s to v. Then a basis F of (S, #) maximizing c(F) 
provides least-cost dipaths (with respect to d) from s to vu for all v € V, and the 
greedy algorithm will find such a basis. (These facts are consequences of standard 
results on shortest path problems; see chapter 2 by Frank.) More generally, there is 
a large class of functions that the greedy algorithm optimizes over the bases of any 
greedoid. There is also a characterization of greedoids in terms of the functions 
that the greedy algorithm optimizes, and a characterization of the greedoids for 
which it optimizes every linear function. For these results and others, we refer the 
reader to Korte et al. (1991). 

Finally, we describe a generalization of the matroid intersection theorem to a 
larger class of greedoids. A distributive supermatroid (we do not attempt to explain 
the name) is a greedoid (S, ¥) together with a partial order on S§ satisfying 


if AC Be & and A is an ideal, then A € ¥. (5.21) 


Notice that every matroid satisfies (5.21), by taking the partial order to be trivial. It 
is a result of Tardos (1990) that, for two distributive supermatroids (S, ¥,), (S, ¥2) 
on the same set with the same partial order, the quantity max(|F|: F € #94) 
is well-characterized. If the partial orders are not assumed to agree, then the 
intersection problem can be shown to be intractable in general. We state a simple 
version of Tardos’s min-max theorem. This is a bit misleading since it involves 
quantities that are difficult to compute; her paper provides a more complicated 
formula that does not have this drawback. For i = 1 and 2 and A C5, let B;(A) 
denote max(|F N Aj: F € ¥). (Of course, if (S, ¥) is a matroid, then ; is just 
the rank function.) 


Theorem 5.22. Let (S, ¥,), (S, F:) be distributive supermatroids with the same 
partial order. Then max(|F|: F € ¥,M ¥) = min(fi (A) + fe(A): A CS). 


6. Matroid connectivity algorithms 
Connectivity is a fundamental structural property of matroids. Algorithmically 


this notion took on a central role with Seymour’s work on regular matroids. The 
connectivity-based decompositions applied there have since been further applied 
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and significantly extended by Seymour (1981a), Truemper (1986), and Truemper 
(1992). 

Let M be a matroid on S with rank function r. A partition {S,,52} of S is an 
m-separation of M for m > 1 if 


|Si| =m < |Sol, (6.1) 
and 
r(Si) + r(S2) — r(S) <m—1. (6.2) 


Define M to be k-connected for k > 2 if M has no m-separation for m < k (Tutte 
1966); in this case we say that M has connectivity k. 2-connected matroids are 
called connected or nonseparable. It is easy to see that {S,, 5S} is a 1-separation 
of M if and only if every circuit of M is either a subset of S, or a subset of Sp. 

The above definition is motivated by graph connectivity. Tutte proved that the 
polygon-matroid M (G) of a graph G is k-connected if and only if G is k-connected 
in the usual sense (that is, connected and remains so upon the deletion of any 
k —1 or fewer nodes), and either |S| < 2k —1 or G has no cycle of size less than 
k. To obtain an exact analogue of node k-connectivity in graphs we can replace 
the cardinality condition (6.1) by r(S,) 2m < r(S2). This definition, however, is 
not invariant under duality, whereas Tutte’s definition is: it is easy to show that 
r(S,) + r(S2) — r(S) =r*(S,) +7°(S2) — r*(S) for any pair {5,, 5}. 

The following table summarizes the best algorithms available for testing k- 
connectivity for various valucs of k. Note that for general k, the complexity does 
grow exponentially with |S|, as it inevitably must since computing the connectiv- 
ity of a matroid specified by an oracle is easily shown to be intractable (see the 
argument used for girth in section 2). 


Table 1 
Algorithms for testing &-connectivity 


k Complexity (oracle calls) Method 

2 O(|SI) Shifting 

3 O(\S|’) Bridges or shifting 

4  — O(\S|45 \/log |S]) Shifting 
>5 acst !) Matroid intersection 


6.1. Partial representations 


Partial representations (Truemper 1984) will be used in several places in this sec- 
tion. Let X be a basis of a matroid M. The partial representation, abbreviated PR, 
of M with respect to X is the {0,1}-matrix R with rows indexed on elements x ¢ X 
and columns indexed on elements y € Y = §\X such that the (x,y) entry is 1 if 


Matroid optimization and algorithms 585 


and only if x € C(X,y). If M is binary, (/,R) is an actual representing matrix for 
M over GF(2), where J is an identity matrix of appropriate dimension. 

For X’ C X and Y' CY, let R(X’, Y’) denote the submatrix of R with row index 
set X’ and column index set Y’. R(X’, Y') is then a PR of M\(Y\Y’)/(X\X’) with 
respect to X\X". Define the R-rank of R(X’, Y') by 


rk(R(X', ¥')) = r(¥"U (X\X")) — |X\X", 


where r is the usual matroid rank function for M. If rk(R(X’, Y’)) = |X"|= IY’, 
then R(X’, Y’) is called nonsingular. 


6.2. 2-Connectivity 


By definition, a matroid is disconnected (separable) if there is a partition {5,5} 
of S with S$; 49 4S, and r(S,)+r(S2) =r(S) (r(S1) +r(S2) > r(S) holds by sub- 
modularity). There is an easy algorithm for testing this condition using PRs. 

Let X be any basis of M and let R be a corresponding PR. R has a naturally 
associated bipartite graph G(R) in which there is one node for each row and 
column of R and one edge for each nonzero entry. 


Theorem 6.3. Let M be matroid, X a basis of M, R the PR corresponding to 
X and G(R) the associated bipartite graph. Then M is connected if and only if 
G(R) is connected. (Thus, M is connected if and only if R has no nontrivial block 
decomposition.) 


Proof. We prove one half of the theorem. Suppose that M is not connected, let 
{S,,S2} be a separator, and let X be a basis of M. Define X; = S$; X (i =1,2). 
Now X; C S; is independent and so 


|X| = r(S) = r(S1) + r(S2) > [Xi] + [Xo] = |X. 


It follows that X; is a basis of 5; ( = 1,2). Let Y; = S;\X;. We conclude that every 
fundamental circuit C(X,e) for e € Y; is contained in S;. That is, the submatrices 
R(X;,Y;), i# j, are identically zero. It follows that G(R) is not connected. 


Theorem 6.3 yields an algorithm for testing the connectivity of M, as follows. 
First construct a basis and the corresponding PR R. This step requires at most |S| 
+ r($)((S| — r(S)) < |S|? calls to an independence oracle. Now test the connectivity 
of the graph G(R). The time for this latter computation is dominated by that for 
the first, and so the total is O(|S|*) oracle calls. 


6.3. Computing minors containing a fixed element 


The relationship introduced in the last subsection between 2-connectivity and bi- 
partite graph connectivity is the basis for an algorithmic proof of the following 
result. This approach is typical of several proof techniques in the subject. It plays 
a fundamental role in Truemper (1992). 
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Theorem 6.4 (Seymour 1977a). If N and M are connected matroids, N is a minor 
of M, and e € S(M)\S(N), then there exists a connected minor N' of M such that 
N =N’\e or N =N'/e (these are actual equalities — no isomorphism is involved). 


Proof. Let M and N beas in the statement of the theorem. Let X’ be independent 
and Y’ coindependent in M such that N = M\Y’/X’. Take a PR R corresponding 
to a basis X such that X’C X CS\Y’. Let e € S(M)\S(N) = X’ UY’. By duality 
assume e € Y’, Now if column e of R has a nonzero in the rows of N, we are done: 
N’ = M\(Y'\{e})/X’ satisfies the requirements of the theorem (by Theorem 6.3). 
Otherwise, we find a chordless path in the bipartite graph G(R) from e to any 
element in N. Deleting the column elements not in this path, and contracting the 
row elements not in the path yields the following picture (or a slight variant), 
where all entries not in N and not on the path are zero: 


¢ 


Denote the set of elements corresponding to non-N columns in this picture by Y” 
and the set corresponding to non-N rows by X”. One can verify that the minor 
obtained by deleting X” and contracting Y”\{e} has the property asserted in the 
theorem. O 


Theorem 6.4 may be applied in the following way. Suppose that we know an 
excluded-minor characterization for some class of matroids and ask the question, 
whether one of the excluded minors, when it occurs, can be isolated to a part of 
the ground set of a given matroid, or must be present throughout that matroid. 
This question leads to the notion of 1-roundedness: We say that a subfamily ?’ of 
a family ¥ of connected matroids is 1-rounded if whenever M € # has a minor 
N isomorphic to a member of #’, then for every element e € S(M), there is a 
minor N’‘ of M such that N’ is isomorphic to a member of #’ and e € S(N’). An 
example is provided by the binary matroids. In that case we take ? to be the 
class of connected non-binary matroids, and P’ = {U2,4}, where U2, denotes a 
matroid on four elements every two of which form a basis. By Theorem 6.4, to 
prove that {U4} is 1-rounded, we need only prove that for every connected 1- 
element extension of U4, the new element in the extension is an element of a 
minor isomorphic to U24. This result together with Tutte’s theorem (Tutte (1965a)) 
that every non-binary matroid has a minor isomorphic to U2, implies a result of 
Bixby (1974): If M is a connected non-binary matroid, then for every element 
e € M, there is a minor of M isomorphic to U2,4 and containing e. Note that the 
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proof of Theorem 6.4 gives an algorithm to find such a minor, given any single U2, 
minor of M. This idea is used later in computing the special 3-separations needed 
for testing the regularity of general matroids not known to be binary. 

To use the algorithm suggested in the proof of Theorem 6.4 we need to know N 
in the form N = M\Y"/X", for disjoint sets X” and Y". In O(|S|) oracle calls we 
can find a maximal independent subset of X”, extend it to a maximal independent 
set in ¥”U S(N) and then further extend to a basis X of M. Let X'= XN(X"U 
Y”) and Y’=(X” UY")\X'. Then X’ is independent, Y’ is coindependent and 
N = M\Y'/X'. Now constructing the partial representation R corresponding to XY 
and finding the chordless path described in the above proof takes time O(|S|*). 
Finally, computing the necessary contractions and deletions is no more than the 
work for computing a PR for N. Thus, the complete algorithm runs in O(|S|?) 
oracle calls. 


6.4. 3-Connectivity and bridges 


Connectivity (i.e. 2-connectivity) is relatively easy to analyze. 3-connectivity is 
thus the first difficult instance of k-connectivity. It is particularly important in 
the theory of regular matroids and in decomposition theory in general. In this 
section we present a special-purpose recursive algorithm for testing 3-connectivity 
in connected matroids. Some of the ideas in this algorithm will be used later in a 
graph-realization algorithm. 

Define the elementary separators of a matroid to be the minimal nonempty sets 
S’ such that {S’,S\S'} is a 1-separation. Let M be a matroid, and let D be a 
cocircuit of M. The elementary separators of M\D are called the bridges of D. D 
is called a separating cocircuit if it has more than one bridge. The corresponding D- 
components are the matroids M /(S\(D UA)) where A is a bridge. The A-segments 
of D, for a bridge A, are the parallel classes of M/(S\(D UA)) in D. 

The above quantities are easily computed using PRs. Take a PR of M such 
that D is a fundamental cocircuit. Then D minus some single clement is a row 
of this matrix. Deleting this row and the incident columns gives a PR for M\D. 
The bridges of D can then be computed using the connectivity algorithm given 
by Theorem 6.3. The corresponding D-component for a particular bridge A is 
obtained by deleting the columns and rows not in D UA. The computation of the 
A-segments is then straightforward. 

One more definition is required. Say that two D-bridges A, and A: avoid if 
there are A;-segments Y; (i = 1,2) such that Y; U Y, = D. Define the bridge graph 
of D to have nodes corresponding to the bridges of D and an edge joining two 
nodes if and only if they do not avoid. 


Theorem 6.5 (Bixby and Cunningham 1979), Let M be a connected matroid that 
is both simple and cosimple, and let D be a cocircuit of M. Then 
(a) M has a2-separation {S,,S2} such that D C S, if and only if the matroid ob- 
tained from some D-component by identifying parallel elements has a 2-separation; 
(b) M has a 2-separation {S,,S_} such that D meets both S, and S> if and only 
if the bridge graph of D is not connected. 
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The above theorem suggests a recursive procedure for testing 3-connectivity, 
given a method for finding separating cocircuits. But this latter task is easy, because 
of the following lemma. 


Lemma 6.6. Let M be a simple, connected matroid, and let X be a basis of M. 
if every fundamental cocircuit with respect to X is nonseparating, then M is 3- 
connected. 


The algorithm based on the above results proceeds as follows. First construct 
a PR. Check its rows to see whether any is separating. If none is, the matroid is 
3-connected by Lemma 6.6; otherwise, take a separating cocircuit D and compute 
its D-components. Compute the corresponding bridge graph. By Theorem 6.5 we 
can determine whether M is 3-connected by examining the connectivity of the 
bridge graph and determining whether the D-components are 3-connected. Since 
the latter matroids are smaller than the original matroid, the suggested procedure 
can be applied recursively. An appropriate implementation gives a bound of |5|3 
oracle calls. 


6.5. k-Connectivity 


We describe two algorithms for testing k-connectivity for general k. The first is 
based on an elementary shifting idea in partial representations. For 2-connectivity, 
this algorithm reduces to the one given by Theorem 6.3. The second algorithm 
uses the cardinality Matroid Intersection Algorithm. 


6.5.1. A shifting algorithm 

This algorithm was suggested by work of Cunningham (1982) and Truemper (1985). 
To describe it we need an interpretation of m-separation in PRs. Let M be a 
matroid, and let R be a PR of M determined by a basis X. Let {S,,S.} be a 
partition of M and define X; = S; X and Y; =S;N Y (i = 1,2), where Y = S\X. 
Define R;; = R(X;, Y;). This situation is depicted below: 


Y; Y2 

x Ry Ry 
x2 Ro, Ray ; (6.7) 

PR for M 


Using the above notation, an equivalent definition of m-separation in terms of 
PRs is obtained from the computation: 


Tk(R21) + rk(Ry2) = r((X\X2) U Y)) — |X\ X2| 
+r((X\X1) U Yo) — [X\X1| 


=r(S|)+r(S$2) —r(S). (6.8) 
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Thus, a partition of the elements S of a matroid into two sets is an m-separation if 
and only if its blocks are sufficiently large and, in any partial representation R, the 
corresponding “off-diagonal” submatrices determined by this partition have total 
R-rank at most m — 1. 

We now present the shifting algorithm. It is convenient to assume that M is 
known to be m-connected (we could first apply the algorithm for smaller values 
of m). Consider a basis X and the corresponding PR R, and suppose R{, and R), 
are submatrices of R having disjoint row and column sets and total R-rank m— 1 
(m > 1). These matrices induce a partitioning of R, as illustrated below: 


Yi Y; 
x [Ri] LR | 
xy [Rn | 

PR for M 


We also assume that |X}U Y;| >m. The basic routine of the shifting algorithm 
determines whether R/,, R5, can be extended to a pair Riz, Ro; determining an 
m-separation as in (6.7). (We call Riz, Rp) a legal extension of R',, R},.) It consists 
of the following two operations. 


Row shifting: Given x ¢ X such that rk(R(X}U {x}, Y})) > rk(R),), set X} = 
XU {x}. 


Column shifting: Given y € Y such that rk(R(X}, Y> U {y})) > rk(R),), set Yo = 
Yi U{y}. 


It is easy to see that these operations are valid, in the sense that any le- 
gal extension R27, R2 of Rj, R4, must also be an extension of the new Ri, 
R’,,. Now suppose that we repeatedly apply these operations. If rk(R{,) +rk(R},) 
ever increases, then we can stop; no legal extension exists. On the other hand, 
if rk(R},) + rk(R5,) =m —1, but no shifting operation is possible, then X; = Xi 
X, = X,U(X\X1), Yi =Yj, Yo = YZU(Y\Y/) defines a legal extension, unless 
|X, U Y3] < m, in which case no legal extension exists. 

Next we explain how to use the basic routine to test M for the existence of an 

_m-separation. For matrices R,2, R2, as in (6.7) and determining an m-separation 
of M, there exist square non-singular submatrices P of one and Q of the other 
whose total R-rank is m — 1. Suppose that we are given P and Q, and we want to 
find Rj2, Rz;. Let z € Y such that z does not index a column of P or of Q. We run 
the basic routine twice. First we initialize R}, to be P and Rj, to be Q with column 
2 appended, and second we initialize R}, to be Q and Rj, to be P with column z 
appended. (We are taking advantage of the fact that we may assume z indexes a 
column of R},.) The shifting algorithm applies this procedure for all choices of P, 


Q. 
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6.5.2. Complexity of shifting 

For m = 1, testing 2-connectivity, the shifting algorithm reduces to the algorithm 
given by Theorem 6.3. First, since m—1=0, the matrices P, Q must be 0 x 0 
matrices, implying there is only one pair to consider. There is also only one case 
to consider for the special element z — being appended to P is now equivalent 
to being appended to Q. Finally, the shifting procedure is equivalent to using a 
standard graph algorithm to compute the component containing z in G(R). 

In general, the shifting algorithm can be shown to have a complexity of O(|S|””) 
oracle calls for computing m-separations. This involves an O(|S|?) implementation 
of the basic routine for a given pair P, Q and the observation that there are at 
most O(|S|?~*) pairs of nonsingular matrices with total R-rank m—1. A defect 
in this approach is that, for general m, we do not know any device to decrease 
the estimate of the number of pairs considered. Such a device is known for the 
matroid intersection approach given in the next subsection. 

In a quite different setting, Cunningham (1982) suggested the following idea, 
which can be applied to the special case m = 2. Assume M is connected and select 
a spanning tree of G(R). Since m — 1 = 1, il follows that one of the matrices Ri, 
R>, in (6.7) must be a zero matrix, so that the other must then contain one of the 
elements from the spanning tree. Using this observation, we see that only |S|—1 
pairs P, Q must be considered, yielding an overall! bound of O(|S|*) oracle calls, 
the same as that derived from Theorem 6.5. For the binary case, this bound can be 
further improved to O(|S|*) running time using a graph decomposition algorithm 
of Spinrad (1989). 

Finally, an improved bound is also known for testing 4-connectivity. The details 
are too involved to present here, but using a graph-theory lemma of Szegedy, 
Rajan (1986) has shown how to reduce the bound to O(|S{** log |S|) oracle calls. 


6.5.3. An algorithm using matroid intersection 

This algorithm is due to Cunningham and Edmonds; see Cunningham (1973) for an 
early version. Let M be a matroid on S and suppose that we wish to test whether 
M has an m-separation for some m > 1. To do this it suffices to test for each pair 
of disjoint sets U,, U2, both of cardinality m, whether there exists an m-separation 
{V\,V2} such that U; C V;. For a particular choice of U, and U2, define matroids 
M, = M/U,\U, and M, = M/U,\U;. These are matroids on S’ = S\(U, U U2), 
and for any partition {V/, V5} of S’ we have 


Lal (Vi) +1(V>) = r(Vi U U,) - r(U,) + r(V5 U U2) _ r(U>). 


Since r(U,) + r(U2) is a constant, minimizing this quantity over partitions {V{,V3} 
of S’ is equivalent to minimizing r(V,) + r(V2) over partitions {V;, V2} of S such 
that U; C V; (¢é = 1,2). Thus, using Theorem 3.5 and the Matroid Intersection Al- 
gorithm we can determine whether a given pair {U,, U2} induces an m-separation. 
It follows that with O(|S|””) applications of the matroid intersection algorithm we 
can test (mm + 1)-connectivity. 

This bound can be significantly improved using the following observation. Fix 
some set Q C S with |Q| =m. Fix a partition {Q,,Q2} of Q. There are O(|S|”) 
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ways to complete this partition to a pair of disjoint sets {U,, U2} such that |U;| = 
|U,| =m. Since there are 2” partitions of Q into two sets, and this number is a 
constant relative to |S|, we see that O(|S|”) applications of matroid intersection will 
do. Since matroid intersection takes times O(|S|?>) we obtain an overall bound 
of O(|S|"*2>) oracle calls. By taking into account the similarity of the matroid 
intersection instances being solved, the bound can be reduced to O(|S|*?) oracle 
calls, and in the linear case, to O(|S|”*?) total work. 


6.5.4. Menger’s Theorem for matroids 
Tutte has generalized Menger’s Theorem to matroids (Tutte 1965b). Let M = 
M(G) for a graph G, and let S be the edge-set of G. Pick two nodes u,v of G, let P 
and Q be the stars of these nodes and assume that P M Q = 0. Menger’s Theorem 
for graphs (see chapter 2 by Frank) asserts that the minimum number of nodes, 
distinct from u and v, the deletion of which separates u and v equals the maxi- 
mum number of internally node-disjoint paths joining them. If G is 2-connected, 
this minimum can be expressed in matroid terms as 

pani, 976A) +r(S\A) - r(S) +1. 
To express the maximum imagine that we have found a family of m node-disjoint 
paths joining u and v. Deleting all edges not on these paths, and contracting all 
remaining edges other than those in P and Q, yields a graph in which u and v are 
still joined by mm paths. This minor of G corresponds to a minor M’ of M, and for 
this minor we have m = r'(P)+7'(Q) —r'(P UQ) +1, where r’ is the rank function 
of M’. Now it is easy to prove that for any such minor, 7/(P) +7/(Q) —r'(PUQ)+1 
is no bigger than the minimum above. Tutte proved that equality can always be 
achieved. 


Theorem 6.9 (Tutte). Let M be a matroid on S and let P and Q be disjoint subsets 
of S. Let A’ be the family of minors of M on the element set P UQ. Then 

“(P " -r = i A)+r(S\A) — r(S). 

imax r'(P) +r(Q) —r'(P UQ) pamin, 976 )+r(S\A) —r(S) 

Note that the quantity on the right can be computed using the matroid inter- 

section algorithm. This fact was the basis for the connectivity algorithm in the 

subsection 6.5.3. It works by defining M,; = M\P/Q, M, =M/P\Q and finding a 

maximum-cardinality set J jointly independent in M, and Mp. It can then be proved 

that for this J, the matroid M’ = M/J\(S\(P UQUJ)) achieves the maximum in 
Theorem 6.9. This idea, due to Edmonds, yields a proof of Theorem 6.9. 


7. Recognition of representability 


7.1. Graph realization for general matroids 


A matroid M is graphic if there is a graph G such that S = E(G) and the circuits of 
M are exactly the edge-sets of simple cycles in G. In this case we write M = M(G). 
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Graph realization (GR) is the problem, given a matroid M, of determining that 
M is not graphic or finding a graph G such that M = M(G). Seymour (1981b) first 
solved GR. We use here a slight variation of Seymour’s result due to Truemper 
(1984). 


Theorem 7.1. Let M and M' be matroids on S, and let G be a graph with edge-set 
S. Suppose that 

(1) M and M' are connected, 

(2) M and M' have a common basis X such that the corresponding partial 
representations are identical, 

(3) M'=M(G), and 

(4) for every node v of G, the star of v contains a cocircuit of M. 
Then M = M’, which implies that M is graphic. 


We remark that (1) is necessary. Let § = {a,6,c,d,e} and define two matroids 
M and M’ on S as follows: M is the direct sum of U? and U} where S(U!) = {e}, 
and M' = M(G) where G has four nodes the stars of which are {a,c,d}, {b, c,d}, 
{a,b,e} and {e}. Then the pair M@,M’ satisfies (2)-(4) with X = {a,b,e}, but M # 
M'. 


Lemma 7.2. Assume that M is a connected matroid, X is a basis, C is a@ circuit, 
and S\(X UC) #96. Then there is an element e € S\(X UC) such that either M\e 
is connected or e is in series with some element of X. 


Lemma 7.3. Assume that (M.M') satisfy (2)-(4). Let e€ S, and assume e is a 
coloop of neither M nor M’. 

(a) Foree S\X, (M\e,M'\e) satisfy (2)-(4) with G replaced by G\e. 

(b) Fore € X, if e is parallel to no edge in G, then (M /e, M'/e) satisfy (2)-(4) 
with G replaced by G/e and X replaced by X\{e}. 


Lemma 7.4. Assume that (M,M’) satisfy (1)-(4). 

(a) Ife € S\X is paraltel to some element of X (in either M or M'), then (1)-(4) 
hold for (M\e,M'\e) with G replaced by G\e. 

(b) Ife€X is in series with some element of S\X (in either M or M'), then 
(1)-(4) hold for (M /e, M'/e) with G replaced by G/e and X replaced by X\{e}. 


Proof of Theorem 7.1. Suppose that M # M’, and that S has been chosen mini- 
mal subject to this condition. Let C be a circuit of one of M, M’, independent in 
the other. Then by Lemmas 7.2, 7.3(a), and 7.4(b), C > S\X, and so r(S) 2 r*(S). 
Let D be a cocircuit of one of M, M’, coindependent in the other. Then, by the dual 
of Lemmas 7.2, and by 7.3(b) and 7.4(a), D DX, and so r*(S) 2r(S). It follows 
that r(S) =r*(S), C=S\X and D= X. 

Now suppose that C is a circuit of M and that D is a cocircuit of M. Then 
for e € X, adding e to C creates no circuit in M, because of D. Hence, by the 
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minimality of S, C U {e} is a circuit in M’, But if |X| > 2, this implies X contains 
a circuit of M’, since M‘ is binary. Moreover, |X| # 1 (otherwise M = M’), and so 
we conclude that C is a circuit of M’ and D is a cocircuit of M’. But then deleting 
D from M', that is, G, leaves M’ connected, because of C, which implies that D is 
the star of a node in G. On the other hand, C is a basis of M, and so D contains 
no cocircuit of M, contrary to (4). This proves the theorem. 


In the next subsection we will describe two algorithms for recognizing when a 
binary matroid is graphic; given such an algorithm, Theorem 7.1 yields an algo- 
rithm for GR, as follows. Let M be a matroid and assume that M is connected (if 
not, apply the algorithm to its connected components). Construct a partial repre- 
sentation R for M and determine whether the associated binary matroid M((J,R)) 
is graphic. If not, stop — M is not graphic; otherwise, taking G to be a graph with 
representation (7, R), use Theorem 7.1 to test whether M = M(G). 

The computational complexity of this algorithm may be derived as follows. The 
construction of a partial representation R requires O(r(M)|S}) calls to an inde- 
pendence oracle. To determine whether (/,R) is the binary representation of a 
graphic matroid, we may use the algorithm of Bixby and Wagner described in 
the next subsection. The work for this step is bounded by O(za(z,r(M))), where 
a(-,+) is an inverse of the Ackermann function (see Theorem 7.8) and z is the 
number of nonzero entries in R. Finally, to apply Theorem 7.1 one must check 
the star of each node in G to see whether it contains a cocircuit. This checking 
requires a total of O(r(M)|S|) calls to an independence oracle. Thus, the entire 
algorithm uses O(7(M)|S|) oracle calls, plus O(za(z,r(M))) other work. 

In closing we mention another result that can be used to solve GR. This result 
generalizes a result of Tutte for binary matroids. It uses concepts defined in the 
discussion preceding Theorem 6.5. 


Theorem 7.5 (Bixby). Let M be a matroid and let D be a cocircuit of M. Then 
M is graphic if and only if 

(a) the D-components of M are graphic, and 

(b) the bridge graph of D is bipartite. 


This theorem suggests a recursive algorithm for GR, very similar to the first algo- 
rithm discussed in the next subsection. The bound for this algorithm is O(r(M )z), 
where z is the number of nonzero elements in some partial representation of M. 


7.2. Graph realization for binary matroids 


Numerous algorithms have been proposed for GR on binary matroids. We describe 
two. The first is based on Theorem 7.5 for the case of binary matroids. Let M be 
a binary matroid and let R be a partial representation of M. The steps in the 
algorithm, due to Tutte (1960), are the following: 
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Step 1. If (7,R) has at most 2 ones in each column, M is graphic; otherwise, 
choose a column having a one in each of three rows, corresponding to cocircuits 
D,,D2 and D3. 

Step 2. If each of D,,D2 and D3 has just one bridge, then M is not graphic 
(because edges of graphs can be in at most two nonseparating cocircuits, corre- 
sponding to their end vertices); otherwise, let D be a separating cocircuit. (For 
definitions of terms used here, see the discussion preceding Theorem 6.5.) 

Step 3. Compute the bridge graph of D. If it is not bipartite, M is not graphic; 
otherwise, apply the above steps recursively to the D-components of M. M is 
graphic if and only if each of these is graphic. 

In Bixby and Cunningham (1980) it is shown that this procedure can be imple- 
mented to run in time O(r(M)z) where z is the number of nonzero elements in 
the representation (/, R). 

As our second algorithm, we describe one (Bixby and Wagner 1988) based on 
an idea of Lofgren. The algorithm has the same computational complexity as an 
algorithm outlined by Fujishige, which is also based on Léfgren’s procedure. This 
complexity is the best for any known GR algorithm on binary matroids. It is 
conjectured in Bixby and Wagner (1988) to be best possible. 

Let G be a 2-connected graph with edge-set E. For E’ C E, let V(E’) denote 
the set of nodes in G incident to some edge in E’. Let {E,, £2} be a partition 
of the edge-set of G such that V(E,) N V(E2) = {u,v}. Let G, be the subgraph 
of G induced by £,. Define G’ to be the graph obtained by interchanging in G, 
the incidences of the nodes u and uv. Then G’ is said to be obtainable from G by 
reversing G,. In general, G” is 2-isomorphic to G if G" is obtainable from G by a 
sequence of subgraph reversals. 


Theorem 7.6 (Whitney 1933). Let G and G' be 2-connected graphs on the same 
edge-set. Then G and G' are 2-isomorphic if and only if they have the same matroid. 


Let R be a partial representation of a binary matroid M. Define R to be graphic 
if M is graphic. Let R, denote the matrix made up of the first k columns of 
R, where rows consisting entirely of zeros have been deleted. R is called totally 
nonseparable if R, is nonseparable for 1 < k < |S|—r(M). For any matrix A such 
that G(A) is connected, it is easy to compute a permutation of the columns such 
that the resulting matrix is totally nonseparable. , 

Where C, is the fundamental circuit defined by column k of R, define P, = 
Ci 1 {Uj<x Cj}. A set of edges P of a graph G is a hypopath of G if P is a path 
in some graph 2-isomorphic to G. The following statement, provable directly from 
Theorem 7.6, is the “subrearrangement theorem” of Lofgren (1959). 


Theorem 7.7. Let R be a totally nonseparable {0,1}-matrix with c columns. As- 
sume that for some 1<k <c, R, is graphic with realization G,. Then R,,,, is graphic 
if and only if P,.,, is a hypopath of Gx. 


Lofgren suggested the following procedure for testing whether R is graphic. 
Assume R is totally nonseparable. Clearly R; is graphic. Suppose there exists a 
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graph G, that realizes R,. Further, suppose P,,; is a hypopath of G,. Then there 
exists a graph G, 2-isomorphic to G, such that P,,, is a path in G,. Add the 
edges of Cy,1\Px1; to G, so that they form a path between the ends of P,,, but 
are not incident to any other vertices of G,,. It is straightforward to verify that the 
resulting graph G,,, is a realization of R,,,. If the above procedure breaks down 
at any point, it follows that R is not graphic. 

To implement the above idea requires a polynomial-time method for construct- 
ing Gi, from G,. A natural approach is to invoke some representation of G, that 
“displays” all graphs 2-isomorphic to Gy. For this representation we use a graph- 
decomposition theory developed by Tutte in which a 2-connected graph is uniquely 
decomposed into polygons, bonds and 3-connected graphs. Using this decomposi- 
tion, we can determine in polynomial time whether a given subset of edges is a 
hypopath. 

The complexity of the algorithm is stated in the following theorem. The function 
a(-,-) is an inverse of the Ackermann function, and is very slow growing, being 
for all practical purposes never bigger than 4. 


Theorem 7.8 (Bixby and Wagner). Given anr x c {0,1}-matrix (1, R) with z non- 
zero entries, there is an algorithm that runs in time O(za(z,r)) and uses space O(z) 
to determine whether the binary matroid M((I,R)) is graphic. 


7.3. An application to linear programming 


A (linear) network (flow) problem is a \inear programming problem min{c'x: 
Nx = b,x 20}, where is N is a {0,+1}-matrix with no column having two equal 
nonzero entries. We call a matrix with this property a network matrix. These are 
exactly the matrices that occur as submatrices of node—arc incidence matrices of 
digraphs. 

Two matrices A and R are projectively equivalent if one can be obtained from 
the other by elementary row operations and nonzero column scaling. A linear 
program min{c'x: Ax = b,x > 0} is called a hidden network if its constraint matrix 
A is projectively equivalent to a network matrix N. In this case, given explicit 
knowledge of N, one can easily produce an equivalent network problem with 
constraint matrix N. The motivation for doing so is primarily computational: It is 
well established that network linear programs can be solved much more efficiently 
than general linear programs. 

The relationship between GR and hidden networks is summarized in the fol- 
lowing theorem. In particular, it follows from this result that any polynomial-time 
algorithm for GR on binary matroids implies a polynomial-time algorithm for 
testing whether a given linear program is a hidden network. 


Theorem 7.9 (Iri 1968). Let A = (1, R) be a@ real-valued matrix, where I is an iden- 
tity matrix, and let A' = (1,R') where R' is obtained from R by replacing nonzero 
entries with 1's. Then A is projectively equivalent to a network matrix if and only if 
the following two conditions hold: 

(a) R’ is the partial representation of a graphic matroid M; and 
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(b) where G is any graph whose matroid is M, D is any orientation of G, and 
N is its corresponding network matrix, A is projectively equivalent to N, 


7.4, Recognizing total unimodularity 


A {0,1}-matrix is totally unimodular if every square submatrix has determinant 
+1 or 0, The significance of these matrices in optimization was pointed out by 
A.J. Hoffman and J. B. Kruskal (see chapter 30 by Schrijver for an extensive dis- 
cussion of total unimodularity), who observed that linear programming problems 
min{c'x: Ax = b,x > 0} with integral 6 and totally unimodular constraint matrix 
A have integral basic feasible solutions (a simple consequence of Cramer’s Rule). 
It is well known that network matrices are totally unimodular, and Hoffman and 
Kruskal asked whether other interesting classes could be found. Seymour gave an 
answer to this question: 


Theorem 7.10 (Seymour 1980). Let M = M(A) where A is totally unimodular, 
and assume that M is 3-connected and has no 3-separation {S,,S,} with |S;| > 
4 <|S,|. Then M is either graphic, cographic (the dual of a graphic matroid), or 
isomorphic to Rip. (See chapter 10 by Seymour for a definition of Rip.) 


The above theorem is perhaps best viewed as a “decomposition” result. In chap- 
ter 10 by Seymour, the connectivity-based notions of 1-, 2- and 3-sum are defined. 
In terms of these sums, Theorem 7.10 can be stated as follows: Every matroid 
arising from a totally unimodular matrix can be constructed using 1-, 2- and 3- 
sums starting with only graphic and cographic matroids and copies of Ro. Thus, 
apart from the matrices representing Rj, all “indecomposable” totally unimodu- 
lar matrices arise from graphs, or duals of graphs. This view suggests an algorithm 
for testing whether a matrix is totally unimodular. More generally, it can be used 
to test whether a given matroid M is regular (ie. M = M(A) where A is totally 
unimodular). 

To see that a solution to the second problem above yields a solution to the 
first, suppose that M = M((/,A)) has been shown to be regular where A is a 
given real matrix with {0,+1} entries, and / is an identity matrix of appropriate 
dimension. Let A, denote the submatrix of A made up of the first k columns. 
A, is clearly totally unimodular. In general, assume that A, (k > 1) is known to 
be totally unimodular. Find the components of the bipartite graph G(A;,), and 
let i’ and i” be two row indices in the same component of G(A,) and such that 
Qi zat FO F Ays,. Find a shortest path from i’ to i” in G(A,). The row and 
column indices of this path together with the column index k + 1 determine a square 
submatrix of A;,,, since the path was chosen to be shortest. If this submatrix has 
determinant other than +1 or 0, then A,,; is not totally unimodular, and so neither . 
is A; otherwise, after checking all such pairs (i',i"), it follows that A;,,, is totally 
unimodular. The validity of this last assertion can be deduced by showing that if 
(1, A‘) is a totally unimodular matrix with the same zero, nonzero pattern as (7, A) 
((1, A‘) exists because M is regular), then the success of the checking procedure 
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for A implies that A can be scaled to A’ by multiplying some rows and columns 
of A by —1. 

We now sketch an algorithm for determining whether a general matroid is reg- 
ular, using Seymour’s theorem above. The description proceeds inductively on 
|S(M)|. Thus, we assume that a matroid M is given, specified by an independence 
oracle, and that we can test for regularity any matroid that has a smaller number 
of elements. 

Obviously we can test whether M is isomorphic to Rio, and using the results on 
graph realization given earlier in this section, we can test whether M is graphic 
or cographic. If any of these tests is positive, then M is regular and we are done; 
otherwise, using results of section 6, we test whether M has a 1-separation {S,, S3}. 
If so, let M; = M\S; (= 1,2); it is easy to see that M is regular if and only if M, 
and M) are regular. Hence, we may assume that M is connected. Now, test for 
a 2-separation {S,,S2}. If one exists, find a circuit C of M such that C; =C nS; 
x ) (i = 1,2). Select e€ Ci Gi = 1,2) and let M; = M\(S;\C;)/(C;\ {e:}) (i = 1,2). 
Then both M, and M2 are smaller than M, and it can be shown that M is regular 
if and only if M, and M; are regular. 

Finally, it remains to consider the case when M is 3-connected, neither graphic 
nor cographic, and not Rio. This case is difficult because we must, in effect, consider 
representing M as a 3-sum, 3-sums are defined only for binary matroids, and M 
is not known to be binary (moreover, if M is not regular we have no way to test 
whether it is binary, short of determining that it is regular, since testing whether a 
general matroid is binary is intractable). The method given below for dealing with 
this difficulty is due to Truemper (1982). 

Using the connectivity algorithm based on matroid intersection (see section 6), 
we can easily determine whether M has a 3-separation {5,,52} such that |S,| 2 
4 < |S2|. Suppose this is the case. Let X2 be a basis of 52, and extend X2 to a basis 
X = X, UX, of M, where X,N X, =4. Let Y; = S;\X; (i = 1,2). Let R be the PR 
of M determined by X: 


R= A, 0O 
D A; 
where A; = R(X;, Y;) (i = 1,2) and rk(D) = 2. Suppose that D has the structure 


J, 0 
0 Js, 


D= 


where J; and J» are matrices of all 1’s. If D does not have this structure, we take 
R’=R, X} = X2 and Y; = Y2; otherwise, since M is 3-connected, it follows that 
there is a shortest path in A, joining some row of J, and some row of J2. A pivot 
on a nonzero element r,, of R is a sequence of elementary row operations (over 
GF(2)) that remove all nonzeros in column y, except rry, followed by a resetting 


$98 R.E. Bixby and W.H. Cunningham 


of all entries in this column to their original values. Performing GF(2) pivots on 
appropriate 1's along this path in Az we obtain a matrix 


A, 0 
R= 


where D’ = R’(X}, Y,) and A, = R(X}, Y;), and where D’ has some row of all 1's 
and X,UY; = S2. Note that if R’ is not a PR of M, then M is not binary, and 
hence not regular. 

Now define M, = M/X, and M7 = M?\Y,. Let N, be the binary matroid with PR 
(D’, Az), and let Nz be the binary matroid with PR A,. Remove loops, coloops, 
series and parallel elements from N, and N2. If these elements are not loops, 
coloops, series and parallel in M, and M2, respectively, then M is not regular; 
otherwise, let Mj and Mj be the corresponding reduced versions of M; and Mo, 
respectively. Now M, and Mp2 are smaller than M, and Truemper has proved that 
they are both regular if and only if M is regular. 

The above procedure clearly runs in polynomial time. We will not attempt to 
estimate its complexity. Truemper (1990) has given a more complicated, direct 
algorithm with complexity O(|S|*). 


7.5. Intractable problems 


In section 2 it was shown that computing the girth of a matroid is intractable (i.e. 
that there is no oracle polynomial-time algorithm for this problem), and, as noted 
at the beginning of section 6, the same argument applies to show that computing 
the connectivity of a matroid is intractable. 

In this section we have given polynomial-time algorithms for a small set of “rep- 
resentability” questions: Testing whcther a matroid is graphic, testing whether a 
binary matroid is graphic, and testing whether a matroid is regular. The surprising 
fact is that these are essentially the only “interesting” representability questions 
for which polynomial-time algorithms are possible. Thus, at one extreme, test- 
ing whether there exists a field over which a given matroid is representable is 
intractable, as is the other extreme case of testing whether a given matroid is 
representable over a given field. 

The most basic result along these lines was proved by Seymour (1981b), who 
showed that testing whether a matroid is binary is intractable. His proof runs as 
follows. Let S = {x,,...,X%,¥1s+.., yx} be a 2k-element set, define Y = {y,,..., yx} 
and define two families of subsets # and & of S as follows: 


A= {{xi,yinxj,yj} CS:i <j} 
B={Z= {z,...,Z%} CS: 2; =x; or y;, and |ZNY| is even} 
Then € = 4U & is the family of circuits of a binary matroid M on S. Now, for. 
each Z € 2, let Mz denote the nonbinary matroid that is identical to M except 
that Z is independent (Mz is a matroid since Z is a circuit and hyperplane of M). 


The existence of the matroids Mz implies that any algorithm proving that M is 
binary must have made at least |@| = 2*! calls to the independence oracle. 
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8. Matroid flows and linear programming 


8.1. Maximum flows 


Let M be a matroid on S. Fix / € S, and let ©; be the family of circuits of M 
containing /. Let A be the {0,1}-matrix with columns indexed on elements e &€ 
S\{1} and rows indexed on circuits C € €;, such that ace = 1 if and only if e EC. 
Let &; be the family of cocircuits of M containing /. We say that M is an /-MFMC 
matroid, that is, has the (integral) max-flow min-cut property with respect to I, if 
for every choice of nonnegative integral vector w defined on S\{/}, 


min w(D\{/}) = max(I'y: y'A < w',y >0 and integral), 
Ed, 


where 1 is a vector of 1’s. 

A nonnegative vector y satisfying y'A < w’ is called a flow, or I-flow, and Iy 
is its value. A flow of maximum value is a maximum flow. The special element 
/ is called the demand element. Define M to be a MFMC matroid if it is an /- 
MFMC matroid with respect to every choice of demand element /. (In chapter 10 
by Seymour MFMC matroids are called free flowing.) 


Theorem 8.1 (Seymour 1977b). A matroid M isan MFMC matroid if and only if 
it is binary and contains no Fj minor. 


This theorem can be proved using “splitter theory” (Seymour 1980), from which 
it follows that a 3-connected matroid M is binary and has no FF minor if and only if 
it is cither regular or isomorphic to F;. This structural result gives an algorithm for 
computing maximum flows in MFMC matroids and for testing whether a matroid 
is an MFMC matroid. We describe this algorithm for computing flows. Truemper 
(1992) contains an extensive treatment of this topic. 

Suppose M is not 2-connected. Then it is easy to see that M is an MFMC 
matroid if and only if each of its 2-connected components is. In particular, for any 
choice of demand element, the computation of a maximum flow for this demand 
element can be restricted to the component containing it. 

Assume M is 2-connected but not 3-connected. Let {S;,52} be a 2-separation of 
M and let C be acircuit of M such that C; = CONS; #@ (i = 1,2). Select e; EC; G = 
1,2) and let M; = M\(S;\C,)/(C;\{e;})) (= 1,2). Then M is an MFMC matroid 
if and only if M, and M2 are. Assuming that , and M) satisfy this property, 
we may compute maximum flows for M as follows. Select an element / € S and 
assume / € S(M,). Let w be an integral vector defined on S. Restricting w to S2, 
compute a maximum e)-flow in M2. Let r be the value of this flow. Now define w’ 
on S| by w’(S;\{e2}) = w and w’(e.) =r. Now compute the value of a maximum 
[-flow on M, with respect to w’. It is easy to prove that this is the maximum flow 
value for M; moreover, it is not difficult to construct the actual flow for M from 
the flows for M, and M2. 

Finally, suppose that M is 3-connected. Then M is either isomorphic to F7 or 
regular. Both of these properties can be checked, the latter using the algorithm 
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given in section 7. To complete the description of the algorithm for finding max- 
imum flows in MFMC matroids, we describe a method for computing maximum 
flows in regular matroids. This can be done in a variety of ways. The simplest is 
via linear programming. Let M be regular and assume K is a totally unimodular 
matrix such that M = M(K). Then it can be proved that the optimal value of the 
following linear program is the value of a maximum /-flow in M: 


max(x;: Kx =0,0< x. < we (e £!)) 


Since K is totally unimodular, this LP will have an integral optimal solution x. By 
decomposing this x into multiples of rows of A, we obtain the desired flow vector 


The characterization of the /-MFMC matroids, for a fixed /, is very similar to 
that for the MFMC matroids, but the proof is much more difficult. 


Theorem 8.2 (Seymour 1977b).. A matroid M is an !-MFMC matroid with respect 
to some fixed demand element | if and only if it is binary and contains no F} minor 
containing 1. 


Let us denote the class of binary matroids that are /-MFMC matroids with re- 
spect to some fixed / by .ft;. Truemper (1987) has given a polynomial-time algorithm 
for testing membership in .f;, and has shown that the maximum flow problem on 
this class always has an integral solution by giving an algorithm for computing such 
a solution. We describe Truemper’s algorithm for computing maximum flows on 
M,. To simplify the presentation, we do not concern ourselves with finding integral 
flows, We also do not treat membership, although the development given below 
can easily be modified to do so. 

The problem we will actually solve is the problem of finding shortest /-paths. 
An I-path is a set P of the form C\{/} where C € €;. Given a real-valued “length” 
function d defined on S, the length of P is d(P). P is shortest if its length is 
minimum. Now, to sce that we can solve maximum flow problems by computing 
shortest paths, note first that to solve a given maximum flow problem, it is sufficient 
to solve the corresponding dual: 


min(w'x: Ax >1, x > 0). (8.3) 


Of course, (8.3) generally has an enormous number of constraints, and it is no 
more obvious how to solve it than to solve the primal. Note, however, that for a 
given x > 0, checking Ax > 1 is nothing but the problem of checking whether 


i 1. 
mina(C\{1}) 2 


Thus, if we can compute shortest /-paths in polynomial time, then we can check 
for violated inequalities in (8.3) in polynomial time, and so we can solve (8.3) in 
polynomial time using the ellipsoid method. 

Let M € , and consider the problem of computing a shortest /-path with respect 
to some nonnegative weight function w defined on S. We begin by discussing 
several special cases. 
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lf M is not 3-connected, this shortest-path problem on M can be solved, or, 
more precisely, reduced to smaller shortest-path problems, using exactly the same 
methods we used to compute maximum flows in the non-3-connected MFMC ma- 
troids. 

Suppose M is either regular or isomorphic to F;. The F7 case obviously presents 
no difficulty. For the case when M is regular, we can use the following computation, 
where M* ~ M(K*) and K* is totally unimodular: 


max(x;: K’x =0, 0 < x < We (e £1). (8.4) 


Since regular matroids are MFMC matroids, the optimal value in (8.4) equals the 
minimum weight of a circuit containing /. 

Suppose that there is a triad {x,y,z} of M, not containing /. (A triad is a cocircuit 
of cardinality 3.) In this case we make use of the following result: 


Lemma 8.5. Suppose that M € Mt, {x,y,z} is a triad of M not containing |, and 
{e, f,g} is disjoint from S. Then M’ € At, where M’ is obtained from M by creating 
circuits {e,x,y}, {f,y,z} and {g,z,x}. 


This lemma justifies the following construction. Assign weights we = Wy + Wy, 
wr=W,t+w, and w,=w,+w, to the elements e, f and g, respectively. Let 
M” = M'\{x,y,z}. Now, using any shortest /-path in M”, we easily construct a 
shortest /-path in M. This procedure reduces by one the number of triads not 
containing J. 

We are now reduced to considering the case when M is 3-connected, not isomor- 
phic to F>, not regular, and contains no triad missing /. In this final case, M can be 
appropriately decomposed, and the shortest-path problem solved on it by solving 
appropriate problems on the smallcr matroids that result from the decomposition. 

Assume that {S,,52} is a 3-separation of M with the properties that ||| >4< 
|S2|, 2 € S. and S2\{/} spans /. Let X2 be a basis of S2\{/} and extend X2 to a 
basis X = X,UX2 of M, where X,NX,= 0. Let Y; = S;\X; G = 1,2). Let R be 
the PR determined by X (in fact, R is a representation of M since M is binary). 
Let X} C X> be such that X} U {/} is the fundamental X-circuit determined by /, 
and let X37 = X2\X3. Then we may display R as 


where 1 and 0 are matrices of 1’s and 0’s, respectively, of appropriate dimensions. 
Now if D’ is identically 0, then there must be an element in Y> such that the cor- 
responding column of R has a 1 in both Aj and AJ; otherwise, M is not connected. 
By pivoting on a 1 in the intersection of such a column with A, D’ becomes 
nonzero. We would also like to arrange that rk(D’) = 2. Suppose this is not the 
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case. Let X} be the subset of elements in X3 determined by the nonzero rows in 
D’, and let XJ be the subset of elements in Xj determined by the nonzero rows in 
D", different from those in R(X, Y;). Then there is a path in G = G(R(X2, Y2)) 
between some element of X}, and some element of X¥, for in the alternative case 
M is not 3-connected (where T is the vertex-set of the component of G containing 
X3, {7,S\T} is either a 1- or 2-separation of M if S\T 4 6). Taking a shortest such 
path, and pivoting on appropriate 1’s in the path, results in rk(D’) > 2. Now, ap- 
plying essentially the same argument to A), letting any nonzero row in A play the 
role of J, it follows that we may assume RK has the following form, for appropriate 
elements e, f, x, y and z: 


yzl 


From the above representation for M@, we construct two matroids M, and M) with 
PRs given by R; and R3: 


We can now state 


Theorem 8.6. Let M € A; be 3-connected. Assume that M is not regular, not iso- 
morphic to F; and has no triad missing |. Then there is a 3-separation {S,,S2} of 
M such that |S\| > 4 < |S2|, 1 € S2, S\{l} spans land M, is either isomorphic to F; 
or is regular. 


There are now two steps left. We must show how to compute a 3-separation of 
the form guarantced by Theorem 8.6, and we must show how the resulting decom- 
position into M, and M) can be used to compute shortest /-paths. For the compu- 
tation of the 3-separation we use the following expedient algorithm. (A much more 
efficient, direct algorithm is given in Tseng and Truemper 1986). For each disjoint 
pair {U,, U2}, where J € U, and |U,| =4 = |U2|, we apply the Matroid Intersec- 
tion Algorithm to the pair of matroids M’ = M/U,\U2 and M”=M/U2\U, to 
determine the unique minimal set A C S’ = S\(U, U U2) that minimizes ry(A)+ | 
ru(S’\A).! Then {U, UA, S\(U; UA)} is a 3-separation if and only if there exists 
' To see that we can compute A with this property, note that A here is the same as the A in Theorem 


3.6(b), where M’ plays the role of M, and M” the role of M2. The minimal A is just the set of vertices 
uv #s of G such that there exists an (s,v)-dipath. 
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some 3-separation {V;, V2} such that U; C V; (i = 1,2). Now, by the minimality 
of the A’s, for any 3-separation {S,,52} that satisfies the conditions of Theorem 
8.6, we will find some A determining a 3-separation and such that U; UA CS). It 
follows that this 3-separation also satisfies the conclusion of the theorem. 

We are now in a position to complete the description of a method for find- 
ing a shortest /-path in M. By the computation of the previous paragraph, there 
is a 3-separation giving rise to matroids M, and M2 satisfying the conditions of 
Theorem 8.6. The method for finding shortest paths is then straightforward. Set 
We = Wy = w,; = 0 in My. Find the shortest e-, f- and J-paths in M,\{f,/}, M;\{e, 1} 
and M,\{e,f}, respectively, and let their corresponding lengths be d., d, and d,. 
(These computations are possible since M, is either regular or isomorphic to F7.) 
Now add new elements e’, f’ and g’ to M2 so that {e’,x,y}, {f’,y,z} and {g',z,x} 
are circuits; then delete x, y and z, denoting the resulting matroid by M3. By 
Lemma 8.5, Mj € A); moreover, M; has fewer elements than M, and so we may 
assume (by induction on |S|) that an algorithm is available to compute shortest 
!-paths in M5. Set w; = d; for i =e’, f’,g’. It is straightforward to see that a shortest 
[-path in Mj, together with the three shortest paths constructed in M, can be used 
to build a shortest /-path in M. 


82. Oriented matroids and linear programming 


Matroid theory arises as a combinatorial abstraction of properties of linear de- 
pendence. From this viewpoint, oriented matroid theory arises when attention is 
restricted to ordered fields. Hence, it allows interpretation and generalization of 
ideas of real linear algebra. We indicate here how this can be done with linear 
programming. It is a very attractive theory, which has also produced new insights 
and methods in linear programming. Good references for this subject are Bachem 
and Kern (1992) and Bjérner et al. (1993). 

Oriented matroids are treated in chapter 9 by Welsh. We give a definition that 
is useful for our purposes, and explain the relationship with the earlier definition. 
A sign vector on S is an element of {0,1,—1}5. To each vector in R° we asso- 
ciate a sign vector in the obvious way. For any vector x € R°, the support of x 
is {7 ¢ S: x; #0}. For a subspace L of R*, a vector x € L is elementary if x #0 
and the support of x is minimal. The supports of elementary vectors of L are the 
circuits of a matroid M, and those of L+, the orthogonal complement of L, are the 
circuits of M*. In particular, where A is a matrix with columns indexed by S, and 
i, = {x: Ax =0}, M is the matroid whose independent sets correspond to linearly 
independent sets of columns of A. One can easily check that the sets ¥, #* of 
sign vectors corresponding to elementary vectors of L, L+ satisfy: 


the supports of members of & are the circuits of a matroid M, 
and those of #* are the circuits of M*; 


for every circuit C of M(M*), there are exactly two elements x, y 
of #(¥*) such that C is the support of both, and x = —y; 


(8.7) 


(8.8) 
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ifx Ee F,y € ¥*, and x;y; #4 0 for some {, then there exists j,k € S (8.9) 
with x;y; = 1 and x,y, = —1. j 
For a triple (S, ¥, #*) such that ¥, ¥* are sets of signed vectors on S, we take 
(8.7)-(8.9) as the definition of a dual pair of oriented matroids. It is not difficult 
to see that (S, #) determines #*, so oriented matroids do not need to be defined 
in dual pairs. (See chapter 9 by Welsh.) It is also unnecessary (and a little in- 
convenient) to deal with elementary sign vectors. We have done so because the 
corresponding matroidal objects (circuits) are more familiar. The price we pay is 
that, in applications, we often have “elementary” as a qualifier when we do not 
really want it. In many cases, the following result can be used to get rid of it. 


Lemma 8.10. Let L be a subspace of R° and let x € L, x 40. Then there exist ele- 
mentary vectors x',x?,...,x* of L such that Yy*_, x! =x and x # O implies xix; > 0. 


As an example, suppose that we arc interested in the existence of a non-negative 
solution of A’x’ = b. By taking A = (A’, —b), this is equivalent to the existence of 
a non-negative solution of Ax = 0 with a particular component x, positive. By 
Lemma 8.10 the latter is cquivalent to the existence of such a solution that is 
also elementary. So the following oriented matroid theorem (due independently 
to Bland and Las Vergnas, sce Bland 1977) gives a characterization. (Notice that 
both situations in Theorem 8.11 cannot occur, because of (8.9).) 


Theorem 8.11. Let (S,¥%,#*) be a dual pair of oriented matroids, and let e € S. 
Then either there exists x © ¥ with x >0 and x, =1, or there exists y © #* with 
y 2Oand y, = 1. 


Applying Lemma 8.10 and Theorem 8.11, we have that A’x’=b has a non- 
negative solution if and only if there does not exist a vector z > 0 in the row space 
of (A’, —6) that is positive in the last component, that is, if and only if there does 
not exist y such that y'A’ > 0, y'b <0. This is the classical Farkas Lemma; see 
chapter 30 by Schrijver. 

Somewhat similar techniques allow a generalization of the duality theorem 
of linear programming. One form of that theorem says that if the problems 
max(c'x’: A’x' = b,x' >0) and min(y'b: yA’ 2c") have feasible solutions, then 
they have ones for which c'x’ = y'b. An equivalent statement of the latter con- 
dition is that (y"A’ — c")x’ = 0. We handle the objective function c'x’ by adding 
a new variable x» and a new equation x; —c'x’ =0 as the last equation in the 
system. Thus, where the components of x' and the columns of A’ are indexed by 
elements of S\{e, f}, we get a new problem with unknown vector x € R°. So the 
existence of x’ such that A’x’ = b and x’ > 0 is equivalent to the existence of x such 
that Ax =0, xe = 1 and x, > 0 for j € S\{f}. The existence of a vector y such that 
yA’ >" is equivalent to the existence of a vector z = (y',1)A in the row space of 
A, such that z; >0 for j € S\{e} and z, =1. If x, y and z have the above proper- 
ties, then the condition (y™A’ — c')x’ = 0 is equivalent to x;z; =0 for j € S\{e, f}. 
Hence, the following result of Lawrence generalizes the duality theorem. 
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Theorem 8.12. Let (S,#,%*) be a dual pair of oriented matroids and let e,f € S 
with e # f. If there exists x € F with x-=1 and x; 20 for j E S\{f}, and there 
exists z © #* with zs =1 and z; 20 for j € S\{e}, then there exist such x, 2 with 
XjZj =O, jE S\{e, f}. 


Considerable effort has gone into finding constructive proofs of results such 
as Theorems 8.11 and 8.12. One would like to have, for example, an oriented 
matroid algorithm that constructs the x and z of Theorem 8.12. Bland (1977) 
found an extension to oriented matroids of the simplex method of linear program- 
ming, although the extension is not as complete as one might hope. [More recently 
other algorithms for “optimizing” in oriented matroids have been introduced. See 
Bachem and Kern (1992) for references.] 

We briefly indicate how the simplex method can be imitated. Consider a basis 
B of M, such that e ¢ B and f € B. We define a tableau (a;;: i € B,j € S) deter- 
mined by B as follows. For each i € B, (a;;: j € S) is the vector z € ¥* whose 
support is the fundamental circuit C(S\B,i) of M*, and that also satisfies z; = 1. 
It is easy to show that for each j € B, the vector x defined by x; = —1, x, = a,; for 
k € B, and x, =0 for k € (S\B)\{j}, is in ¥, and its support is the fundamental 
circuit C(B,j) of M. In particular, the z € #* determined by choosing i = f above, 
and the x € # determined by choosing j = e, satisfy the condition x;z; = 0 for all 
j c S\{e, f} of Theorem 8.12. If this x satisfies x; > 0, j € S\{f}, then the tableau 
is primal feasible; if this z satisfies z; 20, j € S\{e}, it is dual feasible. 

A strengthening of Theorem 8.12 is that (under the same hypotheses) there 
exists a tableau that is both primal and dual feasible. A primal simplex method 
is one that begins from a primal feasible tableau and performs pivot operations, 
replacing B by a basis B’ = (BU {j})\{k} where j € S\(B U {e}) violates dual fea- 
sibility. There is a simple rule in ordinary linear programming for choosing k, but 
it is based on numerical comparisons, and so is unavailable in the oriented ma- 
troid setting. However, it can be proved that there does exist a choice for &, if 
the hypotheses of Theorem 8.12 are satisfied. A more serious difficulty is that cy- 
cling (repeating a basis) can occur in a different way from that in ordinary linear 
programming, making rules for finite termination considerably more complicated. 
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1. Introduction 


The theory of permutation groups can be traced to the work of Galois on the 
solubility of equations, if not earlier. It became an independent discipline with the 
work of the Paris schoal, notably Jordan and Mathieu, after the mid-19th century. 
Frobenius, Burnside and Schur linked it to representation theory, while Zassenhaus 
and Witt obtained the first general classification theorems. In the flourishing of 
finite simple group theory in the period from 1955 to 1980, permutation group 
theory both used and contributed to the new techniques; some of the contributions 
were Fischer’s work on 3-transposition groups and the construction of several 
sporadic groups. Throughout this period, the subject had clear goals, though these 
appeared to be out of reach. Perhaps the most important problem was: are there 
any 6-transitive groups apart from the symmetric and alternating groups? 

Also, throughout this period, permutation groups went hand-in-hand with parts 
of finite geometry and graph theory. This was also a two-way process: Dembowski’s 
book oan finite geometries shows clearly the impact of the new group-theoretic 
techniques, while the sporadic groups appeared as automorphism groups of com- 
binatorial objects. 

Then, in 1980, the classification of the finite simple groups was completed. 
Overnight, the balance changed; many of the long-term problems about permu- 
tation groups were settled, using this new and powerful machine. Thus, group 
theory became an even more powerful tool for finite geometers, but the key réle 
of combinatorial argument in permutation group theory gave way to increasing use 
of representation theory to understand better the finite simple groups. A typical 
theorem about permutation groups in the 1980s asserts that a group with some 
specified properties is one of a long list (perhaps taking a page or two to state) of 
known groups. 

In the first part of this chapter, I have taken a post-1980 view of the subject. 
After an introductory section giving the basic theory, I turn to the theorem of 
O'Nan and Scott, which reduces many problems to specific questions about simple 
groups. Then there is a brief discussion of the finite simple groups. (Fortunately, it 
is not necessary to understand how the classification is obtained in order to apply 
it - the proof is estimated to be 10000 pages long.) Then I turn to some results on 
primitive groups: classification theorems (including the list of multiply transitive 
groups), results about degrees and orders, the Sims conjecture and its application 
to distance-transitive graphs, and so on. 

As mentioned carlier, representation theory (and, from a slightly different point 
of view, character theory) has been associated with permutation groups for a long 
time, In combinatorial terms, we are led to study “coherent configurations”, which 
generalise association schemes and Schur rings. There are applications to enumer- 
ation theory, statistics, and other fields. A specific topic having very strong links 
with combinatorics is the character theory of the symmetric group. 

One part of the theory which has become more important recently is the study 
of algorithms for permutation groups. At the most basic level, if we are given a 
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set of permutations, we want to obtain information about the group they gener- 
ate, such as its order, or a membership test for arbitrary permutations. I describe 
these techniques in detail, and give a summary of results about more recondite 
properties. 

The final section concerns infinite permutation groups. This has seen very rapid 
developments during the last decade, due in large part to its very close links with 
model theory. 


2. Transitivity and primitivity 


This section contains the basic theory of permutation groups: transitivity and prim- 
itivity, coset spaces, the orbit-counting lemma, direct and wreath products, and so 
on. Much of this material can be found in Wielandt (1964). 

A permutation group is simply a subgroup of the symmetric group, the group 
of all permutations of a set. It is convenient, however, to have a more flexible 
concept available. An action of a group G on a set 2 is a homomorphism from G 
into the symmctric group Sym(2) on 2. This allows us to consider actions of the 
same group on different sets (such as the points and lines of some geometry). Two 
group elements may inducc the same permutation in a given action. Nevertheless, 
it is convenient to suppress the homomorphism in the notation for an action. Thus, 
writing permutations on the right, we let ag denote the image of the point a € 2 
under the permutation (corresponding to) g € G. 

Thus, if G is a permutation group, then the identity map on G is an action. 
(This will be the usual situation.) Also, if G acts on 2, then the image of the 
action homomorphism is a permutation group, which is called the group induced 
on 22 by G (and denoted by G”). 

If G acts on £2, we often refer to Q as a G-space or G-set. 

Actions of G on sets 92, and 2, are isomorphic if there is a bijection 6 : Q; — Sh 
such that 


(ag)0 = (a8)g 


for allgé€ Ga € QQ. 

Let G act on 92. Define a relation ~ on 2 by the rule that a ~ if and only if 
ag = £8 for some g € G. Then ~ is an equivalence relation on 2. (The reflexive, 
symmetric and transitive laws for ~ follow directly from the group axioms asserting 
the existence of an identity, inverses, and closure respectively.) The equivalence 
classes are the orbits of G on Q. We say that G is transitive on 1, or that 22 is a 
transitive G-space, if there is just one orbit. In general, a transitive constituent of 
G is the permutation group induced by G on one of its orbits. Thus, we have the 
following 


Theorem 2.1. Any G-space can be expressed in a unique way as a disjoint union 
of transitive G-spaces. 
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Things are not quite so simple, however, if we consider permutation groups 
rather than G-spaces. Any permutation group is a subgroup of the cartesian prod- 
uct of its transitive constituents. More specifically, it is a “subcartesian product”, 
which is to say that it projects onto each factor. (The cartesian product is the set of 
all those permutations of 2 whose action on each orbit is the same as that of some 
element of G.) We could say that the problem of describing all permutation groups 
has been reduced to that of describing transitive groups modulo the problem of 
constructing subcartesian products. 

In the case of finite permutation groups (or, more generally, groups with finitely 
many orbits), the cartesian product coincides with the direct product, and the latter 
term is more commonly used. 

A special class of transitive actions of a group is given by the coset spaces. Let 
H be a subgroup of G, and H\G the set of right cosets Hx for x € G. Then G acts 
on H\G by right multiplication: the permutation of H\G induced by the group 
element g € G is 


Hx Hxg. 
This action is clearly transitive. And it is not so special after all. 


Theorem 2.2. 
(i) Any transitive G-space 2 is isomorphic to the coset space H\\G, where H is 
the stabiliser of a point a € N (the subgroup 


{g € G: ag =a}). 


(ii) The actions of G on the coset spaces H\G and K\G are isomorphic if and 
only if H and K are conjugate subgroups of G. 


The stabiliser of a is denoted by G, or Stabg(a). This notation can be extended 
to stabilisers of sequences of points, or of subgroups of 2. (The notation is not 
quite standard; but, if A is a subset of 2, it is fairly common to use G4 or G,4} to 
denote the setwise stabiliser of A, and G4) for the pointwise stabiliser.) 


Corollary 2.3 (Lagrange’s theorem). Jf G acts transitively on 2, then 
|2| -|Gal = |G]. 


Corollary 2.4 (Cayley’s theorem). Every group is isomorphic to a transitive permu- 
tation group. 


To see this, we take the action of G on itself by right multiplication (that is, 
on the coset space 1\G), which is obviously faithful. An action (or a permutation 
group) is regular if it is transitive and the stabiliser of a point is the identity. By 
Theorem 2.2, any regular action is isomorphic to the action described in Cayley’s 
theorem. 
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For finite permutation groups, the orbit-counting lemma (often called Burnside’s 
lemma: it is given without attribution in Burnside’s 1911 book but has been traced 
back to Cauchy by Neumann 1979) gives a formula for the number of orbits. 


Theorem 2.5 (Orbit-counting lemma). Let the finite group G act on the finite set 
Q. For g € G, let fix(g) be the number of points a € 2 for which ag = a. Then the 
number of orbits of G in Q is the average, over G, of fix(g). 


Proof. Suppose first that G is transitive. Count in two ways the number of pairs 
(a,z) with a € 2, g € G, and ag = a, to obtain 


> fix(g) = |2|- |Gol. 


REG 


By Corollary 2.3, we obtain 


xe) =1, 


KEG 


as required. 
In general, let G have ¢ orbits Q,,...,2,. and let fix;(g) be the number of fixed 
points of g in Q; for 1 <i <t. Then 


fix(g) =} fix,(g): 


i=1 


summing over / and using the result for transitive groups we obtain 


ig txs) =! 


géG 


as required. O 


This result stands at the root of two major areas, the Redfield—Pélya theory of 
enumeration under group action, and the character theory of permutation groups. 
The second of these is discussed in section 5; for the first, see chapter 21. 

Let G act transitively on 2. A congruence on 2 is an equivalence relation which 
is invariant under G. Its equivalence classes are called blocks of imprimitivity, and 
the set of blocks is a system of imprimitivity. A non-empty subset A of 2 is a block 
of imprimitivity if and only if 


AndAg=@orA 


for all g € G. We say that G is primitive if the only congruences are equality and 
the universal relation which holds between all pairs of points. 
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Proposition 2.6. The transitive group G is primitive if and only if the stabiliser of 
a point is a maximal subgroup of G. 


Proof. If A is a non-trivial block of imprimitivity (that is, neither a singleton nor 
all of 2), then Gg < Gy, < G. Conversely, if Ga < H < G, then the H-orbit con- 
taining a is a block. O 


Just as an intransitive action can be “reduced” to transitive ones, an imprimitive 
action can also be reduced to smaller actions, in a manner which I now describe. 

Let H and K be permutation groups on sets F and A respectively. Let Q =I x A, 
regarded as a covering of A with fibres Bs = {(y,8): y € I"} bijective with F. Now 
let N be the group of permutations of 2 which fix each fibre set-wise and act 
on it as an element of H acting on I’; and let K° be the group of permutations 
of 2 induced by K acting on the index set of the set of fibres. Then the group 
generated by N and K° (which is their semi-direct product) is called the wreath 
product of the permutation groups H and K (written H Wr kK). If neither F nor 
A is a singleton, then the wreath product is imprimitive; the fibres are blocks of 
imprimitivity. 


Theorem 2.7. Let G be a transitive permutation group on 2 with system of imprim- 
itivity A containing a block T. Let H be the permutation group induced on I by its 
set-wise stabiliser, and K the group induced on A by G. Then G is embeddable in 
H Wr K in such a way that its actions on Q) and on I’ x A are isomorphic. 


For finite permutation groups, this reduction reaches primitive groups after a 
finite number of steps. Thus, information about primitive groups tells us a great 
deal about transitive groups, though there is still a non-trivial problem involved 
in re-assembling the imprimitive group from its components. For infinite groups, 
this is no longer true, but there is a more general concept of wreath product of an 
arbitrary partially ordered set of groups which does the same job, defined by Hall 
(1962) and others. 


3. The O’Nan-Scott theorem 


While every abstract group is isomorphic to a transitive permutation group (Cay- 
ley’s theorem), the structure of primitive groups is more restricted. The normal 
subgroup structure was considered by O’Nan and by Scott (1980), who proposed 
this as a means for using the classification of finite simple groups (which was immi- 
nent when they wrote) to study maximal subgroups of symmetric groups. We will 
see that it does this and much more. In fact, their result is only a little stronger 
than was obtained over a century earlier by Jordan (1861). I will give the result in 
a fairly combinatorial form. To state it, a little terminology is required. 

The socle of a finite group is the product of its minimal normal subgroups. (Each 
minimal normal subgroup is itself the direct product of isomorphic simple groups.) 
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A power structure on a set 2 (also called a hypercube or Hamming scheme) is 
a bijection between £2 and the set F4 of all functions from A to IF’, for some sets 
I and A, neither of which is a singleton. If H and K are permutation groups on 
lr and A respectively, then the wreath product H WrK defined previously has a 
natural action on J. (If F and A are finite, say || =m, |4| =k, then 4 can 
be identified with the set of all k-tuples of symbols from the “alphabet” I’. Now 
the factors of N act on the symbols in each coordinate, while K° permutes the 
coordinates. This is called the product action of the wreath product.) It can be 
shown that a group G preserving a power structure can be embedded in H WrK 
so that its actions on Q and on F are isomorphic, where H and K are obtained 
from the action of G. Furthermore, H Wr K acts primitively on I+ if and only if 
fT is primitive but not regular on IF and K is transitive on A. 

We call a primitive group basic if there is no power structure on 2 preserved by 
G. So we have a “reduction” for non-basic groups similar to that for imprimitive 
groups. 

Next, we define some special classes of permutation groups. 

Let V be a finite vector space, H a group of linear transformations of V. Let G 
be the group 


{ur vA+¢c:A€H,ceV} 


of permutations of V. It is a semi-direct product of the translation group of V by 
H. Then G is transitive on V; it is primitive if and only if H is an irreducible linear 
group; and it is basic if and only if H is a primitive linear group (that is, preserves 
no non-trivial direct sum decomposition of V). We call G a group of affine type. 
If H is the general linear group GL(n,q) (all non-singular linear transformations), 
then G is the affine general linear group AGL(n,q). (Here V is n-dimensional 
over GF(q).) 

Let 7 be a non-abelian finite simple group, G the direct product of k copies of 
T, and H the diagonal of G (the subgroup {(t,t,...,f):£€ T}). Let G act on the 
coset space H\G. The normaliser of G in the symmetric group is generated by G 
together with 

(i) the automorphism group Aut(7) of 7, acting in the same way on each factor 
of G (note that inner automorphisms are induced by elements of H); and 

(ii) the group of permutations of the factors of G. 

Any group lying between H and its normaliser is said to be of diagonal type. 

A group G is called almost simple if it has a minimal normal subgroup T which 
is a non-abelian simple group and has trivial centralizer in G (equivalently, if 


T <G<Aut(T) 


for some non-abelian simple group T). 
The O’Nan-Scott theorem can be stated in two parts as follows. 


Theorem 3.1. A basic primitive permutation group is either of affine or diagonal 
type, or almost simple. 
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Theorem 3.2. Let G be a primitive group preserving a power structure 4, and 
embedded in H Wr K in the natural way. Let S be the socle of G. Then § = N*, 
where k =|A| and N is either the socle or a regular minimal normal subgroup of 


Several proofs have appeared, though some of the early versions have a small 
error — the second case in Theorem 3.2, where S is a non-abelian regular normal 
subgroup of G, is omitted. See Scott (1980), Cameron (1981), Aschbacher and 
Scott (1985), Kovacs (1986), Liebeck et al. (1988). 

A corollary of the theorem, which is easily proved directly (and was known 
much earlier), is the following. 


Corollary 3.3. A primitive permutation group has at most two minimal normal 
subgroups; if there are two, then they are regular and non-abelian. 


4. Finite simple groups 


In 1980, the classification of the finite simple groups was completed (apart from 
a couple of details about some of the groups which were resolved subsequently). 
This result has probably the longest proof of any single theorem, and hundreds of 
mathematicians contributed to its achievement. The techniques needed to apply 
the theorem, however, have little to do with those in its proof, and it is the former 
that concern us here. 

The result is as follows. 


Theorem 4.1 A finite simple group is one of the following: 
(i) a cyclic group of prime order, 
(ii) an alternating group An, n> 5; 
(iii) a group of Lie type over a finite field; 
(iv) one of 26 “sporadic” groups. 


For an overview of the proof, see Gorenstein (1982). 

I will now attempt to give a brief description of the groups in the conclusion of 
the theorem. The cyclic groups of prime order require no comment. 

The alternating group A, consists of all even permutations of {1,...,n}. Its 
simplicity for all n > 5 was known to Galois, as an ingredient in his proof of the 
‘unsolvability by radicals of the general polynomial equation. 

Groups of Lie type form the most important class, and are also the most difficult 
to define. They can be approached in different ways. A large subclass consists of 
the Chevalley groups, defined in a uniform way by Chevalley (1955), roughly as 
follows. For each of the simple Lie algebras over C (of types An, n 21; B,,n 2 2; 
Cy, n 23; Dy, n > 4; Es, Er, Eg, Fy and Gz), it is possible to choose a Chevalley 
basis, relative to which the multiplication constants are integers. Then the matrices 
of the adjoint mappings corresponding to these basis vectors have entries which are 
polynomials with integer coefficients. These matrices can then be used to define a 
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group over any field K, the Chevalley group over K. Twisted groups, modifications 
of the Chevalley groups, were defined by Steinberg (1959) and Ree (1961). They 
exist whenever the Dynkin diagram of the Lie algebra has an automorphism, and 
the field has an automorphism of the same order; for diagrams with multiple bonds, 
the characteristic of the field is restricted as well. The types are 7Ay,?Dn,°Dy,"E6 
(Steinberg), ?F4,7G2 (Ree), 7B) (Suzuki); the superscript denotes the order of the 
graph automorphism. For further discussion, a good general reference is Carter 
(1972). See also chapter 13 on buildings. 

However, for everyday purposes, a different approach is often more useful, in- 
volving a different subdivision. The first class consists of the classical groups. These 
are the Chevalley and most twisted groups associated with the Lie algebras Ay. Bn, 
C, and D,,. They have been identified with the projective special linear, symplectic, 
orthogonal and unitary groups. (Take a finite-dimensional vector space V, possibly 
carrying a non-degenerate sesquilinear or quadratic form f; let G be the group of 
all invertible linear transformations of V, or all those which are isometries of f; 
in all but a few small cases, G has a unique non-abelian composition factor.) See 
Dickson (1900), Dieudonné (1963) or Artin (1957) for these groups, and Carter 
(1972) for their identification with groups of Lie type. 

One particularly important class is that consisting of the projective special lin- 
ear groups PSL(d,q) = SL(d,q)/Z, where SL(d,q) is the group of d x d matri- 
ces of determinant ! over GF(q), and Z the subgroup of SL(d,q) consisting of 
scalar matrices. In Lie notation. PSL(7 + 1,g) = A,,(q). It is known that PSL(d, q) 
is simple except in the cases (d,q) = (2,2) or (2,3). (In fact some of these groups 
occur in other guises: PSL(2,2) = $3, PSL(2,3) = Ag, PSL(2,4) = PSL(2,5) = As, 
PSL(3,2) = PSL(2,7), and PSL(4,2) = Ag.) The group PSL(d,q) is a permutation 
group on the set @ of 1-dimensional subspaces of the d-dimensional vector space 
over GF(q) (that is, the points of the projective space — see chapter 13), since Z 
is the kernel of the action of SL(d,q) on this set. We have 


(q4 — 1)(q" —q)---(q?-q4 ') 
PSL(d,q)| = ; 
IPSL(4.q)) (q—1(q—1.d) 


which is roughly a polynomial in q of degree d? — 1. 

(As a matter of notation, GL refers to the general linear group of all invertible 
matrices; replacing G by S$ denotes determinant 1; replacing G or S by I or & 
denotes adjoining field automorphisms; prefacing by P denotes taking the quotient 
by the group of scalar matrices; and prefacing with A denotes taking the semi- 
direct product with the translation group of the vector space.) 

classical groups are C,,(q) (the symplectic group PSp(2n,q)), “An(q?) (the uni- 
tary group PSU(n +1,q)), and D,,(q), Ba(q) and *D,(q7) (the orthogonal groups 
PO? (2n, g), PA(2n + 1,g) and PQ” (2n,q)). 

The remaining groups of Lie type are referred to as the exceptional groups. The 
Most significant fact about them is that an exceptional group over a field K has a 
matrix representation over K of bounded degree; and so, if K = GF(q), its order 
is bounded by a fixed polynomial in g. 
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Table 4.1. 
The finite groups of Lie type 
Group Condition N d 
Chevalley groups: 
An(q) 21 grr TT" (git! -1) (n+1,q~-1) 
Bnlqg)  n>2 a” TT, (@ -1) (2,4 -1) 
Calg)  n>3 a” Tq -) (2,9 - 1) 
q odd 

Dig) 224 gh Deg" —1) TP -1) (4,q" —1) 
E6() q(ql2 — 1)? - 11148 - 1)(48 - 1095 - 1g? ~ 1) (3,9 -1) 
E(q) g®3(q'8 — 1)¢q!4 — 1)(g!2 — 1)¢g!9 — 1)¢48 — 1g — 1)(g?2 1) (2,9 -1) 
Ex(q) }20(q30 — 1)(q74 — 1)(q?0 — 1)(q!8 — 1)(q"4 - 1) 1 

x (gl? — 1)(q8 - 1)(q? - 1) 1 
F,(q) 474 (gl? — 1)(g8 — 1)(48 ~ 1)(q? - 1) 1 
G,(q) 4®(q® — 1)(q? —1) 1 
Steinberg groups: 
24,(42) n>2 grlrtl)/2 Ths (qi*) _ (-1)*1) (n+1,q4+1) 
Dig?) n>4 —— gh-N(g" +1) TP) @® - 1) (4,q" +1) 
*E6(q7) 4>0(qi2 — 1)¢4? + 1)(g® — 1)(g6 — 1)(45 + 1)(4? - 1) (3,4 +1) 
3D4(q°) g!2(g8 + q* + 1)(q® — 1)(q? - 1) 1 
Suzuki and Ree groups: 
2Bo(q) gq = 2m! g2(q2 + 1)(g-1) 1 
2G,(g) g= 3m! BQ +IN(q-L 1 
Faq) q = 2241 gl2(g6 4 1)(94 - 1)(43 + 1)(9 - 1) 1 


Here n is a positive integer, mm a non-negative integer, g a prime power. The group occurs as a quotient 
of a group of order N by a central subgroup of order d (and so has order N/d). All groups in the table 
are simple except for A,(2) = $3, A,(3) = Aq, Bz(2) = Sg, Go(2) (which has a subgroup of index 2 
isomorphic to 2A,(32)), 74, (27) (a Frobenius group of order 72), 2B,(2) (a Frobenius group of order 
20), 2G, (3) (which has a subgroup of index 3 isomorphic to A,(2)), and 2F,(2) (which has a simple 
subgroup of index 2, the Tits group, not occurring elsewhere in the table). Other coincidences between 
groups of Lie type and alternating groups are A,(22) & A,(5) & As, Ay(7) = Ag(2), Ay (32) & Ag, 
Ag(2) © Ag, Bo(3) © 7A3(2?). 

Note that C,(q) is defined for n = 2 and for q even but is isomorphic to B,(q) in these cases, 
similarly D3(q) and 2D3(q?) are isomorphic to A3(q) and 243(97) respectively. 

Some of these groups are isomorphic to classical groups, as follows: A,(q) = PSL(# + 1,q), 
_ Ba(q) * PA(2n + 1,9), Calg) & PSp(2n,q), Dn(g) = PM*(2n,q), 7An(q?) & PSU(A + 1,4), "Dn (q?) = 
PQ" (2n,q). Other common names are Sz(q) for the Suzuki group 2Bo(q), and R;(q) and R>(q) for 
the Ree groups 2G5(q) and 2F4(q) respectively. 


The groups of Lie type are listed in table 4.1. 

No uniform description of the sporadic groups is known; it is necessary to give 26 
separate constructions. Each sporadic group is the unique non-abelian composition 
factor in the automorphism group of some rather special combinatorial or algebraic 
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object (design, graph, code, lattice or non-associative algebra). We refer to the 
“ATLAS of Finite Groups” (Conway et al. 1985) for more details and references. 
The sporadic groups are listed in table 4.2. 


Table 4.2. 

The sporadic simple groups 
Group Name Order 
Mi Mathieu 24.32.5141 
Mi. Mathicu 26.33.51 
Mp9 Mathieu 27.32.5.7-11 
My; Mathieu 27 .32.§.7-11-23 
My, Mathieu 2!0,33.5.7. 11-23 
J, Janko 23.3-5-7-11-19 
Jy Hall-Janko 27 .33.52.7 
Jy Janko 27.35 -5.17-19 
J4 Janko 22! 33.5.7. 119.23.29-31-37-43 
Co, Conway 221. 39.54.72.11. 13-23 
Co5 Conway 2'8.36.53.7. 11-23 
Co3 Conway 2'0.37.53.7.11.23 
Fiz, ‘Fischer 2'7.39.52.7.41-13 
Fi>, ‘Fischer a!8. 313 52.7. 11-13-17- 23 
Fi,, Fischer 221 316.52. 73.11. 13-17-23.29 
Fy Fischer-Griess 246. 320 59. 76.112. 133.17. 19-23-29 -31-41 47-59-71 
F5 Fischer 24l 313.56. 72.4).13.17-19- 23-31-47 
Fy Thompson 2! 310.59. 72. 13.19.31 
Fs Harada-Norton 2/4. 36. 56.7. 11-19 
HS Higman-Sims 29-32. . 53... 7-- 11 
Mc McLaughlin 27.36.53.7-4) 
Suz Suzuki 2!3..37.52.7. 11.13 
Ly Lyons 28 .37..56.7.41-31-37-67 
He Held 210.33. 52.99.17 
Ru Rudvalis 2'4.33.53.7.13.29 
O'N —_O'Nan 27.34.5.73- 11-19-31 


Some of these groups have alternative names: for example. Co, =n (7 = 1,2,3): Fi, = M(n) (4 = 
22,23, 24). fy = FG=M (the friendly giant or monster), Fy = BM = B (the baby monster), F3 = Th: 
Fs = HN: Mc = McL: Jy = HI: ete. 


The main information we need about these groups is their maximal subgroups; 
or, More precisely (in view of the O’Nan—Scott theorem), maximal subgroups of © 
groups G with T <G < Aut(T) with T simple. The O’Nan-Scott theorem was 
first used for this very purpose in connection with the symmetric and alternating 
groups (an application of the so-called “bootstrap principle”); see also Aschbacher 
(1984), Kleidman and Liebeck (1988), and the ATLAS. 
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5. Applications of the classification 


5.1, Multiple transitivity and small rank 


Let t be a positive integer not exceeding ||. We say that G acts t-transitively on 
2 if, given any two t-tuples (a1,...,a,;) and (f),...,8,) of distinct elements of 2, 
there is an element g € G carrying @; to f; fori =1,...,t. Note that t-transitivity 
implies (¢ — 1)-transitivity; the condition gets stronger as ¢ increases. The symmetric 
and alternating groups of degree n are n- and (n — 2)-transitive, respectively. 


Theorem 5.1. Ail finite 2-transitive groups are known. In particular, the only 6- 
transitive groups are the symmetric and alternating groups. 


Table 5.1 gives the list of 2-transitive groups. 


Table 5.1(a) 
Finite 2-transitive groups with abelian socle 


Degree H=Go Condition No. of actions 

qt SL(d,q) < H <TL(d,q) up to q if g even, d=2 
2 if (d,q) = (3,2) 

q4 Sp(d.g) IH d>2 up to q if q even 

g® G2(q) AH q even up to g 

q 2!+23 =SL2,3)4H g = 52,77, 112,232 1 

q 24g H q= 34 ] 

q SL(2,5) 1H q = 11,192,292, s92 1 

2 AG 2 

24 Az 1 

28 PSU(3, 3) 2 

36 SL(2, 13) 1 


The degree x is a prime power p’", and the socle is elementary abelian of order » and regular. The 
stabiliser Gg = H of the origin is a subgroup of GL(m, p). 21+2k denotes an extraspecial group of this 
order. 

The maximum degree of transitivity is 2 in all cases except the first with ¢ = 2 (4-transitive if d = 2, 
3-transitive if d > 2) or (d,q) = (1,3) or (1,4) (3- or 4-transitive respectively), and the case n = 24, 
H = Az (3-transitive). In the first, fourth, fifth and sixth rows of the table, the conditions are not 
sufficient for 2-transitivity of G. For example, in the case n = 34, P = 2!+4 <1 H, we have H/P < Ss, 
and G is 2-transitive if and only if H/P contains an element of order 5. For precise conditions in all 

cases, see Huppert (1957), Hering (1985). 


The number of actions is the order of the cohomology group H! (Gp, N). 


In outline, the proof runs as follows. It was shown by Burnside (1911, p. 202), 
that a 2-transitive group G has a unique minimal normal subgroup N, which is 
either elementary abelian and regular, or primitive and simple. (This can be seen 
using the O’Nan-Scott theorem as follows. First, G is certainly primitive, since it 
preserves no non-trivial binary relations. Similarly, it preserves no power structure. 
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Table 5.1(b) 
Finite 2-transitive groups with non-abelian sucle 
n Condition N max|G/N|_min(?) max(?) No. of actions 
n nes Ay 2 n—-2 n 2ifn=6 
1 otherwise 
q@ ct dz2 PSL(d.g) (d.qg-—Ue 3ifd=2,geven 3ifd=2 2ifd>2 
i (d.g) # (2,2), 2 otherwise 2 otherwise | otherwise 
(2.3) 

22-1 424-1 a3 Sp(2d.2) | 2 2 1 

224 1_2¢-l d > 3 Sp(2d.2) 1 2 2 1 

| q23 PSU(3.q@) G.qtHe 2 2 1 

q+l q = 2241» 2 Sz(q) 2d + 2 2 1 

got q =! > 3 Rig) 2d +1 2 2 | 

11 PSL(2.11) 1 2 2 2 

i My, 1 4 4 1 

12 My t 3 3 1 

12 My> l 5 5 2 

15 A; 1 2 2 2 

22 M9 2 3 3 

23 M3 I 4 4 1 

24 Moy I 5 5 ) 

28 PSL(2,8) 3 1 2 1 

176 HS I 2 2 2 

276 Co, 1 2 2 1 


Here g is a prime power q = p*; 1 and d are positive integers, n being the degree. N is the sacle of 
G. The next column gives the index of N in the largest possible G, viz. its normaliscr in §,. Groups 
with the same socle may have different degrees of transitivity; the least and greatest such degrees are 
given. Note that the socle is 2-transitive in all cases except one. The final column gives the number of 
non-isomorphic actions. (For example, for PSL(d,q), ¢ > 2, these actions are on points and hyperplanes 
of the projective space.) 


It is straightforward to check that no diagonal group can be 2-transitive. So G is 
either of affine type or almost simple. In the latter case, if the simple socle NV of 
G is imprimitive, then a geometric argument shows that G acts on a linear space 
(whose lines are the G-translates of an N-block) admitting a parallelism, and N 
fixes all the parallcl classes, and hence that the stabiliser of two points in N is 
trivial. A theorem of Frobenius now shows that N has a regular normal subgroup, 
which is impossible.) 

Suppose that G is of affine type. Then the linear group H = Gy acts transitively 
on the non-zero vectors of the vector space V. Huppert (1957) determined all 
solvable linear groups with this property. Non-solvable groups were considered by 
Hering (1985), who showed that there can be at most one non-abelian composition 
factor, and then considered the possible simple groups to establish which could 
occur in this situation. 

Suppose that G is almost simple, that is, T < G < Aut(T) for some simple group 
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T. The proof is completed by considering the possibilities for T. As an illustration, 
I will discuss the case where T is an alternating group A,,, m 2 5. Since Aut(A,,) = 
Sm form # 6, we may (by treating one case separately) assume that G = S,, or G = 
Am. Suppose that G = S,,. Then G contains a conjugacy class € of 1/2m(m — 1) 
“transpositions” (in its natural action of degree m; we do not know how they act 
in the unknown action!). A point @ is exchanged with each further point B by 
equally many members of €; so n —1 < 1/2m(m — 1). On the other hand, G, is 
a subgroup of index n in G. There are classical lower bounds (in terms of mm) 
for the index of a subgroup of S,,. Comparing these bounds, we find that either 
the unknown permutation representation coincides with the known one, or m is 
bounded. Then only a finite amount of work remains. (The lower bounds referred 
to have a long history. It is straightforward to write down the index of a maximal 
intransitive or imprimitive subgroup of S,,. For primitive subgroups, a result more 
than good enough for this application has been available since the turn of the 
century; but it has been dramatically improved by use of the classification, as we 
will see later. This is another instance of the “bootstrap principle”.) If G = A,»,, we 
use the class of 3-cycles instead. This argument is due to Maillet (1895). 

Suppose that G is a group of Lie type. The pattern of argument is similar; there 
are a distinguished small conjugacy class (consisting of transvcctions in the case of 
most classical groups), and a lower bound for the index of a subgroup. However, 
the argument can be shortened by using some knowledge of the structure and 
character theory of the groups of Lie type. This was done by Howlett (1974) and 
Curtis et al. (1976). 

Finally, for the sporadic groups, the problem is a finite one. 

For t > 2, the t-transitive groups form a subclass of the 2-transitive groups, which 
is casily determined by inspection. 


Theorem 5.2. Let G be t-transitive of degree n, but not S, or An. 
(i) If t=5 then G is a Mathieu group M,2 or Mo (n = 12,24). 
(ii) If t =4, then either G is one of the above, or G is My, or M23 (n = 11,23). 
(iii) If t = 3, then either G is one of the above, or G is AGL(d,2) (n= 27), orG 
is Vig.Az (n = 16), or G is My, (n = 12), or G is Mz) or Aut(M22) (n = 22), or 


PSL(2,q) < G < PIL(2,q) 
(2 =q+1, g a prime power). 


The Mathieu groups M,, n= 11,12,22,23,24 were the first sporadic simple 
groups to be discovered. They are closely connected to many important combi- 
natorial configurations such as designs, geometries, codes and Hadamard matrices 
(see chapter 14). 

The result has obvious applications in the classification of designs, codes, etc. 
with 2-transitive automorphism groups. Sometimes, the 2-transitivity arises natu- 
rally and does not have to be assumed. To take an example: A Jordan group is 
a permutation group containing a subgroup fixing at least one point and acting 
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transitively on the points it does not fix; it is proper if it is not (¢ + 1)-transitive, 
where ¢ is the number of points fixed by the subgroup. Jordan (1871) showed that 
a primitive Jordan group is 2-transitive. The argument has a geometric interpre- 
tation. A Jordan group is an automorphism group of a matroid, whose flats are 
the fixed point sets of subgroups with the property of the definition. Now if the 
group is primitive, then the matroid is geometric (flats of rank 1 have cardinality 
1), whence the 2-transitivity is clear. See chapter 9 for more about matroids. From 
Theorem 5.1, the complete classification of proper Jordan groups follows. 


Theorem 5.3. A proper primitive Jordan group G satisfies one of the following: 
(i) PSL(d,q) < G < PIL(d,q),d > 3; 
(ii) ASL(d,q) < G < ATL(d,q),d 22; 
(ili) G= M2, Aut(M22), M3, or Mpa; 
(iv) G= Ay, (degree 15) or Vig.Az (degree 16). 


In particular, Jordan groups of sufficiently large rank are necessarily projective 
or affine groups. The latter assertion has important applications in model the- 
ory, in the work of Cherlin et al. (1985) on Xy-categorical, w-stable theories. It 
has also been given a purely combinatorial proof (not using the classification) by 
Evans (1986) and Zi]’ber (1988). (These “elementary” proofs were found after the 
classification, despite much earlier effort in this direction.) 

Another application is Kantor’s (1985c) determination of the 2-transitive linear 
spaces and basis-transitive matroids. 

Next we weaken the assumption of 2-transitivity. The rank of a transitive per- 
mutation group G on 22 is the number of orbits of G on 2?. The diagonal is always 
an orbit; so the rank of a group of degree greater than 1 is at least 2, with equality 
if and only if it is 2-transitive. 


Theorem 5.4. 

(i) All primitive permutation groups of rank 3 are known. 

(ii) For any given r, all but finitely many primitive almost simple permutation 
groups of rank r are known. 


Part (i) was completed by Liebeck (1987), building on work of Bannai (1972), 
Kantor and Liebler (1982), and Liebeck and Saxl (1986). The strategy is similar to 
that for 2-transitive groups. Part (ii) is due to Kantor (1979) using a result of Seitz 
(1974). 


Theorem 5.5. All almost simple primitive permutation groups of odd degree are 
known. 


This is due to Kantor (1987) and Liebeck and Saxl (1985). Note that if G is primi- 
tive of odd degree but not basic, then G < H Wr K, where H is basic of odd degree; 
and no group of diagonal type has odd degree (since, by the Feit-Thompson 1963 
theorem, non-abelian simple groups have even order). The unknown factor here 
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is thus the list of groups of affine type; determining this is equivalent to finding all 
irreducible linear groups in odd characteristic (and so somewhat out of reach yet.) 

A result similar in spirit to Theorem 5.4(ii) was proved by Kantor et al. (1989), 
also in the context of model theory. 


Theorem 5.6. There is a function f on the natural numbers, tending to infinity with 
n, with the property that any primitive group of degree n having fewer than f(n) 
orbits on S-tuples of points is explicitly known. 


5.2. Degree and order 


It was known to Mathieu (1861, see the footnote on p. 274) that 2-transitive groups 
of degree n (other than S,, and A,) exist for 5 <n < 33. However, this pattern is 
not typical. 


Theorem 5.7. For almost all n, the only primitive groups of degree n are S, and 
A,. More precisely, if 


€ = {n: there exists a primitive group of degree n, not S, or An}, 
and e(x) = |€M[1,x]], then 

e(x) =2m(x)+ (1 + V2)x!/? + O(x'/2/log x), 
where a(x) is the number of primes not exceeding x. 


This is due to Cameron et al. (1982). In outline: degrees of non-basic groups arc 
proper powers, of which squares contribute x!/? and the others O(x'/3**). Degrees 
of groups of affine type are prime powers, so either prime (contributing a(x)) or 
proper powers. Degrees of diagonal groups are either orders of simple groups or 
proper powers. For the density of the former we turn to the classification. There 
are O(log x) alternating groups, o(x!/logx) groups of Lie type (since the orders 
are roughly polynomials of degree at least 3 in a prime power argument), and 
O(1) sporadic groups, of order less than x. The most difficult estimation is that 
of almost simple groups. The crucial results are the lower bounds for indices of 
subgroups; these show that not too many simple groups can contribute to e(x). 
The number of values contributed by 7 is at most the number of divisors of |7], 
_by Lagrange’s theorem. It turns out that the dominant contributions are a(x — 1) 
(from PSL(2, p), in its representation of degree p +1, p prime) and (2x)'/? (from 
the triangular numbers (5), the degree of S, acting on pairs). The asymptotic 
expansion could be continued to higher precision. 

I have already mentioned classical results asserting that primitive groups of de- 
gree n cannot be too large. The best bound obtained without using the classification 
is n*V"l08” for primitive, not 2-transitive groups, by Babai (1981). (The 2-transitive 
groups are now known to be much smaller, at most °'°2". The original “elemen- 
tary” result of Babai (1982) was much weaker; but Pyber (1993a) has proved a 


628 P.J. Cameron 


bound n¢!8'* by elementary means.) However, using the classification, one can 
show the following. 


Theorem 5.8. With an infinite family and finitely many more exceptions, a primitive 
group of degree n has order at most n'°8", where c is a constant. 


The infinitely many exceptions satisfy 
(Am)! < G < Sin Wr S; 


with the product action, where S,, acts on k-sets, kK and / are bounded and m arbi- 
trary. By allowing further exceptions (among basic groups, just symmetric groups 
acting on partitions of fixed shape, classical groups acting on an orbit of subspaces 
in their natural modules, and certain groups of affine type), the bound can be im- 
proved to n¢!-e!°8", For almost simple groups, the order is at most n° with known 
exceptions. For applications of these methods to graph isomorphism problem, see 
chapter 27. 

An up-to-date survey of “asymptotic permutation group theory” is given by 
Pyber (1993b). 


5.3. Fixed-point-free elements 


It is elementary that a transitive finite group with degree greater than 1 contains an 
element with no fixed points. (This can be seen from the Orbit-counting lemma 2.3: 
the average number of fixed points is 1, and the identity fixes more than one point, 
so some element fixes less than one.) The next result, then, is not unexpected. 


Proposition 5.9. A transitive group of degree greater than 1 has a fixed-point-free 
element of prime power order. 


There are three surprising things about this result. First, that it is so difficult. 
The proof (Fein et al. 1981) involves first reducing the problem to consideration 
of primitive simple groups, and going through the list of simple groups, using quite 
detailed information about each class. Second is the applicability of the result (to 
a question about relative Brauer groups of finite extensions of global fields). Third 
is the fact that the following plausible extension is still open. 


Conjecture. For any prime p, there is a function f, such that a transitive group 
of degree n, where n = p*-b and a > f,(b), contains a fixed-point-free element of 
p-power order. 


As well as several applications in number theory, this problem has arisen in 
game theory (Isbell 1960) and extremal set theory (Cameron et al. 1989a). 


5.4. Distance-transitive graphs 


Cameron et al. (1983) confirmed a conjecture of Sims by showing the following. 
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Theorem 5.10. There is a function f such that, if G is primitive and G, has an orbit 
(other than {a}) of length d, then |G.| < f(d). 


(Of course, the group induced by G, on the orbit has order at most d! ; the 
point is that the kernel of this action has bounded order.) 

Using this, they showed the following. A connected graph is distance-transitive if 
its automorphism group acts transitively on ordered pairs of vertices at any given 
distance. 


Theorem 5.11. There are only finitely many finite distance-transitive graphs of given 
valency d > 2. 


Proof (sketch). Using techniques of Smith (1971), one reduces to the case where 
the automorphism group acts primitively. By the Sims conjecture, the number of 
vertices at given distance from a fixed vertex is bounded (all these sets are orbits 
of the vertex stabiliser). By the Compactness theorem of first-order logic, an infi- 
nite distance-transitive graph with the same properties exists. But this contradicts 
Macpherson’s (1982) classification of these graphs. O 


Further work on distance-transitive graphs has concentrated on applying meth- 
ods of group theory to the full automorphism group (rather than to the point 
stabiliser as above). Smith’s result cited above asserts that any (finite) imprimitive 
distance-transitive graph other than a cycle is either a “bipartite double” or an 
“antipodal cover” of a smaller distance-transitive graph; in at most two such re- 
ductions, we reach a primitive graph. So, naturally, most attention has been given 
to this case. Application of the O’Nan-Scott theorem (with some further work) 
reduces to the affine and almost simple cases. (The Hamming graphs, whose au- 
tomorphism groups preserve an obvious power structure, are characterised along 
the way. See Praeger et al. 1987.) Then these cases must be dealt with — this is 
the hard part! At the time of writing, it appears that the classification of primitive 
distance-transitive graphs is almost complete. Brouwer et al. (1989) is the standard 
reference on distance-transitive graphs, which are also discussed in chapter 15. 

Closely related is the concept of arc-transitivity. A graph is s-arc transitive if 
its automorphism group acts transitively on the set of arcs (walks with no two 
consecutive edges equal) of length s. Weiss (1981) showed the following. 


Theorem 5,12. A finite graph (other than a cycle) cannot be more than 7-arc tran- 
Silive. 


By contrast, a countable regular tree is s-arc transitive for all s. There are close 


connections with the flourishing theory of group amalgams. It is also known that 
finite s-arc-transitive digraphs of valency d exist for every s and d (Praeger 1989). 


6. Characters and configurations 


The conclusion of the orbit-counting lemma can be phrased in terms of permuta- 
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tion characters. Further information can be obtained by considering other actions, 
especially on (27. A combinatorial version of these matters is Higman’s theory of 
“coherent configurations”. General references on character theory are Serre (1977) 
and Curtis and Reiner (1962); for a survey of Higman’s ideas, see Higman (1975). 

A matrix representation M of a group G is a homomorphism from G to the 
group of invertible x m matrices over a field. I will always take this field to be 
C. but important results are obtained by considering subfields of C (such as R or 
(2), p-adic fields, or fields of non-zero characteristic. 

The character of G associated with a representation is the function 


xX(g) = trace(M (g)). 


It can be shown that two representations are similar if and only if their characters 
are equal. 

A representation M is irreducible if the space V(n,€) has no proper subspace 
invariant under {M(g): g € G}; it is indecomposable if it is not the direct sum of 
two non-trivial invariant subspaces. Maschke’s theorem asserts that any invariant 
subspace has an invariant complement, so these concepts coincide. (This is true 
generally over fields of characteristic zero, or whose characteristic does not divide 
the group order.) We call a character irreducible if the corresponding representation 
is. 

Any character of G is constant on the conjugacy classes of G. The vector space 
of class functions (functions from G to C which are constant on the conjugacy 
classes) admits an inner product given by 


Vif) = ai So file) Ale). 


gcG 


Theorem 6.1. 

(i) The irreducible characters form an orthonormal basis for the space of 
complex-valued class functions (with respect to the above inner product). 

(ii) The number of irreducible characters of G is equal to the number of conju- 
gacy classes. 


(ili) The sum of squares of the degrees of the irreducible characters is equal to 
|G|. 


Note that arbitrary characters are linear combinations of irreducible characters 
with non-negative integer coefficients. 

The character table of a group G is the matrix whose (/,/) entry is equal to the 
value of the ith character on elements of the jth conjugacy class. It is a square 
matrix which is orthogonal (with respect to suitable weight functions on the classes . 
and characters). 

Any group has a principal character, the identically-1 function, which is clearly 
irreducible. Also, if G acts on 2, then G has a matrix representation by permu- 
tation matrices. Now the trace of a permutation matrix is the number of 1s on 
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the diagonal, that is, the number of fixed points. Combining all these remarks, we 
obtain the following translation of the Orbit-counting lemma 1.3, 


Theorem 6.2. Let the finite group G act on the finite set N. Then the number of orbits 
of G is equal to the inner product of the permutation character with the principal 
character, that is, to the multiplicity of the principal character in the permutation 
character. 


This observation allows techniques of character theory to be used. It also has 
some combinatorial consequences. One consequence is Block’s lemma (see chapter 
31): If G is a group of automorphisms of an incidence structure whose incidence 
matrix has rank equal to the number of rows, then it can be shown that the 
permutation character of G on points is contained in the permutation character 
on blocks; taking inner product with the principal character, we see that G has at 
least as many block-orbits as point-orbits. 

For another example, I give an argument due to O’Nan (1985). A set S of 
permutations of 2 is called uniformly transitive if, for any a, 8 € Q, the number 
of elements g € 5S satisfying ag = f is a constant, say A. If so, then we have A - |,Q| = 
|S|. We call S sharply transitive if it is uniformly transitive with A = 1. 


Lemma 6.3. Let G act transitively on Q; and Q). Suppose that every irreducible 
constituent of the permutation character on Qh also occurs in the permutation char- 
acter on 2),. If a subset § of G is uniformly transitive on 2), then S is uniformly 
transitive on Qh. In particular, if S is sharply transitive on Q, then |Q,| divides |Q\|. 


Corollary 6.4. For d > 2, the group PI'L(d,q) contains no sharply 2-transitive sub- 
set. 


Proof. It can be shown that the character condition applies, with 2; the set of 
ordered pairs of distinct points and Q, the set of antiflags (non-incident point- 
hyperplane pairs); but || does not divide |Q;|. O 


For an extension, see Cameron et al. (1987). 
Further results can be obtained by considering different actions of G. The basic 
observation is the following. 


Lemma 6.5. Let the group G act transitively on sets [ and 4. Then the numbers of 
_ G-orbits on F x A and of G,-orbits on A (y € I) are equal, and each is the inner 
product of the permutation characters on T and A. 
Proof. A bijection between the specified orbits is given by 

Or O(y) = {8: (7,8) € O}, 


for a G-orbit O on I x A. If wp and 7, are the permutation characters on I and 
A, then a7, is the permutation character on I x A; and we have 


(arta, 1) = (ar, 72) 
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where 1 is the principal character. O 
The following illustration of this result is due to Livingstone and Wagner (1965). 


Corollary 6.6. if k <1 <n/2, then a permutation group of degree n has at least as 
many orbits on I-sets as on k-sets. 


For, under these hypotheses, if 2, and Q, are the sets of k-sets and /-sets re- 
spectively, then the symmetric group S,, has k + 1 orbits on 2, x 9; then an easy 
induction shows that the permutation character of S, on Q, is the sum of k +1 
irreducible characters, all of which appear in the character on 2),. So the permu- 
tation character on k-sets is contained in that on /-sets. Now the proof proceeds 
as in Block’s lemma. 


Corollary 6.7. If G is transitive on Q, then the rank of G is the inner product of 
the permutation character with itself. 


This is immediate from the definition of rank as the number of orbits of the 
stabiliser of a point. In particular, G is 2-transitive if and only if the permutation 
character is the sum of the principal character and one further irreducible. 

It is true in general that (77,7) is the number of G-orbits on 2? (though, if 
G is intransitive, this number cannot be equated with the number of G,-orbits). 
D. G. Higman, in a number of papers dating from the mid-1960s, has developed 
a combinatorial setting for this observation. Regard each orbit O; of G on 2? as 
a binary relation R; on 2. Now: 

(i) the given relations partition 22; 

(ii) a subset of the given relations partitions the diagonal of 07; 

(iii) the set of relations is closed under taking converses; 

(iv) for relations R;,R;,R, and a pair (a, B) € R,, the number of points y € 2 
such that (a, y) € R; and (y, 8) € R; depends only on i, j and k, not on a@ and B. 

A family of binary relations satisfying these conditions is called a coherent con- 
figuration. These structures, which include strongly regular and distance-regular 
graphs, association schemes, symmetric and quasi-symmetric designs, and regular 
two-graphs, can be studied by representation-theoretic methods like those used 
for groups. These matters are considered further in chapter 15; see also chapter 
41, section 4. 

We will also need a generalisation of the concept of permutation characters, that 
of induced characters. Let G be a group, H a subgroup, x a character of H, afforded 
by a matrix representation VM. Now let |G: H| = 2, and let x),...,x, be right coset 
representatives for H in G. For g € G, let g denote the coset representative of g. 
Now let 2 = H\G, and a = H, so that N = {ax),...,axn}. 

As before, we consider the representation P of G by permutation matrices 
corresponding to the action on 2. Now define a new representation of G of degree 
nm as follows: g is represented by block matrices with (ax;,a@x;) block equal to O 
if the (ax,;,ax;) entry of P(g) is zero, and to M (xigx;') otherwise. (Note that, if 


Permutation groups 633 


axig = ax;, then x;g = x,;, and so the argument of M really is an element of H.) It 
is easily checked that we really have a representation of G; it is called the induced 
representation, and its character is the induced character y°. 

If x is the principal character of H, then x° is simply the permutation character 
of G on the space H\G of cosets of H. 


7. The characters of the symmetric group 


There is one family of groups whose character theory is particularly important 
to combinatorics, namely the symmetric groups. This theory has connections with 
partitions, symmetric functions, Schur functions, etc.; these matters are discussed 
further in chapter 21. General references here are Robinson (1961), James and 
Kerber (1981), and the first chapter of Macdonald (1979). 

The initial observation is the following. 


Proposition 7.1. There is a natural bijection between the characters of the symmetric 
group S,, and the partitions of n. 


Proof. Given any permutation g on 92, there is a unique decomposition of N into 
cycles of g (orbits of the group generated by g): 


g= (a, a+ -)(B; Bass -)eee. 
Since 
ho'gh = (ayh anh---)(B\h Boh: +) +, 


conjugate permutations have cycles of the same lengths, and conversely; so the 
partition of n determined by the cycles of a permutation (its cycle structure) char- 
acterises it up to conjugacy. DO 


Hence the number of conjugacy classes is equal to the number p() of partitions 
of n. This is a well-studied function; see chapter 21. 

In view of the preceding section, we know that the irreducible characters of S, 
are also bijective with the partitions of n. I will give a brief description of this 
bijection, and mention a few consequences. 

A partition of n is represented by a Ferrers diagram: if the partition A has c 
parts with lengths /,,...,/,, with /, >--- >, the diagram for A has c rows with /; 
cells in the ith row, aligned on the left. 

Corresponding to a partition as above, there is a well-defined conjugacy class of 
subgroups of S,, one for each partition of 2 with parts of the appropriate sizes, 
consisting of all permutations fixing every part of the partition. These are called 
Young subgroups. We use S* to denote a typical Young subgroup corresponding 
to the partition A. We will be interested in the action of S, on all partitions of 1 of 
the appropriate shape (with labelled parts); the corresponding Young subgroups 
are just the stabilisers in this action. 
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Now we define characters 7 and 7* of S, as follows: 

e 7 is the permutation character of S,, acting on partitions of shape A (that is, 
the character induced from the trivial character of S*); 

e + is the character of S, induced from the sign character of S* (the char- 
acter of degree 1 whose value is +1 on even permutations and —1 on odd 
permutations). 

A partition A has a dual A’, whose ith part is equal to the number of parts of 

A which exceed i. Its Ferrers diagram is the transpose of that of A. Hence A” =A 
for any partition A. 

Finally, there is a total order < on partitions, as follows: if A and yz have parts 
hi,...,d- and my,...,7q, then we write A ~ y if, for some i, we have J; = m; for 
j <i while J; > m; (where undefined parts are taken to be zero). 

Now the main result is as follows. 


Theorem 7.2. For any partition 4, there is a unique irreducible y> which is a con- 
stituent of 7 but not of a for any yw < Xr. Furthermore, x? is the unique common 
constituent (with multiplicity 1) of m* and 7°. 


For example, the first few characters 7’, with their decompositions into irre- 
ducibles, are as follows: 


a”) = y) the principal character; 
mtd) = yl) 4 yr), 


ap t- 2.2) = x” + xh) + y-22). 


gp 21,1) =x”) + 2-1) + x7 22) $ yr 2ht), 


In particular, for 2 > 2k, the permutation character a\"-kk) on k-sets is the sum 
of the irreducibles y~‘“) for i < k, in accordance with our observation in the proof 
of Corollary 6.6. 


Theorem 7.3. 

(i) The degree of 7m* is equal to the number of ways of writing the numbers 
{1,...,} into the cells of d so that the rows are strictly increasing. 

(ii) The degree of x* is equal to the number of ways of writing {1,...,n} into 
the cells of d so that the rows and columns are strictly increasing. 


The first statement holds because, if Q = {1,...,n}, then any partition of shape 
A gives rise to a tableau with increasing rows (writing the elements of the ith part 
into the ith row in increasing order. For the second, see any of the books referred 
to earlier. 

Note that, just as y* is the “intersection” of z* and 7*’, so the arrangements 
counted to obtain its degree are those satisfying both the constraint for a’ and (in 
transposed form) that for 7*’. The objects counted in Theorem 7.3(ii) are called 


A 
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standard tableaux of shape A, and their number (the degree of y*) is denoted by 

A- 
a For example, if 2 = 2m and the partition A has two parts of size m, then x4(1) is 
equal to the number of ways that votes can be cast for cach of two candidates 
in an election so that the first candidate is never behind in the count. This is a 
familiar interpretation of the mth Catalan number C,,, (see chapter 21). 

From Theorem 6.1(iii) we see that 5*(f,)” =! where the summation is over all 
partitions of n. In other words, the number of ordered pairs of standard tableaux 
of the same shape is equal to the number of permutations of {1,...,2}. A bi- 
jection between these two sets is given by the Schensted correspondence. 1n this 
correspondence, the two tableaux are equal if and only if the permutation satisfies 
g? = 1; s0 


SoA = s(n), 


where s(n) is the number of involutions on {1,...,}. (This is also a special case 
of a gencral result about characters.) 

In principle, the entries of the character table can now be calculated: the values 
of *(g) are easily established, and the matrix of the system of equations for the 
x*(g) in terms of the 7(g) is lower triangular. 

The dimension of a character y’ is defined to be n —d, where d is the largest 
part of A. Now consider partitions of n, for n > 2d with fixed d, having dimension 
d and such that the parts other than the largest constitute a fixed partition of d. It 
can be shown that the degrees of such characters are given by a polynomial in n 
of degree d, with leading term n“/d!. Moreover, Schur proved the following. 


Proposition 7.4. For n > 2d, the irreducible constituents of the permutation charac- 
ter of S, on d-tuples of distinct elements are the characters of dimension d or less. 
Moreover, a permutation group G is d-transitive if and only if the restrictions to G 
of all these characters (except the principal character) are orthogonal to the principal 
character of G. 


An application of these ideas is given by Cameron et al. (1987). 


Theorem 7.5, Let S be a set of permutations of an n-set such that the Hamming 
distances 


d(g,h) = fix(gh™’) 


for g,he S,g #h, take just d distinct values. Then S is bounded above by the sum 
of squares of the characters of S,, of dimension d or smaller. 


(This bound is a polynomial inn of degree 2d, with leading term n”4 - p(n)/(d!)? , 
where p(d) is the number of partitions of d.) The bound is not too far from best 
possible: examples of size roughly (n/2d)*“ are known. 
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8. Computing in permutation groups 


A good example to keep in mind for this section is Rubik’s cube. This celebrated 
puzzle can be thought of as a cube divided into 3 x 3 x 3 smaller cubes. In its 
initial configuration, the nine squares on each face have the same colour. Each 
face (more precisely, the set of nine small cubes with faces lying in a single face 
of the large cube) can be rotated through any multiple of 7/2. A few rotations 
will destroy the original arrangement and leave the cube in an apparently random 
configuration. The problem is: given the cube in a disordered state, decide whether 
or not the initial state can be restored by a sequence of moves (rotations of faces) 
and, if so, find a sequence which does this. We can regard the six squares at the 
centres of faces as fixed, so that the remaining 48 squares are permuted by any 
move. The six face rotations through 7/2 in the positive sense generate a group 
of permutations. The problem can now be formulated and generalized as follows. 

Input: permutations g,,...,g; of a finite set 2. 

Output: information about the group G generated by g),...,g;. (In the ex- 
ample, the required information is a membership test for arbitrary permutations; 
in other situations, we may want the order of G, whether G is simple, even a 
“name” for G, etc.) 

I will give an outline of some of the basic algorithms, up to the point where we 
have the membership test and the group order; and then summarise more quickly 
some further techniqucs. 

The first step involves finding the orbit containing a point a. This is the equiv- 
alence class containing a of the smallest equivalence relation ~ in which 6 ~ y if 
Bg; = y for some /. It can be found by essentially the same algorithm that com- 
putes connected components in a digraph. For later use, this is refined so that with 
each point 8 of the orbit comes a “witness”, a word w in the generators gj,...,25 
such that aw = B. Our discussion of coset spaces in section 2 shows that these 
witnesses form a complete set of right coset representatives for the stabiliser Gy. 
Also, the algorithm has the property that any initial segment of a witness is also a 
witness (for a point of the orbit found earlier); so instead of the actual witness, we 
only need to remember which generator was used last. This involves a considerable 
saving in space. The appropriate data structure is called a Schreier vector. 

Next, we need a set of generators for Ga. (In Rubik’s cube, once we have 
moved one square, or a set of squares, into their correct positions, we want to 
do all subsequent manipulations without disturbing these squares.) The following 
lemma is due to Schreier. 


Lemma 8.1. Let g),...,8; be generators of a group G, and x,,...,x, right coset 
representatives for a subgroup H of index t. If g denotes the coset representative of 
an element g € G, then H is generated by the ns elements 


{xig;(%iR)) |: 1 <i <t,1 <j < st. 


This principle (in words, “move a to 8, using the witness for 8; apply a generator 
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to B, moving it (say) to y; then return y to @ with the inverse of the witness for 
y”), is surely familiar to every solver of Rubik’s cube. 

The disadvantage here is the number of generators. The index of G, in G (the 
size of the orbit) may be as large as the degree n of G; so we may be left with ns 
generators for G,. As we stabilise further points, the numbers of generators will 
grow explosively, and we will quickly run out of space, even though the groups 
being considered are getting smaller. 

This is dealt with by a “filter” to reduce the number of generators. The best 
algorithm is due to Jerrum (1982). Its input is a sequence of permutations of 2, 
given one at a time “on-line”. At any stage, the output is a list of permutations of 
such that 

(i) the input and output permutations generate the same group; 

(ii) the output list contains at most n — 1 entries, where n = |(2|. 

This gives an upper bound for the number of generators we have to store; the 
output from Schreier’s algorithm can be fed straight into Jerrum’s. 

Jerrum’s filter has a theoretical consequence. 


Theorem 8.2. Any subgroup of S,, can be generated by at most n — 1 elements. 


McIver and Neumann (1987) (see also Cameron et al. 1989b) have announced 
without proof that this bound can be improved to max(2, |n/2]); this is best pos- 
sible. However, the proof does not provide an algorithm to find these generators. 

Now we proceed to the important step, the Schreier-Sims algorithm for a base 
and strong generating set. A base for a permutation group G consists of a sequence 
(a1,..., a4) of elements of 2 such that only the identity fixes a,,..., ag. Thus every 
permutation in G is uniquely determined by its effect on a base. (This means that, 
once we have a base, in subsequent computations only d values of a permutation 
need be stored, rather than n; and d is usually much smaller than n, as we shall see.) 
A base can be found by successively fixing points until the stabiliser consists of the 
identity only. Clearly we may assume that no base point is fixed by the stabiliser of 
the preceding points. (A base with this property is called irredundant.) A sequence 
g of elements of G is called a strong generating set (relative to the given base) if, for 
0 <i <d, the stabiliser of (a,,...,a;) is generated by some terminal subsequence 
of g. 

We can now find a base and strong generating set as follows: 

Start with the empty sequence of elements of 2, and the empty sequence of 
elements of G. While the generating set for G is non-empty, do the following. 
Choose a point a@ not fixed by all generators of G; adjoin it to the first list. Now 
compute the orbit of a, with witnesses; add the witnesses to the second list. Then 
compute generators for G,, and replace G by G,. 

On termination, the first list is a base, and the second is a strong generating 
set of a special kind: it has the form g = (go,...,g4-1), where g; is a list of coset 
representatives for G; in G;_1, where G; is the stabiliser of (a1,...,a,). Thus 


lgi| = |Gi-1: G]. 
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Now the membership test works as follows. Given an arbitrary permutation 
g, check first whether ag is in the G-orbit of a (using the witnesses in go). If 
not, then g ¢ G; otherwise, ag = a1gp;, for some ig, so that 88o;, € G;. Proceed 
inductively. At the conclusion, either we have shown that g ¢ G, or we have 


BBO, Bis Shi 
so that 
8 = Ba-1iz -* BO EG. 


As a consequence, we see that each element g ¢ G has a unique expression of 
the above form, so that 


|G] = |gol--- |ga—tl- 


Many further algorithms for studying a permutation group are known. It is 
possible to find centralisers and normalisers of subsets, Sylow subgroups, normal 
subgroups, composition factors, cores, test simplicity; and so on. See Butler and 
Cannon (1982), Kantor (1985b), Neumann (1987). These algorithms usually start 
with a base and strong generating set. 

Some theoretical issues are raised by this point of view. Clearly it is important 
to find as small a base as possible, and to do so quickly. Blaha (1992) has shown 
the following. 


Theorem 8.3. Jf the smallest base for G has size d, then any irredundant base has 
size at most dlogn, and any base found by the greedy algorithm has size at most 
2d log logan; these bounds are best possible, up to constant factors. 


(The greedy algorithm chooses each base point in a largest orbit of the stabiliser 
of the preceding base points.) 

The examples Blaha gives to show that his bounds are best possible are intran- 
sitive. It seems likely that much stronger results hold for primitive groups. It is 
conjectured that 

(i) there is an absolute constant c such that, if the primitive group G has a base 
of size d, the greedy algorithm finds one of size at most cd; 

(ii) there is an absolute constant c such that, if G is primitive and almost simple, 
then with known exceptions, G has a base of size at most c. 

(A group with a base of size c has order at most n°; the exceptions in (ii) should 
be the primitive almost simple groups whose order is not polynomially bounded, 
see section 5.) 

Maund (1989) has proved some results in the direction of (ii). She has also found 
all permutation groups which permute their irredundant bases transitively. (Such 
a group G is a Jordan group, and its bases are the bases of a matroid; this is far 
from true in general.) 


Permutation groups 639 


Another issue concerns the extent to which computations in a permutation group 
can be done in polynomial time. The Schreier-Sims algorithm with a filter like 
Jerrum’s is polynomial, so we can always assume that we are given a base and 
strong generating set for the group in question. Many structural properties are 
in P (Kantor 1985a). A striking example is finding a Sylow subgroup, see Kantor 
(1985b). (The classical proofs of this important theorem are constructive, but yield 
exponential algorithms; the polynomial algorithm is much more complicated.) 

In practice, randomised algorithms are often used. For example, often we need to 
find an element of prime order p. This can be done quickly by choosing random 
elements of G until we find one with order divisible by p, and raising it to an 
appropriate power. 

Many of these algorithms are implemented in group-theoretic programming en- 
vironments, of which the most comprehensive are MAGMA, a development of 
CAYLEY (Cannon 1984) with new features for doing combinatorics (among other 
things), and GAP, an open system developed by Neubiiser. 

For further detail on computational group theory, see chapter 27. 


9. Infinite permutation groups 


Until the middle 1970s, the subject “infinite permutation groups” scarcely existed. 
(In the Mathematical Reviews, permutation groups were explicitly finite.) Now, 
this is a rapidly expanding area, with close links to model theory, Ramsey theory, 
enumeration and topology, among others. In this section I will sketch briefly some 
of these developments. 


9.1. Groups of finitary permutations 


A finitary permutation is one which moves only finitely many points. The set of 
such permutations on an infinite set forms the finitary symmetric group; the even 
permutations in it form the alternating group. Wielandt (1959), extending a result 
of Jordan about finite permutation groups, showed the following. 


Theorem 9.1. A primitive permutation group of infinite degree containing a finitary 
permutation contains the alternating group. 


The general theory of groups of finitary permutations was developed by Neu- 
mann (1976). The first observation is that all non-trivial blocks of imprimitivity are 
finite. Such a group of uncountable degree has a unique system of maximal blocks, 
and is an extension of a subdirect product of finite groups by the finitary symmetric 
or alternating group. For countable degree, there are additional possibilities, and 
examples including transitive p-groups. 


9.2. Groups of cofinitary permutations 


The dual situation is where all non-identity permutations in a group have only 
finitely many fixed points. Barlotti and Strambach (1983) showed the following. 
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Theorem 9.2. For any finite k, there is a k-transitive group in which any non-identity 
permutation fixes at most k points. 


The group can be chosen free of countable rank. A generalisation of this re- 
sult has applications in the theory of measurement in mathematical psychology 
(Cameron 1989). A survey of this topic is in preparation. 


9.3. Normal subgroups 


No simple analogue of the O’Nan-Scott theorem can hold for infinite groups. 
There are two reasons for this. The more serious is the absence of minimal nor- 
mal subgroups in general (e.g., in the free group of countable rank in the last 
subsection). The other is illustrated by the following fact. 


Proposition 9.3. There is a 2-transitive group which is the direct product of two 
simple regular subgroups. 


This follows from the existence of a (simple) group S in which all non-identity 
elements are conjugate (Higman et al. 1949). Now G = S x S, acting on 12 = S by 
the rule 


(g,h): so go'sh, 


is 2-transitive. (This group is of diagonal type.) 


9.4. Jordan groups 


Apart from some very recent examples duc to Hrushovski (1993), the known 
infinite Jordan groups fall into three types: 

(i) Automorphism groups of matroids (projective or affine spaces or the geom- 
etry of algebraically closed subfields of a field). 

(11) Automorphism groups of “order-like” relations. The prototype is the group 
of order-preserving permutations of Q. 

(iii) Homeomorphism groups of topological spaces (e.g., manifolds). 

Some characterisations of the first two classes exist. For example, a primitive 
Jordan group with a cofinite Jordan set preserves a projective or affine space over 
a finite field; this fact has very important implications in model theory (see Cherlin 
et al. 1985). Adeleke and Macpherson (1994) have characterised all primitive, 
not highly transitive Jordan groups as automorphism groups of certain relations; 
see Macpherson’s article in Kaye and Macpherson (1994) for discussion. Also, 
Wagner’s article in that volume discusses Hrushovski’s constructions. 


9.5. Oligomorphic permutation groups 


The permutation group G on 22 is oligomorphic (=“few shapes”) if it has only 
finitely many orbits on Q” for all n. The importance of this class of groups de- 
rives from the following theorem of Engeler (1959), Ryll-Nardzewski (1959) and 
Svenonius (1959). 
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Theorem 9.4. A countable first-order structure is characterised, up to isomorphism, 
by first-order axioms (and the countability condition) if and only if its automorphism 
group is oligomorphic. 


Oligomorphic permutation groups are also closely connected with enumeration 
theory: counting orbits on n-sets or n-tuples is equivalent to counting unlabelled 
or labelled finite structures in certain classes. A version of the cycle index can be 
defined for any oligomorphic group. 

The earliest theorem on these groups was due to Cameron (1976). 


Theorem 9.5. If the infinite permutation group G on 22 is transitive on n-subsets of 
22 for all n, but not n-transitive for some n, then there is a linear or circular order 
on {2 preserved or reversed by G. 


A more detailed account of oligomorpic permutation groups is given by 
Cameron (1990). See also several articles in Kaye and Macpherson (1994). 
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1. Introduction 


The chief difficulty in surveying finite geometries is the huge number of kinds of 
geometries which have been studied. A survey must steer between the extremes 
of omitting a large part of the subject, or becoming little more than a list of lists 
of axioms. I have tried to do this by a combination of selectivity and vagueness. 

The central actors in this story are undoubtedly the projective and affine spaces. 
Section 2 is devoted to a survey of these, giving descriptions, axiomatisations, 
and some of their properties. The next three sections give an account of their 
collineation groups, a study of configurations in projective spaces, and a brief 
pointer to a connection with coding theory. Projective and affine planes are impor- 
tant in view of their status as low-dimensional exceptions; some features of their 
theory appear in section 6. 

In the next few sections, the theme is generalization. Buildings form an impor- 
tant and natural extension of projective spaces. Most finite examples, apart from 
projective spaces themselves, are the polar spaces formed by the isotropic vari- 
eties with respect to polarities, or the singular varieties with respect to quadratic 
forms, in projective spaces. Section 8 treats buildings, following an account of 
rank 2 buildings (generalized polygons) in section 7. The next section treats a 
wider generalization to the class of Buekenhout geometries. Finally, I discuss par- 
tial geometries. 


2. Projective and affine spaces 


Projective spaces are defined, in general, over possibly non-commutative fields 
(division rings or skew fields). However, Wedderburn’s theorem asserts that any 
finite field is commutative; and Galois’ theorem asserts that there is a unique field 
GF(q) of any given prime power order q, and no field of non-prime-power order. 

Accordingly, let V be a vector space of dimension n+1 over GF(q). The pro- 
jective geometry PG(n,q) is the geometry whose points, lines, planes, ... are the 
i-, 2-, 3-, ... dimensional subspaces of V. (We use the general term variety for 
a subspace of arbitrary dimension. A k + 1-dimensional subspace will be called a 
k-flat, so that a point is a 0-flat, etc.) We say that two varieties are incident if one 
contains the other. We use familiar geometric language; a point lies on a line if the 
point and line are incident; two lines are concurrent if some point is incident with 
both; and so on. 

Now some familiar geometric properties hold: two points lie on a unique line; a 
non-incident point and line lie on a unique plane; and so on. (These properties can 
be summarized by saying that the projective geometry is a matroid. More precisely, 
if we identify an arbitrary variety with the set of points incident with it, then the 
varieties are the flats of a matroid. See chapter 9.) In addition various non-matroid 
properties hold: two coplanar lines are concurrent, and so on. 


Proposition 2.1. The number of k-dimensional subspaces of an n-dimensional vec- 
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tor space over GF(q) is the Gaussian coefficient 


i) figs 


In particular, the number of points of PG(n,q) is (q"*! — 1)/(q — 1); each line 
contains q +1 points, each plane contains q’ + q + 1, etc. 

The Gaussian coefficient can be defined by the formula of Proposition 2.1 for 
any positive real value of q except 1; its limit, as g — 1, is the binomial coefficient 
(Z). It has a combinatorial interpretation, and is of great importance; see chapter 
21. 


Proposition 2.2. For 0 < d < n, the points and k-flats in PG(n, q) are the points and 
blocks of a 2- (Pie Beles [ee iq ) design. 


See chapter 14 for the concepts of design theory. Here, as earlier, a variety is 
identified with the set of points incident with it. Note that the design is square 
(i.e. has equally many points and blocks) if and only if the blocks are hyperplanes 
(subspaces of codimension 1). Dembowski (1968), in his influential book, used the 
term “projective design” for what is here called a “square design”, but the term 
did not catch on; these objects are almost universally referred to as “symmetric 
designs”. 

How do you recognise a projective geometry? The result here is stated in the 
finite case only, but does not really require finiteness. It is due to Veblen and Young 
(1916), building on the work of Hilbert (1899) on the foundations of geometry. 

A projective plane of order n is a 2-(n? +n +1,n 41,1) design (whose blocks are 
called “lines”); it has the property that any two lines are concurrent. (See chapter 
14.) PG(2, g) is a projective plane of order q; but there are others, as we shall see. 
It can alternatively be defined as a square design with A = 1, or as a connected 
matroid of rank 3 in which any two lines meet. 


Theorem 2.3, Let P be a finite set of points, and £ a collection of subsets of P 
called lines. Suppose that the following conditions hold: 
(a) any line contains at least three points; 
(b) any two points lie on a unique line; 
(c) if a line meets two sides of a triangle, not at their intersection, then it meets 
the third side also. 
Then exactly one of the following conclusions holds: 
(i) P=L=9; 
(ii) |Pl[=1, L=9; 
(iii) [P| > 3, F = {P}; 
(iv) (P, L) is a projective plane; 
(v) P and & can be identified with the point and line sets of PG(n,q) for some 
2 3 and some prime power q. 


Finite geometries 651 


Axiom (c) above is sometimes called the Veblen—Young axiom or Veblen’s axiom 
or (with less justification) Pasch’s axiom. An equivalent formulation is that any 
three non-collinear points are contained in a subspace which is a projective plane. 

Note that the whole geometry can be recovered from the points and lines; for 
a set of points is a variety if and only if it contains the line through any two of its 
points. 

Condition (a) of Theorem 2.3 can be weakened. Given a collection of linear 
spaces (point-line geometries satisfying condition (b)), we define their direct sum 
as follows. The point set is the disjoint union of the point sets of its summands; 
every line of every summand, and every set consisting of two points from different 
summands, is a Jine of the sum. Then the sum is a linear space. Its varieties are 
all the sets of points which contain the line through any two of their points. (This 
is just the matroid direct sum defined in chapter 9.) It is now trivial to show the 
following. 


Proposition 2.4. A point-line geometry satisfying (a)-(c) of Theorem 2.3, with 
“three” replaced by “two” in (a), is a direct sum of a collection of geometries each 
satisfying the conclusion of that theorem. 


For example, given a family of (at least two) subsets (called lines) of a finite set, 
each line containing at least two points, such that any two points lie on a unique 
line and any two lines meet in a unique point, then the family is a projective plane, 
or the direct sum of a point and a line, or a triangle (the direct sum of three points). 
This is the De Bruijn—Erdés (1948) theorem (see chapter 14). 

The geometries described by Proposition 2.4 are sometimes called “generalized 
projective geometries”. Another terminology is to call a geometry satisfying The- 
orem 2.3(a) thick, and one satisfying the weakened form in Proposition 2.4 firm. 
This terminology will be extended later. 

Another point of view is to regard the projective geometry as a partially ordered 
set: the ordering is just inclusion. This poset is a lattice. 

Birkhoff (1935) characterized the generalized projective geometries in lattice- 
theoretic terms: 


Theorem 2.5. A finite lattice is the lattice of varieties of a generalized projective 
geometry if and only if it is atomic and modular. 


(An atom in a lattice is an element covering 0; the lattice is atomic if every 
element is a join of finitely many atoms. A lattice is modular if a v (b Ac) = (av 
b) Ac holds whenever a < c.) An equivalent formulation is as follows. 


Theorem 2.6. A geometric (i.e. simple) matroid is a generalized projective geometry 
if and only if its rank function p is modular, i.e. 


p(a Ab) + p(av b) = pa) + p(d) 
for any flats a, b. 
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Figure 2.1. The Desargues’ configuration. 


The next result characterizes PG(2,q) among projective planes. 


Theorem 2.7. For a projective plane I, the following are equivalent: 
(i) H is isomorphic to PG(2,q) for some prime power q; 
(ii) 7 satisfies Desargues’ theorem (fig. 2.1); 
(iii) HT satisfies Pappus’ theorem (fig. 2.2). 


Here and subsequently, a configuration theorem described by a geometric figure 
is the assertion that, if all the points and all but one of the lines with the required 
incidences are given, then the last line exists also. The figure may degenerate, i.e. 
there may be additional incidences which are not among those specified. 

The status of Theorem 2.7 for possibly infinite projective planes is interesting. A 
geometric argument shows that Pappus’ theorem implies Desargues’. (See Pickert 
1955, for example.) Furthermore, Desargues’ theorem is equivalent to the plane’s 
being coordinatized by a division ring, and Pappus’ theorem to the commutativity 
of this division ring. (So, for example, the projective plane over the rea] quater- 
nions satisfies Desargues’ theorem but not Pappus’.) Thus, the fact that Desargues’ 
theorem implies Pappus’ theorem for finite projective planes is equivalent to Wed- 
derburn’s theorem. No purely geometric proof of this implication is known. 

This result is used in the proof of Theorem 2.3. A geometric argument shows 
that any projective space of dimension at least three satisfies Desargues’ theorem. 
(If we view fig. 2.1 as the projection of a 3-dimensional configuration, we see that 
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Figure 2.2. Pappus' configuration, 


the theorem must hold in the “generic” case in which the configuration does not 
lie in a plane.) Then Theorem 2.7 shows that the planes can be coordinatized, and 
these coordinatizations can be glued together to prove the theorem. 

A projective space is determined by the design of proper subspaces of any fixed 
positive dimension. For the line through two points is the intersection of all the 
subspaces containing them; and then the other varieties are characterized as before. 

More generally, in any 2-design, the /ine through any two points is defined to 
be the intersection of all blocks containing those points. The points and lines then 
form a linear space (that is to say, two points lie on a unique line); and any block 
is a linear subspace. The lines do not, in general, all have the same cardinality. 

Dembowski and Wagner (1960) characterized the point-hyperplane designs of 
projective spaces in terms of these concepts. A triangle will mean a set of three 
non-collinear points. 


Theorem 2.8. The following conditions are equivalent for a square design D with 
A>1, 
(i) @ is the point-hyperplane design of PG(n,q) for some n > 2 and some prime 
power q; 
(ii) every line meets every block; 
(iii) the number of blocks containing a triangle is constant; 
(iv) the automorphism group of @ acts transitively on (ordered) triangles. 


Condition (ii) of the Dembowski-Wagner theorem is often expressed in dual 
form: if E is the intersection of two blocks, and p a point not in E, then there is 
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a (unique) block containing E and p. This condition implies that the points and 
blocks are the points and hyperplanes of a simple matroid; since there are equally 
many points and blocks, the matroid is modular, and hence a projective space 
(since it is easily seen to be connected). (cf. chapter 9). Notice that the full force 
of the fact that @ is a design is not needed. 

It is also possible to characterize projective spaces under a weakening of the 
group-theoretic hypothesis (iv). For projective planes, Ostrom and Wagner (1959) 
showed that 2-transitivity on points characterizes PG(2, q); and since the classifi- 
cation of finite simple groups was completed, all 2-transitive square designs have 
been known (cf. chapter 12). 

The affine geometry AG(n,q) can be defined in either of two ways: 

(1) Remove a hyperplane from PG(n,q) (that is, remove its points from all 
varieties, and remove those varieties which are contained in the hyperplane). 

(2) Take an n-dimensional vector space V over GF(q); the points of the affine 
space are the vectors, and the k-flats are all the cosets of k-dimensional vector 
subspaces. 

To match up the descriptions, let x9,...,x, be coordinates for the (n+ 1)- 
dimensional vector space from which PG(n, q) is built. If the hyperplane xp = 0 is 
removed, every point (1-dimensional subspace) remaining contains a unique vector 
with xp = 1, and the biection 


(C1, x4,---,Xn)) a (41,---,Xn) 


between the point sets is an isomorphism between the two descriptions. 
All the results described for projective spaces have analogues for affine spaces. 
For example, we have the following. 


Proposition 2.9. The number of k-flats in AG(n,q) is q"-* lide: 
Proposition 2.10. The points and k-flats of AG(n,q) form a 2-(q",q*, (e-11,) de- 
sign. If q = 2, it is even a 3-design. 


An affine plane is a 2-(n?,n,1) design, for some n > 1; in other words, an affine 
design with A = 1. (See chapter 14.) 


Theorem 2.11 (Buekenhout 1969). A non-trivial linear space with more than three 
points on some line is either an affine plane or the point-line design of AG(n,q) for 
some n and q if and only if every triangle is contained in an affine plane. 


The conclusion is trivially false if lines have two points (then the structure is 
trivial — every point pair is a line). Hall (1967) first gave a counterexample with | 
three points on every line. Such objects are Steiner triple systems in which every 
triangle lies in a 9-point subsystem. They are closely connected with commutative 
Moufang loops of exponent 3 (see Bruck 1958), and have been studied by Young 
(1973), Bénéteau (1986), Fischer (1964), Hall (1974), and others. 
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Theorem 2.12 (Dembowski 1964b). For an affine design 2 with A > 1, the following 
are equivalent: 
(i) @ is either the design of points and hyperplanes of AG(n,q) for some n > 3 
and prime power q, or a Hadamard 3-design; 
(ii) every line meets every block; 
(iti) the number of blocks containing a triangle is constant. 


Kimberley (1971) has studied Hadamard 3-designs with a view to quantifying 
the extent to which they resemble affine spaces over GF(2). 

For q = 2, every pair of points is a line, and the geometry cannot be recovered 
from its lines alone. However, lines and parallelism suffice for any g, where paral- 
lelism is an equivalence relation on lines satisfying Euctid’s parallel postulate: each 
equivalence class contains a unique line through every point. See Lenz (1954) for 
axiomatisations, and Cameron (1976) for more on the case g = 2. Note that any 
affine plane admits a parallelism. Hence, under the hypotheses of Buekenhout’s 
Theorem 2.11, two lines can be defined to be parallel if they are contained in 
an affine plane (and parallel there); the difficulty is to show that this relation is 
transitive. 


3. Collineation groups 


The so-called Fundamental Theorem of Projective Geometry is a description of the 
collineations of PG(n,q). A collineation is a permutation of the set of varieties 
which preserves the dimension and the relation of incidence. Any non-singular 
linear transformation of V acts in a natural way on the set of subspaces of V, 
and clearly induces a collineation. The set of such collineations is a group, the 
general linear group GL(n,q). If we coordinatize V as GF(q)", then the group of 
automorphisms of GF(q) also acts coordinate-wise as a collineation group. (This 
group is cyclic of order e, where q = p*, p prime.) The product of these two groups 
(which is their semi-direct product) is denoted by ['L(n, q); its elements are called 
semilinear transformations. The kernel of the action of this group on the projective 
space is easily seen to be the group Z of non-zero scalar transformations. We let 
PIL(n,q) denote the quotient, which is isomorphic to the group of collineations 
induced by T'L(7, g). 


Theorem 3.1 (Fundamental Theorem of Projective Geometry). For n 22, any 
collineation of PG(n,q) is induced by a semilinear transformation of the under- 
lying vector space; in other words, the fuil collineation group is PYL(n,q). 


Theorem 3.2. For n > 2, the collineation group of AG(n, q) is the semidirect product 
of the translation group of V by the group TL(n,q) of semilinear transformations. 


(This group is denoted by AT'L(n, q).) 
A projective line, as we have defined it, has no structure; its collineation group is 
the symmetric group. It is possible to give structure to a projective line in various 
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ways, either by using cross-ratio or as an algebraic curve; but I will not discuss 
this. 


Theorem 3.3 (Singer’s theorem). There is a collineation of PG(n,q) which permutes 
the points in a single cycle. 


This result is trivial to prove — if we regard the underlying vector space as having 
the structure of GF(q"*'), the required collineation is induced by multiplication 
by a primitive element of the field — but is the origin of the theory of difference 
sets; see chapter 14. 

There is a large body of work on collineation groups of projective spaces. The 
assumptions made on the groups fall into three classes: generation, transitivity, and 
low dimension. 

The first type of problem concems collineation groups generated by particular 
kinds of elements, such as transvections (whose fixed points form a hyperplane, 
corresponding to linear transformations of the form /+£, where E has rank 1 
and E? = 0), or reflections (induced by diagonalizable transformations with all but 
one eigenvalue equal to 1). Some of this material dates from the beginning of 
this century. A sample result is McLaughlin’s (1967) determination of irreducible 
groups generated by full transvection subgroups. See Kantor (1982) for a survey. 

Typical of the second type of problem is a result of Cameron and Kantor (1979) 
and Orchel (1978). 


Theorem 3.4. Let G be a group of collineations of PG(n,q) (n > 2) which acts 
2-transitively on points. Then either G contains PSL(n+1,q), or g =2, n=3, and 
G is isomorphic to the alternating group A>. 


Of course, this result is now superseded by the classification of all finite 2- 
transitive permutation groups; nevertheless, its proof is elementary and involves 
some nice geometry. In fact, the classification of the 2-transitive groups with ele- 
mentary abelian regular normal subgroups by Huppert (1957) and Hering (1974, 
1985) is equivalent to a determination of all collineation groups which are transi- 
tive on points. (These are the groups H = Gy listed in table 5.1(a) of chapter 12.) 
Many similar problems are resolved by the classification of finite simple groups. 

Block’s lemma (see chapters 14, 31) implies that a collineation group of PG(n, q) 
(n > 2) has at least as many orbits on lines as on points. When does equality hold? 
This problem was studied by Cameron and Liebler (1982) and Penttila (1984), but 
is not yet settled. 

The classification of subgroups of low-dimensional projective groups is almost 
as old as group theory, having been considered by Galois in his letter to Chevalier 
in 1832. Dickson (1901) gave a list of all the subgroups of PSL(2,q). Throughout 
this century, work continued on finding all (or at least all maximal) subgroups of 
PSL(n,q) for small m. This work is part of the general attempt to find all maximal 
subgroups of “almost simple” groups. The relevance of this can be seen in part 
from the O’Nan-Scott theorem (chapter 12), which gives a good description o1 
those primitive permutation groups which are not almost simple. 
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An important theorem of Aschbacher (1984) is the analogue of the O’Nan-Scott 
theorem (in its original form concerning maximal subgroups of symmetric groups) 
for collineation groups. Aschbacher defines certain classes of large subgroups: re- 
ducible (fixing a subspace); imprimitive (fixing a direct sum decomposition); the 
stabilizer of a tensor product decomposition; the projective group over a subfield 
of GF(q); a group of smaller dimension over an extension field of GF(q); a clas- 
sical group (the stabiliser of a non-degenerate sesquilinear or quadratic form); or 
the normalizer of an irreducible extraspecial group. His theorem asserts that a 
maximal subgroup of PSL(n,q) which is not in any of these classes is almost sim- 
ple and absolutely irreducible. There is an analogous result for the other classical 
groups. In addition, we expect that these “exceptional” subgroups will be small. 
Liebeck (1985) gives an upper bound of q*" for the order of such a subgroup, and 
improved bounds with “known” exceptions have been obtained subsequently. A 
typical example is the occurrence of As as a maximal subgroup of PSL(2, p) when- 
ever p is a prime congruent to +1 (mod 5). These matters, and much more detail 
about subgroups of classical (and other) simple groups, are discussed by Kleidman 
and Liebeck (1988, 1990). 


4. Galois geometry 


To Euclid, the plane and space in which he worked were familiar objects. His 
axioms were not intended to characterize these structures, but as self-evident truths 
on which to base propositions about configurations in the plane or space. Of course, 
since Descartes, algebraic or analytic proofs have become more common than 
Euclid’s geometric ones. In a similar way, the classical results of section 2 (due to 
Hilbert, Veblen and Young, Birkhoff, etc.) have made PG(n, q) a familiar object. 
The study of configurations in PG(n,q) is largely the creation of B. Segre (1961). 
Part of his legacy is the pre-eminence of Italian mathematicians in this area. 

The study of PG(n,q) has two aspects, not always sharply differentiated: the 
study of configurations within the space, for example by their intersections with 
subspaces; and the study of intrinsic properties of subgeometries (including, for 
example, conditions for representability of matroids). 

Before beginning, I draw the reader’s attention to Hirschfeld (1979, 1985), 
Hirschfeld and Thas (1991) a very detailed account of material which can at most 
be sketched here. 

A cap is aset of points in PG(n, q) with no three collinear. (For n = 2, a cap is 
called an arc.) The general problem here, which is yet unsolved, is to determine the 
size of the largest cap in PG(n,q). Consider first the case 7 = 2. The lines joining 
one point on an arc to the others are all distinct; since a point lies on q + 1 lines, 
there are at most g +2 points in an arc. If this bound is attained, then any line 
which meets the arc does so in two points; then the arc is partitioned into sets of 
size 2 by the lines through any outside point, and so g must be even. Moreover, if 
q is even, then a (q + 1)-arc has a knot or nucleus, each line through which meets 
the arc once; adjoining this point gives a (q +2)-arc. (This paragraph applies to 
any projective plane of order q.) 
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A conic in PG(2,q) is a set of points equivalent under PGL(3, q) to 
{((x,y,z)) xz =y?} = {((1,6,27)) st € GF(q)} U (0,0, 1))}. 


It is easily seen to be a (q + 1)-arc. If ¢ is even, its nucleus is ((0,1,0)). Hence, the 
following holds. 


Proposition 4.1. The maximum size of an arc in PG(2,q) is 


q+1, ifq is odd; 
q+2, if q is even. 


An arc containing this bound is called an oval. (Sometimes the term “oval” is 
used for a g + l-arc, a q + 2-arc being called a hyperoval.) 

A celebrated theorem of Segre (1955) describes all ovals in the case where q is 
odd. It was proved as a result of a speculation in cosmology (Jarnefelt 1951). 


Theorem 4.2 (Segre’s theorem). Any oval in PG(2,q), for q odd, is a conic. 


No such result is known for even q. For q < 8, any oval consists of a conic and 
its nucleus; but this is false for g = 16 (Lunelli and Sce 1958), and no classification 
is known without strong additional hypotheses. An example of a “non-classical” 
oval is the set 


{((1,2,2"1))} U {((0,0, 1))} 


in PG(2,2°), where m = 24, with 1 < d < e. (Some of these arcs are equivalent to 
conic plus nucleus; but for e = 5 or e > 7, there are some which are not.) 

This leaves the problem of arcs maximal under inclusion but not of maximum 
cardinality. A complete classification is too much to hope for; so define functions 
f, g such that 

(i) an are of cardinality less than f(q) is not maximal (and f(g) is the smallest 
number with this property); 

(ii) an arc of cardinality greater than g(q) is contained in an oval (and g(q) is 
the greatest number with this property). 

These functions have been much studied. The next theorem surveys some of the 
results. 


Theorem 4.3, (i) cq!/? < f(q) < cq*. 
(ii) 


q-349", if q is even; 


q—qi?+1, if q is even; 
S 
#q+1, if q is prime. 
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(q@)> { lq+q'?, _ in general; 
S\4) 2 q—q?+1, if q isa square. 


The lower bound in (i) is trivial: an arc with fewer than (2q)'/2 points has at most 
gq secants, which cannot cover all the points of the plane. The rest is a compilation 
of results by Boros, Fisher, Hirschfeld, Segre, Sz6nyi, Thas, Voloch and Zirilli. Note 
that (iii) depends in part on a result of Schoof and Waterhouse on the tightness of 
Weil’s theorem (see chapter 32). See also Thas (1983) for another proof of Segre’s 
theorem. 

More generally, given a set S of g+1 points in PG(2,q), a point p is called 
a nucleus of S if every line through p meets S. (Note that if S is a line, then 
every point off S is a nucleus.) Some recent results characterize various sets by 
the number or structure of their set of nuclei. For example, Blokhuis and Wilbrink 
(1987) showed the following. 


Proposition 4.4. If S is not a line, then it has at most q —1 nuclei. 


The largest cap in PG(3,q) has size g?+1 (Barlotti 1955). A cap of this size 
is called an ovoid. Ovoids have such a rich structure theory that I will digress to 
describe them further. 

Let O be an ovoid. Then, for any point p € O, the lines tangent to O at p 
(i.e. meeting O only in p) form a plane, the tangent plane at p. Any other plane 
intersects O in a set of g+1 points forming an arc in that plane. These arcs are 
called circles. Thus, any three points lie in a unique circle, and the points and 
circles form a 3-(q? + 1,q +1, 1) design. 

Any 3-(n? +1,n+1,1) design is called an inversive plane or Mébius plane. An 
inversive plane which arises from an ovoid is called egglike. 

The classical example of an ovoid is the elliptic quadric, the set 


{((X0, X1,X2,%3)): Xox1 + f(x2,x3) = O}, 


where f is an irreducible quadratic form in two variables over GF(q). 


Theorem 4.5. (i) An inversive plane is egglike if and only if it satisfies the Bundle 
theorem (fig. 4.1) (Kahn). 

(ii) An inversive plane arises from an elliptic quadric if and only if it satisfies 
Miquel’s theorem (fig. 4.2) (Van der Waerden-Smid). 

(iii) Every inversive plane of even order is egglike (Dembowski). 


See Dembowski (1964a, 1968) and Kahn (1980). Note that the Bundle theorem is 
the assertion that the rank 4 matroid associated with the design (whose hyperplanes 
are the circles) does not contain the Vamos matroid as a minor: see chapter 9. 

Apart from the elliptic quadrics, the only known ovoids are the Suzuki-Tits 
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Figure 4.1. The Bundle theorem. 


Figure 4.2. Miquel’s theorem. 
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ovoids, which live in PG(3,q) (q = 2%4*!, d > 1) and admit the Suzuki groups as 
automorphism groups (see chapter 12). Moreover, no non-egglike inversive plane 
is known. There are results asserting that an inversive plane admitting a sufficiently 
large collineation group must be known; see Ltineburg (1965), for example. 

In the same way that Mobius planes model conics on an elliptic quadric, the other 
quadrics in 3-space (the hyperbolic or ruled quadric, and the cone) are modelled 
by Minkowski and Laguerre planes. These have also been intensively studied, and 
there is a unified treatment of matters related to Theorem 4.5 (Kahn 1980). More 
specifically, the classification of flocks (sets of pairwise disjoint circles in Mobius, 
Minkowski, and Laguerre planes, or of plane sections of quadrics in 3-space) has 
now been essentially completed (see Bader and Lunardon 1989, Thas 1992, for 
example). This work has many applications to inversive planes, translation planes, 
“maximum exterior sets” of quadrics, etc. 

The largest cap in PG(n, 2) is easily seen to be the complement of a hyperplane. 
Apart from this, only isolated results are known. For example, the largest caps 
in PG(4,3) and PG(S5,3) have size 20 and 56 respectively (Pellegrino 1971, Hill 
1978). The unique cap in the latter case contains half of the points of a quadric 
and admits the simple group PSL(3, 4). 

A related problem (identical in the plane) is to determine the largest set of 
points in PG(n,q) with the property that no 1 +1 are contained in a hyperplane. 
Such a set is called an arc. Two examples of arcs are: 

(a) the simplex, the points spanned by the basis vectors and the all-1 vector, 
with cardinality n +2; 

(b) the normal rational curve consisting of the points spanned by the vectors 
(1,t,27,...,£7) for t € GF(q) and (0,0,...,1), with cardinality q +1. 

These examples show that the answer will depend on the relative magnitudes of 
n and q. The existence of an arc with k points is equivalent to the representability 
of the uniform matroid U*,, over GF(q). For n > q—-1 every maximal arc is a 
simplex; this follows from the observation that if an arc with n+ 3 points exists, 
then the matroid Sheed and so its dual uy? are representable over GF(q), and 
hence n+3<q+t1. 

Arcs are closely connected with maximum distance separable codes. (This is an 
instance of a general connection between projective geometry and coding theory 
which is discussed in section 5 and in chapter 16.) See Bruen et al. (1988). 

Another variant asks for sets meeting any line in at most k points, for some k. 
This question is less well developed, except in the plane. The analogue of ag + 2-arc 
is aset which meets every line in 0 or k points. Such a set has cardinality gk —q +k. 
A trivial necessary condition is that k divides g (proved as for the evenness of q 
when q + 2-arcs exist. These sets have been given the misnomer “maximal arcs”; 
possibly this is because a set meeting every line in at most k points has cardinality 
at most gk — q +k, with equality if and only if it is a “maximal arc”. 


Proposition 4.6. (i) A “maximal arc” exists whenever k and q are powers of 2 and 
q 2k. 
(ii) There is no “maximal arc” with k = 3, q > 3 (or, dually, with k = q/3 > 1). 
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These results are due to Denniston (1969a) and Thas (1975) respectively. 

For the maximum of generality, let h and k be integers and L a set of integers. 
Which sets S of h-flats of PG(n,q) have the property that, for any k-flat E, the 
number of h-flats contained in E which are members of S belongs to L? This is 
so general that it is impossible to survey the results; but see Hirschfeld (1985). 

Specialising slightly, a set of points is called an h-blocking set if it meets every 
h-flat but contains none. The term (its original form was “blocking coalition”) 
comes from game theory, in which the original work was done (see Di Paola 
1969). (Roughly: if the A-flats are the minimal winning coalitions in a game, then 
a blocking coalition is one which can prevent any other group from winning, but 
cannot itself win.) The subject has been revitalized by connections with coding 
theory, to be discussed in section 5. For its connections to matroid theory, see the 
treatment of t-blocks in chapter 9. A 1-blocking (i.e. line-blocking) set in PG(2, q) 
is simply called a blocking set. 


Proposition 4.7. (i) A blocking set in PG(2,q) has cardinality at least q + q'/? +1, 
with equality if and only if it is a Baer subplane, the set of points with coordinates 
in GF(q'/?). 

(ii) A blocking set in PG(2,q) which is minimal (under inclusion) has cardinality 
at most q>/? + 1. 


These results are due to Bruen (1971a), Bruen and Thas (1977). 

The lower bound is met in any Desarguesian plane of square order; but it has 
been improved to q + (2q)'/* + 1 for sufficiently large odd non-square q by Blokhuis 
and Brouwer (1986) (for a proof, see chapter 32). An example meeting the upper 
bound is given by a hermitian unital, the set of zeros of the form 


XoX2 + XX, + X2X0, 


where < = x7”, the image of x under the field automorphism of order 2. Note that 
the two extreme cases in the theorem are characterized by the fact that any line 
meets the blocking set in 1 or k points for some k (and in fact k = q!/? +1). 

For affine planes, the value is much larger (Jamison 1977), see also Brouwer 
and Schrijver 1977) 


Proposition 4.8. A blocking set in AG(2,q) has cardinality at least 2q — 1. 


For a proof, see chapter 32. 

There are analogues for larger values of 4, and for other situations. For an 
up-to-date account of blocking sets, see Jungnickel (1989). 

A version of Ramsey’s theorem was proved for projective spaces by Graham et 
al. (1972). 


Theorem 4.9. Let integers k, 1, r and a prime power q be given. Then there is an 
integer n such that, if the k-flats of PG(n,q) are partitioned into r classes, then there 
is an I-flat, all of whose k-flats are contained in the same class. 
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For further discussion of this important theorem and its context, see chapter 25. 
Here it has the following corollary. 


Corollary 4.10. For fixed | and n, there exists qo such that PG(n,q) possesses 1- 
blocking sets only if q > 4o. 


For, if a blocking set exists, partition the points into the blocking set and its 
complement; there is no monochromatic /-flat. Lower and upper bounds for gp are 
known (Mazzocca and Tallini 1985, Cameron and Mazzocca 1986), but there is a 
wide gap between them! 

An important theorem from algebraic geometry, which concerns finite geome- 
try but has not been exploited to the full by finite geometers, is the Hasse—Weil 
theorem, see chapter 32. 

Turning now to intrinsic characterisations, the most important class of config- 
urations consists of polar spaces. A duality or correlation of PG(n,q) is a lattice 
isomorphism from PG(n, qg) to its dual; it is a polarity if it has period 2. The Fun- 
damental Theorem of Projective Geometry implies that any duality arises from a 
sesquilinear form on V, a function of two variables which is linear in the first vari- 
able and semilinear in the second. Further analysis shows that polarities arise from 
forms f which are either o-hermitian, for some field automorphism o whose square 
is the identity (this means that f(v,w) = f(w,v)”), or alternating (this means that 
f(v,v) =0 for all v). Note that o may be the identity, in which case a o-hermitian 
form is symmetric. A variety is totally isotropic for a polarity if it is contained in 
its image; equivalently, if the form f vanishes identically on it. The polar space as- 
sociated with a polarity is the geometry consisting of the totally isotropic varieties. 

In odd characteristic, if f is a symmetric bilinear form, then Q(v) = f(v, v) defines 
a quadratic form, and the totally isotropic varieties are just those which are totally 
singular for Q, i.e. on which Q vanishes. Moreover, f can be recovered from Q by 
polarization, i.e. 


f(v,w) = 5(Q(v + w) — Q(v) — Q(w)). 


In even characteristic, this is no longer true: a quadratic form polarises to an 
alternating, not a symmetric, form. Since the geometry of totally singular varieties 
of a non-degenerate quadratic form possesses properties similar to those of polar 
spaces of polarities, it has been found best to extend the definition to include them, 
and to dispense with the symmetric bilinear forms altogether. 

Thus there are three types of polar spaces, which are commonly named after 
the associated classical groups (see chapter 12): 

e unitary, from o-hermitian forms with o + 1; 
e symplectic, from alternating bilinear forms; 
® orthogonal, from quadratic forms. 

The rank of a polar space is defined to be one more than the projective di- 
mension of the largest variety it contains. (In the orthogonal case, this is the Witt 
index of the quadratic form.) The major result here is due to Tits (1974), extending 
earlier work by Veldkamp (1959): 
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Theorem 4.11. For r > 3, finite polar spaces of rank r are characterized by the 
following axioms: 

(a) Any variety, together with the varieties it contains, is a projective space of 
dimension at most r — 1. 

(b) The intersection of any set of varieties is a variety. 

(c) if E is an (r —1)-flat and p a point not in E, then the set of points of E joined 
to p by lines of the polar space is a hyperpiane of E, and the union of these lines is 
an (r — 1)-flat. 

(d) There exist two disjoint (r — 1)-flats. 


Tits’ theorem is really more general; the infinite geometries satisfying the axioms 
are classified as well. There are analogues of the finite ones, as well as some new 
types. See Tits (1974) for details. 

Tits’ axioms in Theorem 4.11 were considerably simplified by Buekenhout and 
Shult (1974) as follows. In a point-line geometry, we call two points adjacent if 
they are incident with a common line. A subspace is a set of points containing 
every line through any two of its points; it is singular if any two of its points are 
adjacent. 


Theorem 4.12 (Buekenhout-Shult theorem). Suppose that a finite point-line geom- 
etry satisfies the following conditions: 

(a) if a point p is not on a line L, then p is adjacent to either one or all points 
of L. 

(b) Every line is incident with at least three points, and no two lines are incident 
with the same set of points. 

(c) No point is adjacent to all other points. 

Then the points and singular subspaces satisfy Tits’ axioms in Theorem 4.11 for 
a polar space. 


Notes: It follows from the axioms of Buekenhout and Shult (though it is by 
no means obvious) that two points lie on at most one line. Hence every line is a 
singular subspace! Again, finiteness of the geometry is not really needed; it suffices 
that chains of singular subspaces should be finite. 

Many numerical properties of the polar spaces can be simply expressed. For 
example, the following holds. 


Proposition 4.13. The number of points of a polar space of rank r over GF(q) is 
(q’ — 1)(q"** +1)/(q—-1), 


and the number of r — 1-flats is 


is 
[]@** +, 
k=1 


where e is given in table 4.1, 
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Table 4.1 
Polar spaces 
Type dim(V) € 
Symplectic 2r 0 
Orthogonal 2r 1 
Orthogonal ar+1 0 
Orthogonal 2r+2 1 
Unitary 2r = 5 
Unitary ar+1 } 


The quadrics corresponding to polar spaces with « =1,—1 are sometimes called 
elliptic and hyperbolic respectively. The formulae are also valid for r = 1,2; we 
have seen examples with r = 1 (conics, elliptic quadrics in projective 3-space, and 
hermitian unitals in the plane). Note that every point of the projective space is 
a point of the symplectic polar space. Note also that, in even characteristic, the 
symplectic and orthogonal polar spaces with ¢ = 0 are isomorphic. 

There are also characterisations of polar spaces in terms of their embeddings. 
For example, Tallini—Scafati (1967) showed the following. 


Theorem 4.14. Let q be an odd square, and n > 2. Then a set of points in PG(n, q) 
which meets every line in 1, k or q + 1 points is the point set of a unitary polar space, 
with k =q'/? +1. 


A characterisation of sets satisfying the same condition with g even and q > 4 
is given by Hirschfeld and Thas (1980a,b). As well as hermitian varieties, certain 
projections of quadrics can arise. 

The result of (4.11) is similar to the axiomatisation of projective spaces in The- 
orm 2.3, in that low-dimensional exceptions occur. In the case r = 1, a polar space 
consists of points only. For r = 2, however, it is more interesting. The hypotheses 
of Theorem 4.11 assert that we have a geometry of points and lines, with two 
points on at most one line; if a point p is not on a line L, then p is collinear 
with exactly one point of L; and no point is collinear with all others. A geometry 
satisfying these conditions is called a generalized quadrangle or GQ. There will be 
more about these geometries in general later. The important result in the present 
context is the theorem of Buekenhout and Lefévre (1974). 


_ Theorem 4.15. If a generalized quadrangle 2 is embedded as a spanning subset of 
a projective space P, so that every point of P on a line of 2 is a point of 2, then 
2 is a classical polar space in ?. 


Note that both the isomorphism type of 2 and its embedding in # are deter- 
mined. 

The configuration of A-flats in PG(,q) has intrinsic characterisations. We can 
structure it in two possible ways. One is to take it as the point set of an incidence 
Structure whose blocks are the (h + 1)-flats. The other is to define lines to be sets 
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of the form 
{X:UCX CW}, 


where U is an (h — 1)-flat and W an (4+ 1)-flat with U C W; in this case, a line has 
q +1 points, and two points lie on at most one line. By taking the (A + 1)st exterior 
power of the underlying vector space, the set of A-flats of PG(n,q) is embedded 
in the point set of PG((711) —1,q); the lines contained within this set are precisely 
the lines described above. See Sprague (1985), Tallini (1981). 

Another class of embedding theorems is due to Kantor (1974, 1975) and Percsy 
(1981). Kantor’s theorems assert that, if a matroid has the property that all the 
“top intervals” of dimension at most 3 are embeddable as sufficiently large parts 
of projective spaces, then the whole matroid is so embeddable. The canonical 
application is the following. 


Proposition 4.16. A 4-(q?+2,q+2,1) design exists only for q =2, 3 or possibly 
13. 


For standard divisibility arguments restrict q to finitely many values of which all 
except 3 and 13 are even. If g is even, then the derived design (an inversive plane) 
is egglike, by Dembowski’s theorem 4.5(iii); by Kantor’s theorem, the whole design 
is embeddable in PG(4,q), and then an easy counting argument shows that q = 2. 
(The divisibility conditions alone do not preclude the existence of a 12-(74, 18, 1) 
design!) 

The rank bound in Kantor’s theorem is necessary. For example, the 5-(12, 6, 1) 
design, a 2-point extension of the egglike inversive plane of order 3, is embeddable 
in PG(5,3) (Coxeter 1958); but the 5-(24,8,1) Witt-design, a 3-point extension of 
PG(2,4), is not embeddable in PG(5,4). (For the constructions of Witt-designs, 
see chapter 14.) 

Percsy (1981) proved an analogous result for embeddings in polar spaces, but 
there is some doubt about the correctness of his results. 


5. The coding connection 


There are many connections between finite geometry and coding theory. This sec- 
tion is a brief account of one of these; for the 2-dimensional case, see also chapter 
16. 

Let A be ann x N matrix over GF(q) having the following properties: 

(a) the rows of A are linearly independent; 

(b) any two columns of A are linearly independent. 

From A, we can construct two objects: 

(1) A set of N points in PG(n — 1,q), those spanned by the N columns of A. By 
(b), the points are all distinct; by (a), they span the space. This set is unaltered by 
multiplying A on the right by a monomial matrix: permutations of the columns just 
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permute the points in the set, while multiplying a column by a non-zero scalar does 
not change the 1-dimensional subspace it spans. On the other hand, multiplying A 
on the left by a non-singular matrix has the effect of applying a collineation (an 
element of PGL(n, q)) to the projective space, and so replaces A by an equivalent 
set of points. 

(2) A code of length N and dimension n, the subspace of GF(q)" spanned by 
the rows of A. (The dimension is guaranteed by (a).) Condition (b) asserts that 
the dual code has minimum weight at least 3, that is, it is 1-error-correcting. (See 
chapter 16 for concepts of coding theory.) Multiplying A on the left by a non- 
singular matrix does not change the code, merely changes the basis. On the other 
hand, multiplication on the right by a monomial transformation replaces the code 
by an equivalent code (under coordinate permutations and scalar multiplications), 
not changing coding-theoretic parameters of interest. 

Under these circumstances, we would expect that information about the code 
and the configuration would transfer back and forth. For example, words of weight 
3 in the dual code correspond to collinear triples of points; complements of code- 
words in the original code are hyperplane sections of the set; etc. 

More strikingly, Greene (1976) showed that the weight enumerator of the code 
is a specialisation of the Tutte polynomial of the matroid represented by the point 
set. In this way, he was able to relate coding and matroid duality. 

For a simple example, consider the set of all points of PG(n — 1,q). The dual of 
the corresponding code has length (q” — 1)/(q —1) and dimension (g" — 1)/(¢ — 
1) — n, and is 1-error-correcting, and hence perfect; it is a Hamming code. 

Again, consider a set of N points with the property that no nm of them are 
contained in a hyperplane, as discussed in section 4. The corresponding code has 
length N, dimension x, and minimum weight at least N —n+1 (by the remark 
on hyperplane sections); so it is maximum distance separable (i.e. it attains the 
Singleton bound). Conversely, a linear MDS code gives a set of points with this 
property. See Bruen et al. (1988). For a similar discussion of caps, see Hill (1978). 

A far-reaching generalization of this construction produces the so-called alge- 
braic geometry codes, see Goppa (1984), Drinfel’d and Vladut (1983), Manin and 
Vladut (1985), Van Lint and Van der Geer (1987). These codes are constructed 
from divisors on an algebraic curve; the Riemann-Roch theorem gives lower 
bounds for the dimension and minimum distance in terms of the genus of the 
curve. 


. 6. Projective and affine planes 


The importance of projective and affine planes is underlined by their appearance 
in Theorem 2.3. In Dembowski’s influential book “Finite Geometries”, published 
in 1968, these planes account for over half the total content (though the proportion 
may be less today, because of an upsurge of interest in other topics). 

We obtain an affine plane by removing a line and all of its points from a projec- 
tive plane. Conversely, given an affine plane, we reconstruct the projective plane 
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by adding a “point at infinity” to all the lines in each parallel class, and adding 
one line incident with all the new points. The canonical nature of the construc- 
tion shows that the collineation group of the affine plane is the subgroup of the 
collineation group of the projective plane fixing the chosen line; and that two affine 
planes are isomorphic if and only if the projective planes are isomorphic and the 
lines lie in corresponding orbits of the collineation group. 

Wilker (1977), following Mendelsohn (1972), showed the next result. 


Proposition 6.1. Given a group G, there is a projective plane IT with the properties 
(i) Aut(7) = G; 
(ii) for any subgroup H of G, there is a line L of I such that H is the stabiliser 
of L (and hence the collineation group of the corresponding affine plane). 


It is not known whether, if the group is finite, the plane can also be taken to 
be finite. It is not even known whether every finite group is a subgroup of the 
collineation group of some finite projective plane. 

(A related observation concerns substructures. Any set of points and lines form 
a partial linear space, that is, any two points lie on at most one line and dually. 
A well-known free construction embeds any partial linear space in a projective 
plane. Is every finite partial linear space embeddable in a finite projective plane? 
The question may be specialized to linear spaces or simple matroids of rank 3 (two 
points on exactly one line).) 

It is possible to coordinatize any projective plane by an algebraic structure 
known as a planar ternary ring. (In the case of PG(2,q), this structure is GF(q) 
equipped with the ternary operation (x,m,c)++ x-m+c.) There is a considerable 
amount of theory about the interplay between algebraic properties of the PTR, 
configuration theorems in the plane, and the existence of certain collineations. 

A collineation 6 of a projective plane IT is called central if there is a point p (its 
centre) such that 6 fixes every line through p. Dually, @ is axial if it has an axis, a 
line L such that @ fixes every point on L. A non-identity collineation has at most 
one centre and at most one axis, and is central if and only if it is axial. A central 
collineation is called an elation if its centre is incident with its axis, a homology 
otherwise. 

Desargues’ theorem is equivalent to the statement that [7 has “all possible” 
central collineations; that is, for every point p and line L, the group of central 
collineations with centre p and axis L acts transitively on the points of M other 
than p and LMM, where M is any line on p different from L. We abbreviate this 
condition by saying that J7 is (L, p)-transitive. If H is (L, p)-transitive for all L and 
p, then the group of elations (resp. homologies) with given centre and axis can 
be identified with the additive (resp. multiplicative) group of the coordinatising 
field, hence the PTR satisfies the axioms for a field. This is the prototype of the © 
connections alluded to above. 

The most ambitious classification of finite projective planes is the Lenz—Barlotti 
classification, according to the configuration formed by the pairs (L, p) for which 
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the plane is (L, p)-transitive. (Lenz considered elations, i.e. incident pairs, and 
distinguished seven types; by including homologies, Barlotti refined this to 53 types. 
The Desarguesian planes stand at one extremity; at the other are the planes which 
are not (L, p)-transitive for any (L, p). See Dembowski (1968), Hughes and Piper 
(1973) for an account of this. 

One particularly important case is that where there is a line L such that J7 is 
(L, p)-transitive for all p incident with L. Equivalently, in the affine plane obtained 
by removing L, the group of translations (collineations fixing every parallel class 
of lines but no points) is transitive on points. Such an affine plane is called a 
translation plane. The translation group must be elementary abelian, and hence 
a vector space V over GF(p) for some prime p; its dimension must be even, say 
2m. The points of the plane can be identified with the non-zero vectors. Then the 
lines containing 0 are m-dimensional vector subspaces, and the remaining lines are 
cosets of these subspaces. 

The kernel of a translation plane is the set of all those endomorphisms of the 
vector space which map the lines through 0 to themselves. It is a field & containing 
GF(p), and V and the lines through 0 are k-vector spaces. So we can replace 
GF(p) in the above by k = GF(q), say, and redefine m so that dim,V = 2m. 

The lines through the origin form a set of g” +1 subspaces of dimension yn — 1 
in PG(2m —1,q) which partition the point set. Such a collection of subspaces is 
called a spread. Many examples of translation planes have been constructed by 
taking a linear group G over GF(q) and looking for a G-invariant spread; see 
Ostrom (1970), for example. 

Translation planes of order g* with kernel containing GF(q) have received most 
attention, and their theory is best seen in relation to the Klein quadric. 

The set W of skew-symmetric 4 x 4 matrices over a field k is a 6-dimensional 
vector space. The determinant of a skew-symmetric matrix is a square, det(A) = 
Pf(A)*, where Pf(A), the Pfaffian of A, is (in this case) a quadratic form on W. 
Thus the singular matrices are precisely those on which the Pfaffian vanishes. 
Moreover, the rank of a skew-symmetric matrix is necessarily even, so (in this 
case) 0, 2 or 4. 

Any skew symmetric matrix of rank 2 can be written as v'w — wv, where v 
and w are row vectors. The vectors themselves are not uniquely determined, but 
their span is; so we have a bijection from the set of lines of PG(3, k) to the points 
of PG(5,k) which are singular for the quadratic form given by the Pfaffian. This 
set Q of points is called the Klein quadric. 

Now it is easy to see that two lines L, L’ of PG(3,k) intersect if and only if the 
corresponding points of the Klein quadric are joined by a line lying in the quadric. 
(The points on this line translate back into the lines of the plane pencil in PG(3, k) 
determined by L and L’.) So a spread of lines in PG(3,k) (and hence a translation 
plane of the type we are considering) corresponds to an ovoid on the quadric, a 
maximal set of pairwise non-collinear points of 2. (If kK = GF(q), the number of 
points on an ovoid is q? +1.) 

The Pappian plane (the plane AG(2,q7) in the finite case) is obtained when 
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the ovoid is an elliptic quadric (a 3-dimensional section of 2). The corresponding 
spread is called regular. 

As we saw in section 4, this quadric has the structure of an inversive plane. 
Its circles are conics, i.e. plane sections of 2. Translating back, the set of lines 
corresponding to a conic is a regulus, one ruling of lines on a hyperbolic quadric. 

Any regulus & has an opposite regulus R°P, the set of lines which are transversals 
to every line through &. (On the Klein quadric 2, #°? is the section by the plane 
perpendicular to the plane defining %.) 

For an intrinsic definition, recall that three pairwise skew lines in 3-space have 
a unique common transversal through any point of one of them; a regulus is the 
set of all common transversals to the three lines. Any three lines of a regulus have 
the same set of transversals; this set is the opposite regulus. 

Each point on a line of & lies on a unique line of R°?. This means that we 
can replace the lines of ® in a regular spread by those of #°P, obtaining another 
spread and hence another translation plane. This process is known as derivation. 

More generally, if any spread contains a regulus, then it can be derived; in 
particular, the regular spread can be derived with respect to any set of pairwise 
disjoint reguli (circles in the Miquelian inversive plane). 

This can be still further generalized. More general sets of points of the Klein 
quadric can be used. For example, suppose that we have a set of circles of the 
inversive plane such that every point lies in 0 or k of them. The corresponding 
reguli of PG(3,k) contain each line of the regular spread 0 or & times; if they are 
all replaced by their opposites, another spread results. See Bruen (1971b). 

More generally, the connection with the Klein quadric has led to a body of 
work relating translation planes, configurations on the quadric, and coding theory 
(Kerdock codes), and has led to the construction of large numbers of planes. See 
Kantor (1983). 

Another idea, due to Ostrom (1966), does not even require a translation plane; 
some lines (forming a so-called replaceable net) of an arbitrary plane of square 
order m? are replaced by a suitable collection of Baer subplanes (of order m). 

A different classification of projective planes by collineation groups was pro- 
posed by Hering (1979). I will take the easiest of Hering’s sixteen cases. We say 
that G acts irreducibly on I if G fixes no point, line, triangle, or proper subplane 
setwise. Suppose that G is irreducible, and that G contains a central collineation. 
Then Hering shows that a minimal normal subgroup of G is either elementary 
abelian of order 9 (and equal to its centralizer), or non-abelian simple (and having 
trivial centralizer). Thus G is contained in either AGL(2,3) or Aut(S), where S 
is non-abelian simple. (This result is in the same spirit as the O’Nan-Scott theo- 
rem; see chapter 12.) The combined efforts of Hering, Walker, Reifart and Stroth 
have determined which possible simple groups can occur, up to one unknown, the 
sporadic group J>. (It is known that J, can act irreducibly on an infinite projective 
plane.) 

There have been many other characterizations of projective planes by symmetry 
properties. Perhaps the most famous is the Ostrom-Wagner theorem, mentioned 
earlier: a finite projective plane admitting a 2-transitive collineation group is De- 
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sarguesian. This result can be strengthened now, using the classification of finite 
simple groups. Kantor (1987) showed the following. 


Theorem 6.2. Let G be a collineation group of the finite projective plane 1] of order 
n, which ts either transitive on flags, or primitive on points. Then either 

(i) I is Desarguesian; or 

(ii) n? +n+1 is prime, and G is solvable. 


Kantor obtained this as a corollary of his classification of almost simple primitive 
groups of odd degree (see chapter 12), since the number of points in a projective 
plane is odd. The implication from flag-transitivity to point-primitivity was proved 
much earlier, by Higman and McLaughlin (i961). In case (ii), the plane is given by a 
difference set in the cyclic group of order n? +n +1. This occurs in the Desarguesian 
planes (but only for n = 2 and n = 8 can G be flag-transitive); no non-Desarguesian 
examples are known. 

The next two results summarize some general combinatorics of projective and 
affine planes. 


Proposition 6.3. (i) If a projective plane of order n has a proper subplane of order 
m, then either n = m? or n > m? +m; the first alternative occurs if and only if every 
line of the plane is incident with a point of the subplane (and dually). 
(ii) An involution of a projective plane of order n has n+1,n+2orn+n'/? 41 
fixed points, forming a line, a line and a point off it, or a subplane respectively. 
(iii) A polarity of a projective plane of order n has at least n+1 absolute points; 
if there are n+1, they form a line if n is even, an oval uf n is odd. 


Notes: (i) A subplane of order '/ of a plane of order n is called a Baer sub- 
plane. No example with n = m*+m is known. The theorem is false for affine 
planes; AG(2,3) is isomorphic to the configuration of inflection points of a cubic 
curve, and is embedded in AG(2,q) for all g = 1 (mod 3), g > 4 (and, indeed, in 
PG(2,4), as the hermitian unital). However, if a projective plane of creer n nas 
an affine subplane of order m <n, then either (n,m) = (4,3), or n > m? — im _ $3 
see Cameron (1980). 

(ii) An involution is a collineation of order 2. It is an elation, a homology, or a 
Baer involution respectively in the three cases. The result is proved by observing 
that the fixed points and lines form a (possibly degenerate) subplane, and that any 
non-fixed line is incident with a fixed point and dually. 

(iii) A polarity is an incidence-preserving map of order 2 interchanging points 
and lines; a point or line is absolute if it is incident with its image. In PG(2, q), any 
polarity has either qg +1 or q*/2 +1 absolute points, as we saw in section 4. 

Gleason (1956) characterized the Desarguesian planes of even order by the 
“Fano configuration” (fig. 6.1), so-called because it was excluded by the Axiom of 
Fano (1892) (!) 


Theorem 6.4. A projective plane is isomorphic to PG(2,24) for some d if and only 
if the diagonal points of any complete quadrangle are collinear. 
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Figure 6.1. The Fano configuration. 


Table 6.1 
Small projective planes 
Order 2 3 4 5 6 7 8 9 10 
Number 1 1 1 1 0 1 I 4 0 


Proposition 6.5. The following objects are equivalent: 
(i) an affine plane of order n with two distinguished parallel classes; 
(ii) @ set of n— 1 mutually orthogonal Latin squares of order n; 
(iii) a sharply 2-transitive set of permutations of an n-set. 


For let {H;,...,H,} and {V,,...,V,} be the two given parallel classes of lines, 
and set {p;j} = H; 1 V;. For each of the n — 1 further parallel classes {Z,,..., Ln}, 
take the Latin square with (i,j) entry k if and only if pj; € Ly. The “equation” 
of each line L, is a permutation: i + j if and only if p;; € L,. The constructions 
reverse. 

Thus the maximum number of mutually orthogonal Latin squares of order n 
is a measure of “how close a plane of order n is to existing”. This function has 
received much attention since the question was first posed by Euler in 1782. 

If we take fewer than n — 1 mutually orthogonal Latin squares, the construction 
yields a net. Nets are discussed further in section 10. (See also chapter 14, section 
7.) 

All known projective planes have prime power order. In the other direction, 
the Bruck-Ryser (1949) theorem was, until very recently, the strongest necessary 
condition for the existence of a projective plane. : 


Theorem 6.6. [f a projective plane of order n exists, where n is congruent to 1 or 2 
(mod 4), then n is the sum of two squares. 


(Cf. chapter 14, section 3.) For small orders, the number of non-isomorphic 
planes is given in table 6.1. 

The non-existence of planes of order 6 follows from the Bruck-Ryser theorem, 
though this special case had been settled much earlier (by Tarry 1901). For order 
10, however, this is the end-product of a massive computation by Lam et al. (1989). 
I would like to sketch briefly the history of this problem. 

Over the course of time, attempts to construct a plane of order 10 gave way to 
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attempts to prove non-existence. The first phase involved looking at collineations. 
If we are trying to construct an object, the larger collineation group we assume, 
the easier our job will be (if the object exists!). Hughes, Hall, Whitesides, Janko 
and many others analysed the possible collineations of prime order. Gradually, the 
possible prime divisors of the order of the collineation group were whittled away. 
Some general results were found by Hughes, using Hasse~Minkowski theory and 
other techniques. For example, we have the following (Hughes 1957). 


Proposition 6.7. A projective plane whose order is congruent to 2 (mod 4) but 
greater than 2 cannot have a collineation of order 2. 


(Such a collineation would necessarily be an elation, by Proposition 6.3(ii).) 

The other line of attack was using coding theory. This provides a technique for 
showing that the plane must contain one of a fairly small number of configurations. 
It is then necessary to try with bare hands to extend each configuration to a plane. 

Let I be a projective plane of order n. Set N =n? +n+1. In the vector space 
GF(2)%, let C be the row space of the incidence matrix of I7. (We take the con- 
vention that points index columns of the matrix.) Now C is a code; the support 
of any codeword (the set of coordinates where it has a non-zero entry) is a set 
of points in the plane which can be expected to have interesting properties. It is 
elementary that, if m is even, then the minimum weight of the code is +1; the 
support of any word of weight n +1 (resp. n +2) is a line (resp. oval). (Cf. chapter 
16 (9.2).) 

If n = 2 (mod 4), then C+ is a subcode of C with codimension 1. This means 
that a set S of points supports a codeword if and only if 


|S L| = |S| (mod 2) 


for every line L. It is also true that all weights in C are congruent to 0 or 3 (mod 4). 
Now we specialise to the case n = 10. Then, by the MacWilliams identities, the 
weight enumerator of C is completely determined by the numbers of words of 
weight 12, 15 and 16. In particular, C must contain a word of weight 12, 15, 16 
or 19. Now, as described, all configurations of these sizes can be determined, and 
each tested for embeddability in a plane of order 10. 

A word of weight 15 gives a 15-set meeting every line in an odd number of 
points, at most 5 (its symmetric difference with a line has cardinality at least 16, 
since it clearly cannot be an oval). This case was excluded by MacWilliams et al. 
. (1973) in their pioneering application of the technique. (In fact, an easy counting 
argument shows that there are exactly six 5-secants, and that these form a complete 
arc in the dual plane; this possibility had been excluded by an earlier computation 
by Denniston 1969b.) As remarked earlier, words of weight 12 are ovals; this case 
was settled by Lam et al. (1983). Two subsequent computations by Lam et al. 
(1986, 1989) excluded the other two possible weights. 

These techniques can in principle be applied to planes of other orders; but 
the scale of the problem, even for the next admissible order 12, puts any further 
applications out of reach for some time. 
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The classification of planes of order 9 is due to Lam et al. (1991). 

The code of a projective plane can be computed over GF(p) for any prime p. 
It is non-trivial if and only if p divides the order n, and contains its dual with 
codimension 1 if and only if p divides n to the first power only. For example, 
the code of the Desarguesian plane of order p” has dimension ey" +1. It has 
been conjectured by Hamada (1973) that the code of any non-Desarguesian plane 
of the same order has strictly larger dimension. For example, the code (mod 3) 
of the Desarguesian plane of order 9 has dimension 37; for each of the three 
non-Desarguesian planes, the dimension is 41. 

Lander (1981) associated a chain of r codes with a plane whose order is exactly 
divisible by p’. If r is odd, the middle code in the chain contains its dual with 
codimension 1, so techniques like those discussed earlier can be applied. 

Despite our lack of knowledge of planes of non-prime-power order, there are 
enormous numbers of planes of prime power order p’ (for fixed p and large r), 
including many non-isomorphic translation planes. In particular, non-Desarguesian 
planes of all prime power orders except 4, 8 and primes are known. The existence, 
of non-Desarguesian planes of prime order is one of the most important open 
problems at present. 


7. Generalized polygons 


Generalized polygons form an important class of geometries including both projec- 
tive planes and GQs; they also serve as ingredients in the construction of buildings, 
to be defined in the next section. 

A geometry has varieties of d different types, with an incidence relation between 
them. It can be represented by a d-partite incidence graph, whose vertices are the 
varieties, two vertices adjacent if they are incident. Now let us specialise to the case 
d= 2. A connected bipartite graph of diameter m which is not a tree has girth at 
most 2m. Generalized polygons are extremal graphs for this bound: a generalized 
m-gon is a geometry with two types of variety, whose (bipartite) incidence graph 
has diameter m and girth 2m, with the property that, for any variety, there is a 
variety at distance m from it. 


Proposition 7.1. (i) A generalized 2-gon is a geometry (with at least two points and 
at least two lines) in which any point is incident with any line. 

(ii) A generalized 3-gon is a generalized projective plane, i.e. a projective plane, 
the direct sum of a point and a line, or a triangle. 

(iii) A generalized 4-gon is a GQ. 


A generalized polygon is called thick if every vertex in the incidence graph has ~ 
valency at least 3. 


Proposition 7.2. (i) In a thick generalized m-gon, vertices of the same type have the 
same valency, if m is odd, all vertices have the same valency. 
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(ii) A non-thick generalized m-gon with m even is the flag geometry of a gener- 
alized m/2-gon. 


(The flag geometry of a geometry & has as points the varieties of @ and as lines 
the incident pairs, or dually.) 

In a thick generalized m-gon, we denote by s + 1 the number of points on a line, 
and by ¢+1 the number of lines on a point. Then s =t if m is odd. The most 
important result is the Feit-Higman (1964) theorem. 


Theorem 7.3. A finite thick generalized m-gon has m = 2, 3, 4, 6 or 8. Moreover, if 
m = 6 then st is a square; if m= 8 then 2st is a square. 


Further necessary conditions are known, derived from the representation theory 
of coherent configurations (cf. chapters 12, 15). These include divisibility conditions 
(expressing the integrality of the multiplicity of eigenvalues, see Higman 1975), and 
(from the Krein bounds) some inequalities due to Higman (1975) and Haemers 
and Roos (1981). 


Proposition 7.4. (i) A thick generalized 4-gon or 8-gon satisfies t < s* and s < t?. 
(ii) A thick generalized 6-gon satisfies t < 8° ands < 0°. 


A curious question related to Proposition 7.4 asks whether there exists a thick 
generalized polygon with s finite and ¢ infinite. All that is known is that there is 
no GQ with s =2 or 3 and ¢ infinite. 

Quite a large number of GQs are known (though they are not as prolific as 
projective planes). By Theorem 7.3(ii), the non-thick ones are complete bipartite 
graphs or rectangular grids. The parameters of the known thick GQs, up to duality, 
are (q,q), (q,q*), (q7, 4°), or (¢—1,q +1), where q is any prime power. The first 
three include the polar spaces of rank 2, except for the hyperbolic (ruled) quadric 
in PG(3,q), which is a square grid. (In the notation of table 4.1, we have s = q, 
t =q'**.) A simple construction for the first non-trivial example, the symplectic 
polar space in PG(3,2), is as follows: the points and lines are the edges and 1- 
factors of the complete graph Kg; incidence is membership. 

A construction due to Tits (1962) ties in several concepts we have met already. 
Let © be an ovoid in PG(3,q), C the code constructed from © as in section 5 (so 
that C has length q? + 1, dimension 4, and weights q” and q* — q). The points are 
. Of three types: a special point 00; GF(q) x ©; and C. Form a graph by joining oo 
to all points of the second type; joining (a,p) € GF(qg) x © to v € C if and only 
if vp = @; and joining v,v' € C if and only if their distance is q*. This graph is 
strongly regular; its (¢ + 1)-cliques are the lines of a GQ with s = q, t= q’, which 
is classical if and only if 0 is an elliptic quadric. 

The simplest class of examples of the fourth type, due to Ahrens and Szekeres 
(1969), is constructed from an oval 0 in PG(2,q), g even. Points are the vectors 
in the underlying 3-dimensional vector space; lines are cosets of the 1-dimensional 
subspaces in ©. 


676 P.J. Cameron 


There are many results on GQs; the theory parallels that of projective planes. 
See Payne and Thas (1984) for a survey. Among many results, I choose just one 
to mention: Walker’s (1977) analogue of Hering’s result on irreducible collineation 

roups. 
. The only known finite thick generalized m-gons with m > 4 are constructed from 
certain exceptional or twisted groups of Lie type. Up to duality, G2(q) gives a 6- 
gon with s =t = q; °D4(q>) gives a 6-gon with s = q, t = q°; and 7F,(q) gives an 
8-gon with s = q, t =q*, q = 274*). (See chapter 12 for these groups.) 

The most important general result about generalized m-gons is the classification 
of thick Moufang polygons: all are classical, i.e. (in the finite case), PG(2,q), a 
rank 2 polar space, or one of the 6-gons or 8-gons just described. (The Moufang 
condition extends, in a natural way, the condition of (L, p)-transitivity for all inci- 
dent pairs p, L (see section 6).) This result is due to Fong and Seitz (1973, 1974) 
in the finite case, and to Faulkner, Tits and Weiss in general (see Tits 1976, Weiss 
1979). 


8. Buildings 


The definition of a building exists in several versions, none of which is simple to 
explain. I will attempt to provide a definition here. Keep in mind that buildings 
form a class of geometries including projective and polar spaces, and standing in 
the same relation to generalized polygons as projective spaces do to projective 
planes. Two recent books (Brown 1989 and Ronan 1989) give good accounts of 
the theory. 

We need yet another reformulation of the notion of geometry. A chamber system 
of rank d is a set € carrying d equivalence relations ~,...,~ 4_;. It is usual to 
assume that these relations intersect pairwise in the relation of equality. 

From a geometry with d types of varieties, we construct a chamber system as fol- 
lows. The chambers are the complete flags (i.e. sets of d pairwise incident varieties, 
one of each type.) We set F ~; F’ if the varieties in F and F’ of all types except 
the ith are the same. (Thus, in a point-line geometry, two flags are “point-related” 
if they involve the same line but different points, and dually.) 

Not all interesting chamber systems come from geometries. For example, a Latin 
square can be conveniently regarded as a chamber system, where the cells of the 
square are the chambers, and there are three equivalence relations: “same row”, 
“same column”, and “same entry”. This is the correct picture of a Latin square in 
experimental design, where the cells might correspond to experimental plots and 
the equivalence relations to significant inter-plot effects. 

A path in a chamber system is a sequence of chambers in which each consecu- 
tive pair of chambers is distinct and satisfies one of the relations ~,. Since these 
relations are equivalences, we may (by omitting vertices if necessary) assume that, 
if Cn ~i Crt and Crit ~; Crsz2, then i ¥ j, A path satisfying this condition will 
be called reduced. A shortest path is necessarily reduced; and in a thin chamber 
complex (in which every equivalence class has size 2), a path (in which successive 
vertices are distinct) is necessarily reduced. 
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In the chamber system of a generalized m-gon, an apartment is a subsystem 
isomorphic to the chamber system of an ordinary m-gon. Then: 

(a) any two chambers are contained in an apartment; and 

(b) the shortest path between two chambers is unique, unless they are opposite 
in every apartment containing them. 

Moreover, these properties characterize generalized m-gons among chamber sys- 
tems of rank 2. 

The definition of a building is of the same form; only the apartments need to be 
changed, and (b) reformulated in such a way that it can be generalized. 

A Coxeter group is a group defined by a presentation with generators xo,...,Xg_1 
and relations asserting that x? = 1 for all i and that (x;x;)* = 1 for suitable integers 
m,;. (It is sometimes convenient to allow some relations of the second type to be 
absent.) A Coxeter group is described by a Coxeter diagram in which the nodes 
correspond to the generators, and x; and x; are joined by m;; — 2 edges (or, if m; jis 
large, by a single edge with label (m;;)). Note that non-edges (mj; = 2) correspond 
to commuting generators; so the Coxeter group of a disconnected diagram is the 
direct product of the Coxeter groups of the components. The dihedral group of 
order 2m has a presentation 


(xo, %1542 =x? = (xox1)” = 1), 


and so has the diagram eo below. Coxeter’s theorem (see Coxeter and Moser 
1957) asserts the following. 


Theorem 8.1. The finite Coxeter groups are those given by disjoint unions of the 
diagrams in table 8.1. Each is a group of euclidean transformations generated by 
reflections fixing the origin. 


Now a Coxeter complex is defined to be a chamber system whose chambers 
are the elements of a Coxeter group, with g ~; A if and only if either g =A or 
gx; —h. (This is the Cayley diagram of the group with respect to the distinguished 
generators, with edges labelled by generators in the usual way.) It is a thin chamber 
complex, in the sense described earlier. 

The Coxeter group of type A, is the symmetric group on the set {0,1,...,7}, 
the gs mee generators being the “adjacent transpositions” x; = (i i+ 1) for 
i=0,...,2—1. It is straightforward to check that the Coxeter ee of this 
. group is isomorphic to the flag complex of the free matroid on {0,1,...,2} (whose 
flats are all the subsets). 

In any chamber complex whose relations are parametrized by the generators 
of a Coxeter group, we can “translate” any simple path into a group element, 
by multiplying together the generators corresponding to the equivalence relations 
used on the path. Condition (b) in the definition of a generalized m-gon can now 
be stated as follows: 

(bb) two shortest paths between the same pair of chambers translate into the 
same element of the Coxeter group. 
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Table 8.1 
The finite Coxeter groups 
Diagram Order 

An o——o——0 --- o——o——o_ (n nodes) (n+1)! 
Cn o——o——0 --- o ==» _ (nm nodes) 2"n! 
Dr ooo +: —— (n nodes) an-lyt 
Es o——__o-—__9—__o——0 51840 

3 
E, —_—o—— 0 0 2903040 

° 
F3 o-——_o—___—_—_0—_0—__o—_—o 696729600 

° 
F4 o—__o oo 1152 
Hy pec) en 120 
Ay Fac) See ee 14400 
rae o (m) 2m 


(In the case of a generalized m-gon, if two chambers are “opposite”, then the 
elements of the Coxeter group corresponding to shortest paths between them are 
either xox; --- (m terms) or x,xq--- (mm terms). But these are equal, as a conse- 
quence of the relations 


xX) = xq = (xor))" = 1 
which hold in the dihedral group of order 2m.) 

Now we define a building to be a chamber complex which satisfies (a) and (bb), 
where the apartments are Coxeter complexes for an arbitrary (but fixed) Coxeter 
group. The type of the building is that of the Coxeter group. The building is thick 
if each equivalence class of each ~; has size at least 3. 

It is an instructive exercise to verify that buildings of type A, are precisely the 
flag complexes of generalized projective spaces; their apartments correspond to 
bases. Thick buildings of this type are just the projective spaces. 

Thick buildings of types C, and D, are polar spaces, and conversely. (The Cox- 
eter complex is the flag complex of the geometry with point set {u;,u;:1 <i <7}, 
the flats being the sets which contain at most one of u; and vu; for each /. 


Finite geometries 679 


This is realized as an apartment in a polar space by a set of vectors satisfying 
f(uj,v;) = 5, f(ui, 4j) = f(vi,0;) = 0, where f is the sesquilinear form.) 

A building is called spherical (or of spherical type) if its apartments are finite. 
Clearly any finite building is spherical! The types of spherical buildings are just 
those given in table 8.1. 

Let J be a subset of the index set of the equivalence relations in a chamber 
system €. Then a connected component for {~;: j € J} is itself a chamber system, 
and is a building (resp. thick building) if @ is. This, together with the Feit-Higman 
theorem, shows that there is no finite thick building of type H3 or H,. 

Let G be a group of Lie type (see chapter 12), defined over a field of charac- 
teristic p. A Borel subgroup B of G is the normalizer of a Sylow p-subgroup. The 
minimal parabolics Pp,...,P4_, of G (relative to B) are the atoms of the lattice 
of subgroups between B and G; the rank d of G is their number. Now define a 
chamber system €(G), as follows: The chambers are the right cosets of B in G; 
and Bx ~; By if and only if P;x = Piy. 

Now, at last, we can state the finite version of the main theorem of Tits (1974). 


Theorem 8.2. The finite thick buildings with rank at least 3 are precisely the chamber 
systems 6(G) arising from finite groups G of Lie type and rank at least 3. 


It should be stressed that this major theorem includes the axiomatisations of 
both projective spaces of Theorem 2.3 and polar spaces of Theorem 4.12, though 
the proof of the latter result takes up the bulk of Tits (1974). As in both of those 
results, finiteness of the geometry can be weakened, to the assumption that the 
building is of spherical type; some more examples have to be added to the list. 

Tits (1986) achieved the next stage by classifying the so-called “affine buildings” 
of rank at least 4. (These have Coxeter groups which are generated by euclidean re- 
flections not all having a common fixed point. Their diagrams are suitable one-point 
extensions of those of the spherical buildings.) For buildings with more general 
diagrams, there are too many for such a classification to be possible: see Ronan 
and Tits (1987). 

Table 8.2 lists the groups of Lie type and the types of the corresponding buildings. 
See table 4.1 of chapter 12 for more information about these groups. 


9. Buekenhout geometries 


In the last section, we had to resort to chamber systems in order to obtain the 
correct definition of a building. There is no doubt that this adds a level of con- 
ceptual difficulty. It turns out that there is a very natural way to attempt to define 
buildings as geometries; it does not quite yield the right definition because too 
many geometries are obtained. Buekenhout made a virtue of this by showing that, 
if we widen the definition just a little more, we get a class of geometries including 
operands for many of the sporadic simple groups as well as those of Lie type. 
Recall that a geometry has a set X of varieties of d different types — this can 
be described by a type map tp: X — A, where A is a d-set of types — with an 
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Table 8.2 
Buildings of Lie type 

Group Type of building 
A,(q) An 
B,(q) Cn 
Cr(q) Cn 
Dn(q) Dn 
E,(q) (n = 6, 7, 8) En 
F4(q) F, 4 
G2(q) i 
7A, (4?) Cin/2| 
*D,(q?) C4 

6 
*D4(q°) he 
£6 (47) F, 
2Bo(q). q= g2d+l At 
2G(q), q= 32d+1 Ay 

& 
2Fy(q). gq = PH"! he 


incidence relation holding between some pairs of varieties. For convenience we 
assume that any variety is incident with itself and with no other variety of the 
same type. 

A flag is a set of mutually incident varieties. Thus a flag has cardinality at most 
d, with equality if and only if it is complete (contains one variety of each type). A 
geometry is called firm (resp. thick) if any incomplete flag is contained in at least 
two (resp. three) complete flags. A geometry is connected if its incidence graph is 
connected. The rank of a geometry is the number of types. 

Let F be a flag of cotype J (that is, A\ tp(F) =J). The residue of F is the set 
of varieties not in F which are incident with every variety in F. It is a geometry 
with type set J, and is firm (resp. thick) if the original geometry is. A geometry is 
residually connected if every residue of rank at least 2 is connected. 

Buekenhout’s idea is that interesting classes of geometries can be axiomatized 
by simple global assumptions (e.g. residual connectedness) together with detailed 
hypotheses about the rank 2 residues. These hypotheses can be expressed by a 
diagram. We identify various interesting classes of rank two geometries by “edge 
labels”. A diagram is a set A with a class 2;; of connected point-line geometries 
for each distinct i,j € A (with the convention that the members of 2); are the 
duals of those of %;;). Now a Buekenhout geometry with diagram A is a residually 
connected firm geometry with type set A, in which the residue of any flag of cotype 
{i,j} is a member of %;; (the points and lines having type i and j respectively). 

We adapt the convention of the last section: the class of generalized m-gons is - 
denoted by m — 2 parallel edges (or by an edge with label (7m), if m is large). 


Proposition 9.1. A building with Coxeter diagram A is a Buekenhout geometry with 
type A. 
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To illustrate, consider a projective space PG(n,q): the varieties of type i are the 
i-fiats, for 0 <i <m — 1. Consider a residue of rank 2 and type {i,j}, with i <j. 
(a) Suppose that j =i+1. Let the flag be 


Uc +--+ C Uj_1 C Ujag C --- C Uy. 


Its residue consists of all i-flats and (i+ 1)-flats X satisfying U;_, C_X C Uj,2, and 
is a projective plane PG(2,q) (a generalized 3-gon). 
(b) Suppose that j > i+ 1. Let the flag be 


Up Ce C Uj COU jay C0 C Uy C Uj Ce C Up. 


The residue consists of the i-flats X with U;_; C X C U;,;, and the j-flats Y with 
Uj-1 C Y C Uj,,;. Clearly any such X and Y are incident. So the residue is a 
generalized 2-gon. 

Stay with geometries over diagrams involving generalized polygons for a mo- 
ment. These are called Tits geometries, or geometries that are almost buildings 
(GABs). Much of the language of buildings can be re-used here; for example, a 
GAB is spherical if its type is one of those listed in table 8.1. There is no finite 
thick GAB of type H3 or H4: the same argument that excludes buildings of these 
types applies. 

There are two ways in which a GAB can fail to be a building: 

(i) It may not be “simply connected”. A covering map between geometries with 
the same diagram is a surjection preserving type and incidence and inducing an 
isomorphism on any rank 2 residue. (Strictly, this is what is known as a 2-covering, 
to distinguish it from a topological covering of the simplicial complex whose sim- 
plexes are the flags.) A geometry @ has a universal cover, which covers every cover 
of S. Now buildings are simply connected, i.e. equal to their universal covers. The 
most we can hope for in a general characterization is to conclude that our geome- 
tries are covered by buildings. But in the finite spherical case, things are nicer, 
because of the theorem of Brouwer and Cohen (1986). 


Theorem 9.2. A finite thick spherical GAB which is covered by a building is a 
building. 


(ii) _It may have nothing to do with a building. A celebrated example is Neu- 
. maier’s A7-geometry (Neumaier 1984), defined as follows. 

The point set is X¥ = {1,2,3,4,5,6,7}. 

The lines are the 3-subsets of X. 

There are 30 ways of giving X the structure of PG(2,2), falling into two orbits 
of 15 under A;. The planes are the objects in one of these orbits. 

Every point is incident with every plane; incidence between the other pairs of 
types is membership. 

This geometry has diagram C, = o——o=—= =>, (The 2-gon and 3-gon are clear. 
Consider the residue of the point 7. This consists of 15 lines, identified with the 
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2-subsets of {1,...,6}; and 15 planes, identified with the partitions of {1,...,6} 
into three 2-sets. (Given lines 712, 734, 756, for example, there are just two ways 
to complete to a PG(2,2), and these lie in different A7-orbits.) This is a GO we 
met earlier.) 

This geometry is simply connected, and so is not covered by a building. But Tits 
(1981) showed that things like this are the only obstructions. 


Theorem 9.3. Let G be a GAB in which every rank 3 residue of type C3 or Hy is 
covered by a building. Then G is covered by a building. 


We observed that there are no finite thick GABs of type H3. Combining this 
with Theorems 9.2 and 9.3, we see that a finite GAB with no C; residues (or, 
indeed, no “bad” C3 residues) must be a building. 

Once we leave spherical GABs, the infinite cannot be avoided: many finite GABs 
have covers which are infinite buildings (see Kantor 1981). Much recent work has 
concentrated on GABs whose universal cover is an affine building. A theorem of . 
Kantor et al. (1987) describes all finite flag-transitive GABs whose universal cover 
is an algebraic affine building defined by a simple algebraic group of relative rank 
at least 2 over a locally compact local field. There are only finitely many universal 
covers, but passage modulo primes leads to an infinite number of finite examples. 

The Buekenhout-Shult theorem 4.13 gives a characterization of polar spaces in 
terms of points and lines only. This has been extended to many other buildings 
and GABs; after work by Cohen and Cooperstein (1983) and many others, the 
best result to date is that of Hanssens (1988). 

Buekenhout exploited the freedom given by his definition in another way, by 
introducing some new classes of rank 2 geometries. The most commonly used, in 
increasing generality, are: 


o——», circle or complete graph: lines are all pairs of points; 


obo, linear space: two points lie on a unique line; 


IT 


o——», partial linear space: two points lie on at most one line. 


If a class of geometries has symbol oS», the class of duals is denoted by ae ey 


A simple result illustrates these. 
Proposition 9.4. (i) A geometry with diagram 
L L L 


o——o 0 --- @——~o 


is the same as a geometric (simple) matroid. 
(ii) A geometry with diagram 


c 
O—_0——_0 
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is an extension of a projective plane of order n. Such a geometry exists only for 
n=—1, 2 or 4; there is a unique example in each case. 


The last part of (ii) is a classical result of Hughes (1965) on extensions of pro- 
jective planes, afforced by the computer proof by Lam et al. (1983) of the non- 
existence of a plane of order 10 containing an oval. 

Many specific diagrams have been explored. Buekenhout collected large num- 
bers of examples, including (as noted) geometries connected with many sporadic 
simple groups; see his paper (Buekenhout 1979) for some of these. A striking 
theorem is due to Sprague (1985). 


Theorem 9.5. A finite geometry with diagram 


L* L 


o——_o——0 


consists of the (i — 1)-, i- and (i + 1)-flats of a generalized projective geometry. 
Geometries with diagram 


c c* 
o> ——0-__=0 


are semibiplanes; see Wild (1980). Some theory has been developed for 


Cc 
o—_o———0 


(extended generalized quadrangles: Cameron et al. 1990). More generally, Bueken- 
hout and Hubaut (1977) considered locally polar spaces 


Cc 
Oo—Oo——_ o_o ::- oO, 


and Del Fra et al. (1991) found the flag-transitive geometries in this class (un- 
der some extra hypotheses). These include geometries for several of the sporadic 
simple groups (Fischer, McLaughlin and Higman-Sims). 
An important recent development concerns the work of Ivanov and his collab- 
_orators on the so-called P-geometries and tilde-geometries. For these, the diagram 


is linear, and all strokes are projective planes (of order 2) except for the stroke at 


one end, which is either oP» (for the points and lines of the Petersen graph) 


or c= (for a particular triple cover of the generalized quadrangle of order 2 


having 45 points). Many of the sporadic simple groups, including the Monster, the 
Baby Monster, and Janko’s fourth group, act flag-transitively on such geometries. 
The classification of fiag-transitive geometries in these classes has recently been 
completed (see Ivanov 1993 for a summary), and has a number of applications in 
other areas. 
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There are many classes of finite geometries which have not been covered in this 
survey. Some, such as designs and association schemes, find their place elsewhere 
in this volume. But one which deserves a mention is the class of partial geometries. 
This class includes both projective planes and GQs, but generalizes them in quite 
a different direction from generalized polygons. 

A partial geometry with parameters (s,t,a), or pg(s,t,a@), is a point-line geom- 
etry having the properties: 

(a) any line is incident with s +1 points, and any two lines with at most one 
point; 

(b) any point is incident with ¢+ 1 lines, and any two points with at most one 
line; 

(c) if the point p and line L are not incident, then exactly a points of L are 
collinear with p (and so also exactly @ lines on p are concurrent with L). 

It is clear that the dual of a pg(s,t, a) is a pg(t,s, a). A projective plane of order 
nisa pg(n,n,n+ 1), while a GQ is a pg(s,f, 1). Bose (1963), who introduced partial 
geometries, used the notation (r,k,t). Partial geometries are conveniently divided 
into four classes: 

(1) 2-designs with A = 1 (the case a = s+ 1) and their duals (the case a =f +1); 

(2) transversal 2-designs with A =1 (the case a =s) and their duals, viz. nets 
(the case a =f); 

(3) GQs (the case a = 1); 

(4) sporadic partial geometries (the remaining cases). 

A net, or pg(s,t,z), is equivalent to a set of t —1 mutually orthogonal Latin 
squares of order s + 1, (The points are the cells, and the lines correspond to rows, 
columns, and entries in each square, as in Theorem 6.4.) 

Bose’s purpose was to prove characterization theorems for some strongly regular 
graphs. The point graph of a partial geometry (whose vertices are the points, 
adjacent if they are collinear), is easily seen to be strongly regular; the intersection 
numbers are rational functions of s,t and a. A strongly regular graph is said to 
be pseudo-geometric if its intersection numbers are of this form (for some integral 
S,t,a; it is geometric if it is the point graph of a partial geometry. Building on 
previous ideas of Bruck for nets, Bose showed the following. 


Theorem 10.1. A pseudo-geometric strongly regular graph whose parameters satisfy 
s>i(t+2)(t-1+ a(t? +1)) 
is geometric. 
This important result has several corollaries. 


Corollary 10.2. The triangular graph T(n) and the square lattice graph L2(n) are 
characterized by their intersection numbers for n > 8, n > 4 respectively. 
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Here T(n) and L2(n) are the point graphs of the (trivially unique) partial ge- 
ometries pg(n — 2,1,2) and pg(# ~— 1,1,1) respectively. The characterizations are 
due to Hoffman (1960) and Shrikhande (1959) respectively. The inequalities are 
best possible. 


Corollary 10.3. A net of order n and deficiency d can be completed to an affine 
plane if 


n> }(d‘ — 24° + 2d’ +d —2). 


As we saw earlier, a net is a pg({7 — 1,t,¢), and corresponds to t — 1 MOLS; its de- 
ficiency is d =n — t (the number of extra squares required for an affine plane). The 
complement of its point graph is a pseudo-geometric (m — 1,d — 1,d — 1) graph. If 
the inequality holds, it is geometric, and hence the required squares exist. This 
result is due to Bruck (1963). 

Bose’s result was refined by Neumaier (1979), who derived the same conclusion 
from an inequality on the intersection numbers. This gives a non-existence crite- 
rion for strongly regular graphs: no such graph whose intersection numbers satisfy 
Neumaier’s inequality but fail to be pseudo-geometric can exist. 

Several results about projective planes or GQs have been extended to partial 
geometries, including bounds on subgeometries. Thas and De Clerck (1978) de- 
termined all partial geometries embeddable in projective spaces, extending the 
Buekenhout-Lefévre theorem 4.15. 

Finally, partial geometries provide us with a new “stroke” which can be used 
to enlarge the class of Buekenhout geometries. Examples of its use include the 
3-nets of Laskar (1974), Dunbar and Laskar (1978), and the EPGs of Hobart and 


Hughes (1990), with diagramso—°2 »——-o and o—"o—P8 6 respectively. 


A number of generalizations have been considered (semi-partial geometries, 
partial geometric designs, 1}-designs, partially balanced designs, and so on). One 
further type is the class of near polygons, partial linear spaces in which every 
line contains a unique nearest neighbour to every point. This class includes the 
generalized polygons; near polygons with point diameter 2 are the same thing 
as GQs. Shult and Yanushka (1980) showed that a near polygon with thick lines 


containing a quadrangle has diagram o——o—=~o, Among such geometries are 
the “dual polar spaces”, characterized by Cameron (1982). 
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Introduction 


The theory of finite incidence structures can be divided into three parts: finite 
geometry, theory of block designs and hypergraph theory. It is difficult to tell 
these apart, but roughly speaking finite geometry concerns itself with incidence 
structures that satisfy geometrical, that is, structural requirements, the theory of 
block designs concerns itself with incidence structures that satisfy numerical 
requirements, and hypergraph theory concerns itself with completely arbitrary 
systems of sets. Typically, for designs with a small number of points, or designs 
that satisfy certain inequalities (almost) with equality, it is possible to extract 
structural information from the numerical data; on the other hand, in the general 
case pure chaos reigns, and not much more can be said about designs than about 
arbitrary hypergraphs. A relatively high structured part of design theory is the 
theory of association schemes. These are discussed in chapter 15. In this chapter 
we very briefly discuss the main facts concerning t-designs and a few related 
subjects. Only in the section on Witt designs and Golay codes, the longest of this 
chapter, do we attempt a somewhat fuller treatment. 


1. Generalities 


A (block) design is a pair (X,B) where X is a set (the point set) and @ (the 
family of blocks) is a family of subsets of X (not necessarily pairwise distinct) such 
that some regularity condition is satisfied. Thus, there are many types of (block) 
design; often however, “block design” is taken to be an abbreviation for 
“balanced incomplete block design” (BIBD) (see below). 

If two members of & are incident with the same points then we say that the 
design has repeated blocks. A design without repeated blocks is called simple. 

The (point—block) incidence matrix of a design is the 0-1 matrix with rows 
indexed by the points and coluinns indexed by the blocks of the design, where the 
(p, B) entry is 1 if p € B and 0 otherwise. We shall use / for the identity matrix 
and J for the all-1 matrix of any suitable size. 

The dual of a design (X, @) is the design (@, X); the incidence matrix of the 
dual is the transpose of that of the original design. (Note that we here identify the 
point x with the set of blocks containing x; in general we shall not worry when the 
incidence relation between points and blocks is different from the ordinary 
’ membership relation.) 

A design is called square whenever its incidence matrix is square. A design is 
said to have a polarity whenever its incidence matrix can be written as a 
symmetric matrix. (Note that what are called “symmetric designs” in the 
literature, are square BIBDs, and do not in general have a polarity.) 

When the design is finite, some numbers have conventional denotations. The 
number of points is called v, the number of blocks 5, and the size of the blocks 
(when constant) k. The replication number r(x) or r, of a point x is the number of 
blocks on x. We write just r if it does not depend on x. 
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Let us give names to some important types of design. 

A t-(v, k, A) design is a block design on v points with blocks of size k such that 
any set of ¢ points is covered by (i.e., is a subset of) precisely A blocks. We shall 
always assume that k >t; now a t-(v, k, A) design is also an i-(v, k, A,) design for 
0<i<t, where A, =A and A, =(v —i)A;,,/(K —i) for i<1t. Note that A, =5 and 
A, =r. 

"A Steiner system S(t, k,v) is a t-(v,k,1) design. One also uses S,(t,k,v) as 
synonym for t-(v, k, A). 

A balanced incomplete block design (BIBD) is a 2-(v,k, A) design (where the 
“incomplete”’ originally required that k<v, but that requirement is now often 
forgotten). This concept came from statistics, and some statistical terminology 
and notation is still current; the points are sometimes called “varieties” (that is 
why there are v of them) and the blocks “treatments”. 

A pairwise balanced design (with index A) is a design such that any pair of points 
is in precisely A blocks. Clearly, a BIBD is a pairwise balanced design (PBD) with 
the additional requirement that all blocks be of the same size k. (We shall riot 
need t-wise balanced designs for #2. For some results on these, cf. Kramer 
1983.) 

An (r, A)-design is a pairwise balanced design with index A in which each point 
lies in r blocks. 

A partial linear space (sometimes called “semilinear space”) is a design (X, L) 
with point set X and set of lines & such that no two lines have two points in 
common. The dual of a partial linear space is again a partial linear space. 

A linear space is a pairwise balanced design of index unity, i.e., a partial linear 
space such that any two of its points are joined by a line. (Do not confuse these 
linear spaces with those from functional analysis.) 

A projective plane PG(2,n) is a Steiner system S(2,n+ lntnt 1). An 
affine plane AG(2,n) is a Steiner system S(2, n,n’). More generally, PG(d, q) 
and AG(d, qg) denote the projective and affine space of order g and dimension d. 
(For d =3, these are necessarily coordinatized by a field (everything is finite in 
this chapter), and q is a prime power.) 

A subset of the point set of a partial linear space is called a subspace if any line 
that meets it in at least two points, is entirely contained in it. 


2. De Bruijn—Erdoés and Fisher inequalities and variations 


Often it is possible to show that a design has at least as many blocks as it has 
points. One standard argument goes as follows: let the design have incidence 
matrix A, and suppose that AA’ is nonsingular. Then A does not have more TOWS: 
than columns. For instance, when the design is a BIBD then AA’ =(r—A)I+al 
with determinant kr(r — A)”~’ so that if v > k (and b > 0) then b =v. This is called 
Fisher’s inequality. Many generalizations exist; e.g., for 2s-(v, k, A) designs one 
has the Petrenjuk—Ray-Chaudhuri-Wilson inequality (Petrenjuk 1968, Ray- 
Chaudhuri and Wilson 1975) which states that if v =k +s, then b>(¢). 
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A beautiful argument due to Conway for the case of linear spaces is the 
following. Suppose 6<v. Observe that if the point x is not on the line L then 
r, =k, where k, denotes the size of the line L. Now we have 


ba BIH Dae, <2 lan 23 inb 


L ee 


and hence all inequalities are equalities so that b =u andr, =k, whenever x ¢ L. 
This shows that in a linear space b =v with equality iff the linear space is a 
(possibly degenerate) projective plane, a result first published by De Bruijn and 
Erdés (1948) (Hanani 1951 says he found this in 1938; see also Motzkin 1951.) 

Ryser (1968) proved b > v for pairwise balanced designs (that are nondegener- 
ate: A>0, v>1 and r, >A for each x), thus generalizing both Fisher and De 
Bruijn—Erd6s. (Proof: Again AA’ is nonsingular.) 

In case a group action is given we have analogous results: Let v and b be the 
number of point and block orbits of some design under a group G. If the design is 
square, with nonsingular incidence matrix then v = 6 (Brauer 1941, cf. Parker 
1957, Hughes 1957, Dembowski 1958). (Proof: In case G = (g), let the permuta- 
tion matrices P, Q represent the action of g on points and blocks, so that 
PAQ =A. Now trQ=trA 'P''A=trP '=trP, so that g fixes as many points 
as blocks. In the general case apply Burnside’s lemma.) In the general case, 
where the incidence matrix need not be square and has rank p we have (Block 
1967) 0<b +u-—p and 6 <0 + b-—p. (This is a purely combinatorial result: it 
holds when v and b are the number of point and block classes in a tactical 
decomposition.) When A has rank v (as is the case for 2-designs) we find v <b. 
Similarly, generalizing the Wilson—Petrenjuk inequality, Kreher (1986) showed 
for a 2s-(v, k, A) design with v = k +s that the number of block orbits is at least 
the number of orbits of G on s-sets. 

A planar space is a linear space in which certain subspaces are called planes, 
and one requires that any three noncollinear points determine a unique plane. 
Hanani (1954/5) shows that p = v if p is the number of planes, and that equality 
holds iff the space is a projective 3-space. More generally, in any matroid with v 
points and A hyperplanes (cf. chapter 9) we have h=v (cf. Motzkin 1951); 
Conway’s proof generalizes to this situation since r, >k,, holds by induction. 
Equality holds if and only if we have a (possibly degenerate) projective space. 


3. Square 2-designs 


Cases of equality in one of the above inequalities are especially interesting. 

Ito calls a 2s-design tight when equality holds in the Wilson—Petrenjuk 
inequality, and Ito (1975, 1978), Enomoto et al. (1970) and Bremner (1979) 
determine all tight 4-designs. (The only example up to complementation is the 
unique Steiner system S(4, 7,23) with b = 253.) Peterson (1977) shows that no 
tight 6-designs exist. It should be feasible to show that no tight 2s-designs exist 
with 2s > 6. (Cf. Deza 1975, Bannai 1977.) 
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Ryser (1968) shows that a nondegenerate pairwise balanced design that is 
square (i.e., has b = v) either has constant k and r (i.e., is a square BIBD) or has 
precisely two distinct block sizes k, and k,, and k,+k,=v+1. The dual of a 
design of the latter kind is called a A-design. (But note that specifying A will cause 
confusion with the concept of t-design.) Examples may be obtained by starting 
with a square 2-(v,k,k — A) design and replacing all blocks B containing a fixed 
point x, by {x)}UX\B. The A-design conjecture says that all examples are 
obtained in this way. See, e.g., Bridges (1983), Woodall (1970), Shrikhande and 
Singhi (1976) and Seress (1987). 

There exists a large body of literature on square BIBDs; they are usually called 
“symmetric designs’, Dembowski uses ‘‘projective designs”, here we shall call 
them SBIBDs. The most popular examples are projective planes PG(2, q) and, 
more generally, the designs of points and hyperplanes in a projective space 
PG(d, q). For a list of all known SBIBDs, see the appendix. Let us list some 
important properties here. By definition b =v, and it follows that r=k and 
k(k — 1) =(v —1)A. The dual of an SBIBD is again one (since AJ = JA = kJ and 
hence AA™=(r—A)I+AJ=.A'A), and it follows that any two blocks have A 
points in common. For an SBIBD the following parameter restriction is known. 


Theorem (Bruck—Ryser—Chowla). [f a SBIBD has parameters 2-(v, k, A) then the 
following holds. 

(i) (Bruck and Ryser 1949, Schutzenberger 1949, Shrikhande 1950). If vu is 
even then k — X is a square. 

(ii) (Chowla and Ryser 1950). If v is odd then the equation z’ = (k—A)x’? + 
(-1)°"? "ay? has a nontrivial integer solution. 


Proof. (i) follows from (det A) =k?(r-a)’); (ii) expresses the fact that the 
quadratic forms ee On a,x," and (k —A) Uijey4) + AD, -y x,)’ are equiva- 
lent. In fact (ii) is necessary and sufficient for the existence of a rational matrix A 
satisfying the abovementioned equations. O 


Condition (i) e.g., rules out 2-(22,7,2); (ii), e.g., rules out PG(2, 6), ie., 
2-(43, 7,1). No other restrictions are known, i.e., no case is known of a 
parameter set for a SBIBD passing the Bruck—Ryser—Chowla criterion (and 
Fisher’s inequality), while it is known that no corresponding design exists. 
(However, just recently Lam has announced that exhaustive computer search 
shows that no projective plane of order 10 exists.) 

There is an analogue of this in case a group action is given (Hughes 1957): Let 
a be an automorphism of prime order p of a SBIBD, with f fixed points. Then a 
has m=f+(u—f)/p point and block orbits, and tf we put n=k-—A then the 
equation 2° = nx” + (-1)°"""p!*" Ay” has a nontrivial integer solution. 

Lander (1983, Theorem 3.20) gives the following rather strong condition on p 
and f: Suppose that p is odd and that we have q' = —1 (mod p) for some j and for 
some prime q dividing the square free part of n. Then f is odd. 
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A bound on f is given by Wilbrink (cf. Lander 1983, (3.7)) (using Haemers’ 
interlacing result, see Haemers 1980, (3.1.1)): Let @ be a nonidentity auto- 
morphism of a SBIBD with f fixed points. Then f <u —2n and f <k + Vn, and if 
equality holds in either inequality then a must be an involution and every nonfixed 
block contains precisely A fixed points. 


Corollary (Feit 1970). f<v/2 and if equality holds then vu =4n and a is an 
involution. 


More precisely, if k, is the average number of fixed points on a nonfixed block, 
then Af <k,(k + Vn). Trivially k,<A, but sometimes sharper bounds follow 
from the observation that each orbit of a nonfixed block yields an equidistant 
constant weight code with word length v—f, weight k-—k, and Hamming 
distance 2(k — A). Another trivial bound is: if f >0 then p<k. 

A difference set D in a group G (written additively) is a set such that 
(G, {D +g|g€G}) is a SBIBD; equivalently, |G| =v, |D| =k, and in the list of 
differences d, —d, (d,, dD) each nonzero group element occurs precisely A 
times. (More generally, one considers difference families and designs where the 
block set is the union of several G-orbits. This is called Bose’s (1939) method of 
symmetrically repeated differences. One calls the given representatives for the 
orbits base blocks or initial blocks and constructing the G-orbits developing 
(mod G).) 

An important example was given by Singer (1938) who showed that the design 
of points and hyperplanes in the projective space PG(d, q) has a cyclic difference 
set (i.e., a difference set in G = Z,). 

The most important result on difference sets is the multiplier theorem. 


Theorem (Hall and Ryser 1951). Let D be a difference set in G=Z, and suppose 
that p is a prime with p|k — A, (p,v)=1, p >A. Then p is a multiplier of D, i.e., 
pD=D +e for some eEG. 


Many generalizations exist. For a discussion of multiplier theorems see Ryser 
(1963, chapter 9), Hall (1967, chapter 11), Dembowski (1968, pp. 87-90), Lander 
(1983), Beth et al. (1985, chapter 6). One may always choose D so as to be fixed 
by a given multiplier p (Mann 1965) (for: p induces an automorphism of the 
design and fixed the point 0 hence must also fix some block D), and, in case G is 
abelian, one may even find a D which is fixed by all (numerical) multipliers of D 
(McFarland and Rice 1978). 

A lot has been done in the special case where A = 1 (‘planar difference sets’), 
see, for example, Jungnickel and Vedder (1984) and Wilbrink (1985); for a 
survey, see the books mentioned. 

For a table of parameters of the known symmetric designs, see the appendix. 
Dembowski (1968, pp. 105-108), Baumert (1971), Hall (1974), Kibler (1978), 
Lander (1983), and Beth et al. (1985, chapter 6) survey the known difference 
sets. A good discussion of difference sets, and an update to Lander’s tables can be 
found in Jungnickel (1992). 
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4. Inequalities in 2-designs 


Consider the incidence matrix A of a BIBD, so that AA‘ =(r—A)1 +AJ. Since 
AA’ has spectrum (kr)'(r —A)’~' (with multiplicities written as exponents), it 
follows that A‘A has spectrum (kr)'(r — A)”’"'0"~’, and hence the matrix Q = (r — 
A)I + (Ak/r)J — A‘A is positive semidefinite (it has spectrum 0°(r - A)’~"). It 
follows that if Q,, is a principal submatrix of Q of order m, then det O,, = 0 (and 
det Q,,=0 for m>b-—v). This result is known as Connor's inequalities, cf. 
Connor (1953). The case m= 1 of these inequalities gives us Fisher’s inequality 
again. The case m = 2 of these inequalities says that for any two blocks B, C we 
have 


2Ak 
ktA-r<|BNC|<r-k-At—. 


These inequalities are sometimes called Majurdar’s inequalities, although Connor 
already states them explicitly. Majumdar (1953) observed that we have equality in 
the left-hand inequality if and only if |B MN D|=|CNMD| for all blocks D4 B, C 
(provided B and C are not incident with the same set of points), and that equality 
in the right-hand inequality holds if and only if |B M Dj +|C NM D| =2ak/r for all 
blocks D ¥ B, C. (Indeed, let e, be the Bth unit vector. Then, since Q is positive 
semidefinite, we have for blocks B, C: 


0<(e, —€.)' Oleg — ec) =A|BNC|—k-At7) 


with equality if and only if Qe, = Qe,. Thus, meeting in kK +A—r points is an 
equivalence relation, and equivalent blocks meet other blocks in the same number 
of points. Conversely, if 8 and C meet all other blocks in the same number of 
points, then O(e, — e-) =(|BNC|—k—A+ Ale, —€-) so that |BNC|—k-A+ 
r is an eigenvalue of Q, i.e., 0 or r—A, and hence |BNC| is either k or kK +A-—r. 
Similarly, we have equality in the right-hand inequality above if and only if 
Qe, = —Qe,.) See also Bekers and Haemers (1980). It follows that if a BIBD 
contains two disjoint blocks, then 7 =k +A,i.e.,b2v+r-—1, a strengthening of 
Fisher’s inequality. A bound on the maximum number of blocks disjoint from a 
given block was given by Majindar (1962). 

Another direct consequence of Connor’s inequalities is Mann’s inequality 
(Mann 1969): If in a BIBD with v >k a block is repeated m times (i.e., if m 
blocks are incident with the same set of points), then b = mv. (Indeed, for the 
corresponding principal submatrix we find Q, = (r—A)i— (k(r—A)/r)J with 
eigenvalue (r — A)(1 — km/r) =0.) 

A very good discussion of these results is contained in Wilson (1984); 
generalizations of these results to t-designs are also given there. 


5. Derived and residual designs, extensions 


Given a design @ = (X, B) and a point x € X, the derived design (at x) is defined 
as B= (X\{x}, {B\{x}|xE€BEB}) and the residual design (at x) as D* = 
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(X\{x}, {B |x BE B}). Conversely, @ is called an extension of B,, and Q, is 
called extendable. In case @ is a t-(v, k, A) design we find that 2, is a (¢ — 1)-(v — 
1,k—1, A) design, while 9” is a (¢— 1)-(v — 1,k, A,_, — A) design (where A,_, = 
A(u -t+1)/(A-—t+1)). 

Sometimes one can guarantee that an extension exists. Alltop (1975) shows that 
if t is even then any t-(2k + 1,k, A) design extends to a (f+ 1)-(2k +2,k +1, A) 
design. (Proof: Invent a new point © and add it to the existent blocks; add the 
complements of all blocks.) When f is odd, we have the following result due to 
Alltop (1975) and Dehon (1976): if a (f + 1)-(2k + 1, k, A‘) design is the union of 
two t-(2k + 1,k, A) designs then a (¢ + 1)-(2k +2,k +1, A) exists. 

Cameron (1973) has determined what SBIBDs can have an extension. He finds 
that one necessarily has one of the following cases: 

(i) v=4A4+3,k=2A+1,a Hadamard design (cf. the appendix (2)); 

(ii) v= (A+ 2)(A72 + 4A 42), K=A74+3A41; 

(iii) v= 111, K=11, A=1; 

(iv) v = 495, k= 39, A=3. 

(Proof. The extension is a design with only two intersection numbers (namely 0 
and A+ 1), ie., is quasi-syrmmetric, and hence carries a strongly regular graph (cf. 
Goethals and Seidel 1970). Now apply the standard restrictions on parameters of 
strongly regular graphs.) 

Concerning case (ii): one may show the following: If there exists a ie 
regular graph I with parameters (vu, k, A B)=(At (At 4, (A+ 17+ 
5A +5), 0, (A+1)(A+2)) then it has g}, =0 so by Cameron et al. (1978) its 
subconstituents are again strongly regular, and we find a strongly regular graph A 
with parameters (v, k, A, wh) =((’ +4 + 2)(A? +5A+5), (A+ 1)(A +3), 9, 
(A+1)’). By Haemers (1980) the »-graphs in F are strongly regular and we find 
a strongly regular, graph E with parameters (v,k, A, w)=((A+1).A + 3)(A? + 
4A +2), (A+ 1)(A7+3A 41), 0, A(A+1)). Finally, on the set of neighbours of 
any vertex in IT we have a 3-design 3-(k,, 4p, A), the extension of a symmetric 
design with parameters as in (ii) above. Conversely, given a 3-design with these 
parameters we may construct a strongly regular graph with the parameters of I, 
so that extendible SBIBDs with parameters (ii) coexist with strongly regular 
graphs I. The only known examples are the cases A = 0 (Clebsch graph; Petersen 
graph; K,,>; degenerate design 3-(5,2,0)) and A=1 (Higman-Sims graph; 
77-graph; Gewirtz graph; unique design S(3, 6, 22)). Probably there are no other 
examples; the case A = 2 has been attacked by Bagchi (1988), corrected in Bagchi 
(1991), but unfortunately the amended proof still has a gap. 

Concerning case (iii) (that of an extendible projective plane of order 10): Lam 
et al. (1983) have conducted an exhaustive computer search and found no partial 
geometry pg(6, 9, 4); this result implies that a projective plane of order 10 cannot 
have ovals, and a fortiori cannot be extendible. 

In the case of an SBIBD @ the standard terminology is slightly different: if D* 
is the dual of 2 then one usually calls Z, = ((@*),)* and B® = M*** the derived 
and residual design (at the block B). If @ has parameters (b, v,r, k, A) then B, 
and 2° have parameters (in the same order) (v—1,k,k—1,A,A—1) and 
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(v—1,u—k,k,k—A, A), respectively. In particular, a residual design satisfies 
r=kt+A. 

A BIBD is called quasi-residual if r=k + A, or, equivalently, if b+1=uv +r. 

Sometimes it is possible to prove that a quasi-residual design in fact must be a 
residual, i.ec., must be embeddable in a SBIBD. It is very easy to see that an 
affine plane (in the finite case: a 2-(n”, n, 1) design) is embeddable in a projective 
plane (in the finite case: a 2-(n? +n + 1,n + 1, 1) design). 

Hall and Connor (1954) and Connor (1952) showed that also quasi-residual 
designs with A = 2 are embeddable. (This can be used to prove nonexistence: for 
example, a 2-(15,5,2) BIBD cannot exist since it would be embeddable in a 
2-(22, 7,2) SBIBD, but by Bruck—Chowla-Ryser this latter design does not exist. 
The proof is by observing that such a design only has two possible intersection 
numbers (1 and 2) and hence is quasi-symmetric. The associated strongly regular 
graph has the parameters of the triangular graph and by Connor (1958), 
Shrikhande (1959a,b), Hoffman (1960), Chang (1959) must be triangular, except 
when k = 6. The latter case is settled by an ad-hoc argument (no design exists), 
and in the triangular case it is easily seen that the design is embeddable.) 

For A = 3 there exist examples of nonembeddable quasi-residual designs. E.g., 
there exist designs with parameters 2-(16, 6, 3) where some pair of blocks meet in 
4 points (Bhattacharya 1944), and such designs obviously cannot be embedded in 
a 2-(25,9,3) design. Bose et al. 1976 (see also Neumaier 1982) showed for 
quasi-residual designs that given A, if & is sufficiently large then the design is 
embeddable. Kelly (1982) considers the question in what cases an embedding 
must be unique (when it exists). 


6. Existence and construction of t-designs with large ¢ 


Let us first look at the positive side: how does one construct t-designs with, say, 
t>3? 

The easiest way to obtain a t-(v, k, A) design is by taking the orbit of a k-set 
under a group G acting ¢-transitively (or t-homogeneously) on a v-set; of course 
one obtains in this way precisely those designs that admit a block-transitive 
automorphism group. (See, e.g., Hughes 1965.) For 12 all such groups G (and 
corresponding actions) are known (see chapter 12), and for ¢> 5 there are no such 
groups other than the symmetric and alternating groups. 

When this easy scheme does not work, one might try to combine several 
G-orbits, where G is a group that is less than ¢-transitive. This method has been 
used successfully by Doyen (1974), Kramer and Mesner (1976), Denniston 
(1976), Brouwer (1977a,b), Mills (1978), Kramer (1984), Leavitt and Magliveras | 
(1984) and others. When the number of orbits to be combined gets larger than a 
few dozen then advanced techniques are necessary to keep the required computa- 
tion time within realistic bounds; Kreher and Radziszowski (1986) used a variant 
of the L* (Lenstra—Lenstra—Lovasz) lattice reduction technique (see chapter 19) 
to find a 6-(14, 7, 4) design, the smallest possible nontrivial 6-design. Today many 
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t-designs with <5 are known (infinite families of 4-designs have been con- 
structed by Alltop (1969), Hubaut (1974) and Driessen (1978); Alltop (1972) 
constructs an infinite series of 5-designs). The first 6-designs (with parameters 
6-(33, 8, 36) and 6-(20,9,112)) were constructed by Leavitt and Magliveras 
(1984), Kramer et al. (1985) 

Another method that has produced some good designs was first given by 
Assmus and Mattson (1969) (see also Pless 1969, 1970, 1972 and MacWilliams and 
Sloane 1977, (chapter 16, section 8); if we have a linear code € with word length 
n and minimum distance d, and an integer ¢ with 0<¢<d such that at most d—-1 
of the weights w of the dual code €* are in the range 1<w<n—t, then the 
supports of the codewords of weight d in © form a t-design. In this way one finds 
several 5-designs with uv € {12, 24, 36, 48,60} such as, e.g., 5-(24,9,6), 5- 
(36, 12, 45) and 5-(48, 12, 8). 

Finally one has recursive constructions: construct a design using smaller designs 
as building blocks. For f = 2 this approach is highly successful (cf. below, sections 
7-10), and for t=3 various constructions are known, but until recently there 
were only very few results for f>3. For example, Tran van Trung (1984) shows 
that if there exists a t-(v,k, A) design, and A is not too large then there exists a 
t-(v +1, k, (v +1—2)A) design (by putting v + 1 disjoint copies of the given design 
on au +1 set) —starting with Alltop’s designs this yields a new infinite family of 
5-designs. 

A very ingenious recursive construction was developed by Teirlinck (1987) (see 
also Teirlinck 1989, 1992). Using the symmetric group in a highly nontrivial way 
he constructs ¢-designs without repeated blocks for all ¢. 

When repeated blocks are allowed it is a simple exercise in linear algebra to 
show that t-designs with arbitrarily large t do exist. In fact Wilson (1973) shows 
that given t, k and v (with t<k<v) there is a constant A such that a ¢-(v, k, A) 
design (possibly with repeated blocks) exists for all A> A satisfying the obvious 
condition that each A, is integral. 

Only finitely many Steiner systems S(t,k,v) with t24 are known, namely 
S(5, 6, 12), S(5, 8, 24) (Witt 1938), S(5, 6, 24), S(5, 7, 28), S(5, 6, 48), S(S, 6, 84) 
(Denniston 1976), S(5, 6,72) (Mills 1978), S(5, 6, 108) (Griggs) and the derived 
4-designs. The other known Steiner systems are the following: with = 3 we have 
the Mobius geometries 5(3, q + 1, q” + 1); Hanani (1960) showed that an S(3, 4, v) 
(called Steiner quadruple system) exists iff v=2 or 4 (mod 6). (For a survey on 
Steiner quadruple systems, see Lindner and Rosa 1978.) Hanani (1979) gave vari- 
Ous recursive constructions for 3-designs. With ¢ = 2 many constructions are known; 
the “geometric” ones are: affine geometries S(2, q, q"), projective geometries 
5(2,q+1,(q"*! —1)/(q — 1)), unitals S(2, ¢ + 1, g° +1) and arcs in a plane of 
even order (Denniston 1969) S(2,2’,2*' +2’ — 2°) (r<s). Recursive construc- 
tions (see below) show that the necessary conditions v = 1 (mod k — 1) and v(v — 
1) = 0 (mod k(k — 1)) for the existence of some S(2, k, v) are sufficient if v is large 
€nough, and in fact always suffice for k <5 (Kirkman 1847, Hanani 1961, 1965). 
For a discussion of the situation for k = 6 see Mills (1984), Mullin et al. (1987) and 
Assaf (1988a). Infinite classes of cyclic Steiner 2-designs including a “unital” 
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S(2,7, 217) are constructed in Mathon (1987) and Bagchi and Bagchi (1989). 
Doyen and Rosa (1980) contains an extensive bibliography on Steiner systems. 

Of course one conjectures that Steiner systems exist for arbitrarily large t, but the 
number of blocks quickly becomes too large for present-day computers to handle. 

Concerning uniqueness, the following is known. The projective and affine 
planes of orders 2, 3, 4, 5, 7 and 8 are unique (MacInnes 1907, Hall 1953, Hall et 
al. 1956). (But one knows at least 4 projective planes and 7 affine planes of order 
9, cf. Kamber 1976, Hurkens and Seidel 1985.) (In the meantime it has been 
shown by exhaustive computer search that there are precisely 4 projective planes 
of order 9, and no such planes of order 10.) It is possible that all projective planes 
of prime order are desarguesian; for all nonprime prime power orders larger than 
8 also nondesarguesian planes are known (Dembowski 1968, p. 144). Also the 
affine space AG(3, 2)= S(3,4,8) and the Mo6bius (or inversive) planes 
MG(2, g) = S(3,¢+1,q°+1) with q=3, 4, 5, 7 (Witt 1938, Chen 1972, 
Denniston 1973a,b) are unique. When gq is an odd power of 2 (28), then one has 
the Suzuki inversive plane besides the classical one. No other Mobius planes are 
known. Finally, the Witt systems 5S(4,5,11), S(5,6,12), S(4,7,23) and 
S(5, 8, 24) (Witt 1938) are unique. In probably all others cases, when a Steiner 
system is known, then in fact at least two nonisomorphic systems are known. 

On the negative side there are not too many results either. A first obvious 
restriction is that there cannot be a ¢t-(v,k, A) when no (t—i)-(U —i,k—#, A) 
exists. This observation (with i=f—2) together with Fisher’s inequality yields 
(kK —¢+1)(k —£+ 2) <A(v —¢+ 1) (Van Tilborg 1976). Some general restrictions 
on the sizes of codes and designs in association schemes have been given by 
Delsarte (1973) (the “linear programming bound”) and some specific conse- 
quences are listed below. For Steiner systems we have the following additional 
results. Dembowski (1964) shows that inversive planes (i.e., designs S(3,” + 
1,n’+1)) of even order n have order a power of two. Tits (1964) shows for 
Steiner systems S(t, k, v) that (k — t+ 1)(¢+ 1) <v (which is satisfied with equality 
for S(3, 4, 8), S(5, 6, 12) and S(5, 8, 24) but otherwise seems to be very weak — it 
tules out e.g., S(10, 16,72) but already nonexistence of 5(4, 10,66) is known 
(Kantor 1974, Cameron 1977, Denniston 1978); see also section 3 of Cameron 
1980). Other special nonexistence results for Steiner systems are: there is no 
5S(4, 6,18) (Witt, 1938), and no S(4,5,15) (Mendelsohn and Hung 1972). 
Concerning a hypothetical S(4, 5, 17), Denniston (1980) has shown that any such 
system has trivial automorphism group. For arbitrary f-designs we have: there is 
no 3-(11,5,2) (see Oberschelp 1972, Dehon 1976) and no 4-(17,8,5), 6 
(19,9, A), A<10, 6-(20, 10, A), A=7, 14, 4-(23,8, 2), 4-(23, 11, A), A=6, 12 or 
4-(24, 12,15) (Delsarte; Haemers and Weug 1974), or 4-(17,7,2), 5-(19, 9, 7), 
12-(29, 14, 4), 10-(29, 13, 3), 22-(53, 25, 5), 26-(58, 28, 8) (Kohler 1985, 1988/89, 
see chapter 15). 

For tables, see Brouwer (1977b) (v < 18), Driessen (1978), Gronau (1985), 
Chee et al. (1990) (v <30), Hanani et al. (1983) = 3, v <32) and Mathon and 
Rosa (1985) (¢=2,r<41), with some additions in Abel (1994). Kramer and 
Mesner (1975) list admissible parameters for Steiner systems S(t, k,v) with 
v— t= 498. 
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7. Mutually orthogonal Latin squares 


As a preparation for the discussion of t-designs with t= 2 we must first consider 
an auxiliary structure that also has independent interest. 

A Latin square (of order n) is an nm Xn matrix with entries in a set S of 
cardinality n such that each row and each column is a permutation of S. (A 
standard reference on Latin squares is Dénes and Keedwell 1976; a good survey 
can be found in Jungnickel 1984.) 

Two Latin squares A and B with entries in S resp. T are called orthogonal if 
Sx T= {(a,;,5,)|1<i, <n}. In old (recreational) literature such a pair was 
often called a Graeco—Latin square, and one used S= {a,b,c,...} and T= 
{a, B,y,..-}. Example: 


aa bd cB ay 
by aB dé ca 
c& da ay bg 
aB cy ba a6 


A lot of research has been devoted to the question: What is the maximum size 
N(n) of a collection of mutually orthogonal Latin squares of order 2? 

The main results are the following: 
e N(0) = M(1) = +, 
oN(n)<n-1 forn2=2, 
e N(q) =q —1 for prime powers q, 
e (MacNeish 1922) If n =I], p{‘ with p, prime then N(n) = max,(p;'— 1), 
® (Chowla, Erdds and Straus 1960) lim,_,.. M(t) = +©. 
In view of this last result we may define n, as the largest integer for which no r 
mutually orthogonal Latin squares (MOLS) exist. (Thus, M(n) =r for n >n,.) 

Our knowledge about n, is summarized by 2, =6 (Tarry 1901, Bose et al. 
1960), 2, < 10 (Wang and Wilson 1978, Todorov 1985), n, <= 42 (Abel 1991, Abel 
and Todorov 1993), n, <62 (Hanani 1970), n,<76 (Wilson 1974, Wojtas 1980), 
n, <780, ng 2846, n,<4030, 1,, 6148, n,,<7222, n,. <7286, m,, <= 7288, 
n,, < 7874, n,, < 8360, n,, < 52502 (Brouwer and van Rees 1982, Brouwer 1980), 
and for large r we have the estimate n, = O(r'**) (Beth 1983); see also Chowla et 
al. 1960, Rogers 1964, Yuan 1966, Wilson 1974. 

Usually one does not work with MOLS but translates the problem into the 
language of transversal designs — see the following two sections. 


8. Group-divisible designs and transversal designs 


A parallel class is a collection of blocks partitioning the point set of a design. A 
group-divisible design is a linear space in which a parallel class has been singled 
out. The blocks in this parallel class are renamed groups (and no longer counted 
as blocks). Note that it is meaningful to have groups consisting of a single point. 
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Remarks. (1) Various extensions of this concept exist (and are sometimes also 
called group-divisible); all are special cases of the concept of a partially balanced 
incomplete block design (PBIBD) where one asks for a system in which the point 
set carries an association scheme (cf. chapter 15) with n classes, and there are n 
indices A, such that two points in relation i are joined by precisely A; blocks. Our 
group-divisible designs have as underlying two class association scheme (strongly 
regular graph) the union of a number of disjoint cliques, and A, =0, A, =1. We 
shall use group-divisible design of index A for the case where one has a partition 
into groups, and points from different groups are joined by A blocks (i.e., A, =0, 
A, =A). 

(2) Some authors write “part” or “level” or “‘groop” instead of “group”. Many 
authors use “divisible’’ without qualifier. 


There is a 1-1 correspondence between linear spaces on v + 1 points with one 
singled out point and group-divisible designs on v points: Given the linear space, 
call the special point © and throw it away; call the sets G\{«}, where G was a line 
through ~, groups. Conversely, given the group-divisible design, invent a new 
point ©, add it to the groups and add the resulting sets to the collection of lines. 
This process is called “adding a point at infinity”. 

If a linear space has several mutually disjoint parallel classes then one may add 
a point at infinity for each of the parallel classes and finally join the points at 
infinity by a line at infinity. This process is often called ‘completion’. Of course 
the obvious example is the construction of a projective plane starting from an 
affine plane. 

A transversal design (of index A) is a group-divisible design (of index A) such 
that each block (i.e., nongroup) meets each group in precisely one point. It is 
easily seen that (if there are at least three groups then) all groups have the same 
size g, say, and all blocks have the same size k. Furthermore, we have v = gk, 
b=Ag’, r=Ag. 

There is a 1-1 correspondence between transversal designs (i.e., transversal 
designs of index unity) with parameters g, k and sets of kK — 2 mutually orthogonal 
Latin squares of order g: each of the g’ blocks corresponds to a (the same) 
position in each of the Latin squares — given a block, its point on the first and 
second groups determine row and column, and its point on each of the remaining 
groups determines the entry in the corresponding Latin square. 

One of the advantages of using transversal designs (instead of MOLS) is that 
One treats rows, columns and symbols uniformly. Another is that it brings some 
geometric flavour to the problems. 

Let us look at some of the statements made about N(n) in the previous section. 

Given a block B and a point x outside, we see k — 1 blocks (and one group) on 
xX meeting B, so that kK —1<g, ie., N(g) Sg —1. , 

Starting with a projective plane of order q one finds (by throwing out a point ~) 
a transversal design with (k, g) =(q+1,q), which shows M(q) =q-— 1. 

MacNeish’s result follows by a simple direct product construction (formulated 
with equal ease in the language of Latin squares, transversal designs or quasi- 
groups). 
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For the Chowla—Erd6s—Straus theorem we need some recursive constructions 
considered in the next section. 


9. PBD-closed sets 


A set K of nonnegative integers is called PBD-closed if whenever we have a 
pairwise balanced design of index unity with all blocksizes in K then also vu € K. 
This concept was introduced by R.M. Wilson and proves to be extremely useful: 
one has strong information about PBD-closed sets and PBD-closed sets occur in 
many planes in a natural way. 


Theorem (Wilson 1972). Let K be PBD-closed. Then K is eventually periodic with 
period B(K):= g.c.d.{k(k —1)|k EK}, ie., if K intersects the residue class a 
(mod B(K)), then K contains almost all integers k =a (mod B(K)). 


Let us write B(K, A) for the set of all v for which a B(K, A; v) (that is, a PBD 
with v points, index A and all blocksizes in K) exists. Write B(K) for B(K, 1) and 
B(k, A), B(k) in case K = {k}. Now K is PBD-closed iff B(K) = K. 

Some examples of PBD-closed sets are provided by the following. 


Lemma (Breaking up blocks, Hanani 1961, (3.11)). B(B(K)) = B(K), and, more 
generally, B(B(K, A)) = B(K, A) for all K. 


Hanani’s lemma (Hanani 1961, (3.10)). Let k@ K and let R, = {r|r(k —1)+1€ 
B(K)}. Then B(R,) = R,. 


Theorem (Bose et al. 1960). Let G, be the set of integers g for which there exists a 
transversal design with blocks of size k, groups of size g and at least one parallel 
class of blocks. Then B(G,) = G,. 


(These three statements are easy exercises, best proved by a picture.) 

Now we have enough material to sketch a proof of the Chowla—Erd6s-Straus 
theorem: We wish to show that for all k the set G, contains all sufficiently large 
integers. The easiest way to obtain transversal designs with a parallel class is to 
start with a transversal design with one more group and throw this group away, 
now the blocks passing through a discarded point will form a parallel class. Next, 
using MacNeish’s theorem, we see that all numbers with a factorization into 
' prime powers all at least k are in G,. Finally we need a few pairwise balanced 
designs in order to exploit the fact that G, is PBD-closed. Given a transversal 
design with parameters g, k, discard g—A points from one group to obtain a 
truncated transversal design: a group-divisible design with k — 1 groups of size g 
and one group of size h, blocks of sizes kK and k—1, and v =(k —1)g +h points. 
This construction shows that if N(g) =k —2 and g=h>O0andg,h,k-1,kEG, 
then (A —1)g +h€G,. The rest is number theory, not combinatorics. 

One may have wondered why we did not appeal to Wilson’s theorem, but in 
fact Wilson’s theorem is proved using Hanani’s lemmas and the Chowla—Erd6s-— 
Straus theorem. 
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Wilson’s theorem enables us to show that given k and A a 2-(u,k, A) exists 
whenever v satisfies the obvious divisibility restrictions and is large enough. More 
generally one has the following. 


Theorem (Wilson 1972b). B(K, A) contains all sufficiently large integers v with 
A(v — 1) =0 (mod a(K)) and Av(v — 1) =0 (mod B(K)), where a(K):= g.c.d{k — 
1|k@K} and B(K):= g.c.d.{k(k —1)|kK © K}. 


Proof. By the foregoing it suffices to find at least one example in each residue 
class. Using difference sets in finite fields and an “unfolding” construction to 
make designs with A=1 and large v from designs with large A and small v, 
enough examples can be found. See Wilson (1972a,b,c), Brouwer (1979), Beth 
(1985). 0 


10. Steiner triple systems 


Let us try to work out an example and construct Steiner triple systems for all 
possible v. A Steiner triple system (STS(v)), is a Steiner system S(2,3,v), or 
equivalently, a linear space in which all lines have length 3. The condition that r 
and b should be integral forces v = 1, 3 (mod 6). To streamline our construction 
we shall also construct linear spaces with v = 5 (mad 6), this time requiring that all 
lines have size 3 except for one with size S. 

Extend the notation B(K) by priming those block sizes that must not occur 
more than once. Then our claim is that B({3, 5‘}) is the set of all odd integers. 
Clearly all its members are odd. Let R= {r|2r +1€ B({3,5’})}. Clearly R 
contains 0, 1, 2—the corresponding linear space has at most one line. Also 3, 4, 
5 & R — the Fano plane PG(2, 2), the affine plane AG(2, 3) and the completion of 
a 1-factorization of the complete graph K, provide (the unique) examples. By a 
simple variation of Hanani’s lemma we see that B({3, 4, 5'}) CR. We show that 
R=N. Given rE R, if r is odd then by induction r € B({3, 5‘}) CR and we are 
done. If we can write r=3g+h with O<A<g where g=O or 1 (mod 3) and 
either g #6 or h<4 then we find rE R using a truncated transversal design and 
induction. (Here we use the existence of a “truncated transversal design” on 
3-6+4 points —i.e., a Latin square of order 6 with 4 pairwise disjoint transver- 
sals. This is guaranteed by a more general recursive construction due to Hanani 
(1961, (2.12)); let us give such a Latin square explicitly - the transversals are 
indicated with subscripts 


A, B, C, D, E F 
B, A, D, C; F E 
C D F, &£, A, B, 
DC E, KF, B, A, 
E, F, A B C, D, 
F, E, B A D, CG, ~+) 


Block designs 709 


It remains to consider r € {6, 8}, i.e., v € (13, 17}. For v = 13 we may take the 26 
blocks {1,3,9}, {2,6,5} (mod 13) on the point set Z,,. For v = 17 we complete 
the five parallel classes of the design on Z,, with blocks {0, 1,4}, {0,6}, {0, 5}, 
{0,2} (mod12) and parallel classes [{0,6}+ 2], [{0,5}+2i})+0, 1 and 
[{0, 2}, {1,3} + 42] + 0, 2 @ EZ,,). 

This completes our construction and shows that there is a STS(v) for all v = 1, 3 
(mod 6), a result due to Kirkman (1847). 

Many direct constructions of STSs exist, but the above provides one of the 
shorter existence proofs (assuming that N(g)>2 for g #2, 6 is already known) 
and shows the spirit of the constructions of other families of designs with small 
block sizes: a few small cases have to be handled explicitly, and the rest is done 
with the help of recursive constructions. 

Let us, however, also mention some other types of construction. 

A typical difference set construction in a finite field goes as follows: Let q =1 
(mod 6) be a prime power, and let x be a primitive element of F,. Write 
q = 6e + 1. Then the set of blocks {{x', x‘*”, x'*“*} + y|O<i<e, yEF A makes 
F, into a STS(q). (Note that we used this construction above for ‘the case 
v = 13.) 

More generally one has (Wilson 1972a): If g is a prime power and k|2A or 
(k —1)|2A then the necessary condition k(k — 1)| A(q—1) suffices for the exist- 
ence of a 2-(q,k, A) design. (The construction is similar.) 

A completely different approach is that using Skolem sequences. 

A Skolem sequence (of order x) is a partition of the numbers 1, ..., 2” into n 
disjoint pairs (¢,,b;) such that b, —a,=i (i=1,...,n). A Skolem sequence of 
order n exists iff m =0 or 1 (mod 4) (Skolem 1957). Given such a sequence, we 
find a partition of 1,...,3” into n pairwise disjoint triples (i, a, + n, b; +n), and 
using these as base blocks (mod 6n + 1) we find a STS(6n + 1) (Skolem 1958). A 
hooked Skolem sequence (of order n) is a similar partition of the numbers 
1,2,...,2n—1, 2n + 1. Skolem (1958) conjectured and O’Keefe (1961) proved 
that such sequences exist iff n =2 or 3 (mod 4). Again we find Steiner triple 
systems ~a hooked sequence provides a difference 3n +1 where an ordinary 
sequence gives 3n, but mod 67 + 1 these are the same. Hanani (1960) constructed 
similar partitions of 1,2,...,2,2+2,...,2n+1 for n=3 (mod4) and used 
these to construct cyclic STS(6n + 1) for all n. But in fact Peltesohn (1938) had 
already twenty years earlier shown that a cyclic STS (i.e., one with Z, in its group 
of automorphisms) exists for all v= 1 or 3 (mod6) except v = 9. 

' Many other results exist on STSs with specified automorphisms. Just to mention 
one example: Rosa (1972), Doyen (1972) and Teirlinck (1973) showed that STSs 
with an automorphism group containing an involution that fixes only one point 
(“reverse STSs’’) exist iff v = 1,3, 9 or 19 (mod 24). 

Lindner and Rosa (1975) showed that an STS(v) with trivial automorphism 
group exists iff v = 1, 3 (mod 6), v = 15, and Babai (1980), using Wilson’s (1974b) 
estimate (see Aleksejev 1974) on the number M(v) of nonisomorphic Steiner 
triple systems of order v (one has log N(v) ~ (n’/6)log n) shows that almost all 
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STSs have a trivial group of automorphisms (i.e., if A(v) is the number with 
trivial automorphism group then A(v) ~ M(v)). 

Returning to the Skolem sequences, this has grown into a subject of its own, 
independent of STSs. Independent motivation is provided by the needs of 
radioastronomers who want a sequence of positionings of their arrays with 
antennas so as to do measurements on a wavelength and all its small integer 
multiples. See, e.g., Langford (1958), Davies (1959), Priday (1959), Baron 
(1970), Roselle (1972), Bermond et al. (1978a,b). There is also some relation 
with the problem of finding graceful numberings of certain graphs. 

Instead of looking at automorphism groups one may ask for specific structural 
properties. Doyen and Wilson (1973) showed that if u, v=1 or 3 (mod 6) then 
there exists a STS(v) with a sub-STS(x) iff v > 2u + 1. Conversely it is sometimes 
useful to have systems without certain specified subconfigurations, see, e.g., 
Brouwer (1977c). One may introduce a dimension concept for linear spaces and 
study the possible values of the dimension function for STS, see, e.g., Teirlinck 
(1979). Probably the most important structural property studied is resolvability 
(see section 12). A resolvable STS is called a Kirkman triple system, and 
Ray-Chaudhuri and Wilson showed that a KTS(v) exists iff v =3 (mod 6), thus 
solving a one-century-old problem. 

Finally one may consider systems of STSs. Teirlinck (1977) shows that given 
any two STSs of the same order, one may find isomorphic embeddings of them on 
the same point set so that they have no triple in common. Several people studied 
the maximum number of pairwise disjoint STSs on a given point set. Clearly this 
cannot be higher than v —2, and it is now known that this upper bound is 
achieved for v #7. Cayley (1850) showed that for uv =7 this maximum is 2. See 
Teirlinck (1973b, 1984, 1989), Schreiber (1973), Lu (1983, 1984). 


11. 3-Designs 


We are still very far from a satisfactory asymptotic theory for 3-designs, like 
Wilson’s theory for 2-designs. Hanani (1963) constructs 3-designs with k = 4 for 
all admissible parameter sets. For some recursive constructions, see Hanani 
(1979). Some of the designs constructed there have been rediscovered by Assmus 
and Key (1986). 

In the case of pairwise balanced designs one uses transversal designs in the 
recursive constructions, and the first thing to do here would be to prove the 
analogue of the Chowla et al. theorem for transversal 3-designs. (R.M. Wilson 
teports that J. Blanchard has recently done this, and more generally obtained the 
corresponding result for transversal f-designs.) MGdbius planes provide 3-designs, 
and truncating them one obtains various triplewise balanced designs; Laguerre 
planes provide transversal 3-designs, and Minkowski planes provide nested 
transversal 3-designs. Using these it is possible to give recursive constructions for 
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many 3-designs, but it seems that one does not yet have sufficient control to prove 
asymptotic existence results. 


12. Resolvability 


A resolution of a design is a partition of the set of blocks into parallel classes; a 
design is called resolvable when a resolution exists. (A parallel class is a 1-design 
with r= 1, a partition of the point set.) For example, affine planes, Hermitean 
unitals and the design of points and lines in projective 3-space are resolvable. The 
obvious necessary condition for resolvability of a design with constant block size k 
is that k divides v, and usually this condition is also sufficient. In particular, 
resolvable Steiner triple systems (known as Kirkman triple systems KTS(v)) have 
been constructed for all v =3 (mod 6) by Ray-Chaudhuri and Wilson (1971) and 
resolvable Steiner systems S(2,4,v) for all v=4 (mod12) by Hanani et al. 
(1972). For resolvable group-divisible designs with k = 3, see Rees and Stinton 
(1987). Asymptotic results are known. 

The existence of such partitions implies strengthenings of Fisher’s inequality. 
For example, for an (r, A)-design, the point-block incidence matrix A satisfies 
AA’ =(r—A)I +AJ, and if r>A one concludes that rk A =v. But in case the 
collection of blocks can be partitioned into c 1-designs, there are c essentially 
different ways of writing the all-one vector as a linear combination of rows of A, 
and it follows that v = rk A <b —c +1. The case of equality can be characterized, 
cf. Beutelspacher and Lamberty (1983). 

An affine resolvable BIBD is a BIBD with a resolution such that for some 
constant yz > 0 any two blocks from different parallel classes meet in precisely 
points. (And then p = kK’ !v.) As we just saw, for a resolvable BIBD one has 
6 2u+r-—1, and Bose (1942) shows that equality holds if and only if the design 
is affine resolvable. Examples of affine resolvable designs are the designs of points 
and hyperplanes in an affine space. Strong nonexistence results are known. For a 
survey, see Shrikhande (1976). 

Much attention has been given to resolutions of the complete design (;,) of all 
k-subsets of a v-set. Indeed, Cameron (1976) is entirely devoted to them. First of 
all, such resolutions exist if and only if k|v (Baranyai 1975, see also Brouwer 
1976b, Baranyai and Brouwer 1977). (This is proved by a simple but ingenious 
repeated use of the Ford—Fulkerson max-flow min-cut and integral flow 
theorems.) In fact, Baranyai’s theorem roughly speaking implies that all con- 
ceivable 1-design-like structures exist. For an application, see, e.g., Stinson 
(1983). 

For k = 2 these resolutions are known as 1-factorizations of the complete graph, 
and for v < 10 all have been determined (Gelling 1973). Since there are many of 
them for larger v (for the number f(v) of nonisomorphic 1-factorizations of K, we 
have log f(v) ~1n’log n), one might impose additional restrictions; a perfect 
1-factorization is one in which the union of any two 1-factors (parallel classes) is a 
Hamilton circuit. It is conjectured that perfect 1-factorizations exist for all even uv, 
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and they are known for v=p+1, v =2p (p an odd prime) and for all even 
v <40. Cf. Seah and Stinson (1987). A survey of one-factorizations has been 
given by Mendelsohn and Rosa (1985), in fact the entire issue of the Journal of 
Graph Theory containing that paper is entirely devoted to factorizations of 
raphs. 

: ‘e interesting one-factorization of the 4-subsets of a 24-set is provided by the 
Steiner system S(5, 8, 24): given any 4-set (‘‘tetrad’’) there is a unique partition of 
the 24-set containing this 4-set such that the union of any two elements of the 
partition is block of the Steiner system; see section 16. 

One may also consider other partitions of the block set of a design. A design 
with constant block size k is called separable if its block set can be partitioned into 
1-designs with r € {1, Kk}. (This concept occurs in various results on the construc- 
tion of sets of MOLS from a BIBD.) 

A 3-design is called doubly resolvable if its set of blocks can be partitioned into 
resolvable 2-designs. Thus, Sylvester’s school girl problem (solved by Denniston 
1974) asked for a partition of the set of all triples on 15 points into resolvable 
Steiner triple systems S(2, 3, 15). See also Denniston (1974a, 1979). Resolutions 
into more special designs can be found in Baker (1976, 1984), Beth (1974, 1979), 
Brouwer (1976a), Chouinard (1983), Denniston (1972, 1973c,d, 1983), Kramer 
and Mesner (1975b), Hartman (1987a,b). Partitions of the collection of all 
k-subsets of a v-set, such that no two k-sets in the same part have a (k — 1)-set in 
common, play a role in the construction of good constant weight codes for d = 4, 
cf. Van Pul and Etzion (preprint). 

Instead of asking for special resolutions, one may ask for several resolutions 
with specified mutual relationship. Two resolutions are called orthogonal when 
each parallel class of one resolution has at most one block in common with each 
parallel class of the other resolution. A pair of orthogonal resolutions of the 
complete design (3) of all pairs in a v-set is called a Room square; see the first 
part of Wallis et al. (1972). For some variations, see Fuji-Hara (1986) and the 
references given there. Orthogonal factorizations of graphs are surveyed in 
Alspach et al. (1992); see also other papers in that collection. 


13. Completing designs 


Given a partial design of some kind that either has very few blocks, or is already 
almost a full design, it is often possible to prove that it can be completed to a 
design of the required type. Let us give some examples. 

A Latin rectangle is a matrix with entries in some finite set S such that no 
element occurs twice in the same row or column. Thus, a Latin square of order n 
is an m X n Latin rectangle with |S| =. Ryser (1951) gave necessary and sufficient 
conditions for the embeddability of Latin rectangles in Latin squares. Using this 
result, Evans (1960) proved that any partial Latin square of order n can be 
embedded in a Latin square of order m for each m = 2n. He conjectured that any 
partial Latin square of order n with at most n — 1 filled cells can be completed (to 
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a Latin square of order ), and this was proved by Andersen and Hilton (1983) 
and Smetaniuk (1981). See also Damerell (1983), Andersen (1985) and Daykin 
and Haggkvist (1984). 

Cruse (1974) gave necessary and sufficient conditions for the embeddability 
(1975a) of symmetric square Latin rectangles in symmetric Latin squares. Using 
his results, Lindner (197S5a) showed that any partial Steiner triple system on m 
points can be completed to a Steiner triple system on 6m +3 points. (Finite 
embeddability had been shown already by Treash 1971. Lindner’s bound has been 
improved later; many related results exist. See Lindner 197Sb, Andersen et al. 
1980.) 

In a projective plane, any two lines have precisely one point in common. Many 
results say that if in a partial linear space any two lines meet, and there are 
enough lines, then it can be embedded in a projective plane. See, e.g., Dow 
(1983). A projective plane has equally many points and lines. Many results say 
that if the number of lines of a linear space is only slightly more than the number 
of points, then the design is either one of a few exceptions, or obtained by 
truncating a projective plane. See, e.g., De Witte (1976), Totten (1976, 1977). A 
projective plane has as many points per line as lines per point. Many results say 
that if the (maximum) number of lines per point is only slightly more than the 
(minimum) line size, then the design is obtained by truncating a projective plane. 
See, e.g., Shrikhande and Singhi (1985), Beutelspacher and Metsch (1986). The 
famous result by Bruck (1963) that a net of order n with roughly n — (2n)'" 
parallel classes can be completed to an affine plane is another example of this 
phenomenon. Most of these results use Bose and Laskar’s “claw and clique” 
technique in some way or another; let us very roughly sketch a version of it. 

Given a linear space, look at the graph whose vertices are the lines and where 
two lines are adjacent when they are disjoint. Clearly, finding an embedding of 
the linear space into a projective plane is equivalent to finding a collection of 
cliques in this graph such that each edge is in precisely one of these cliques, any 
two cliques meet, and the lines in each given clique partition the point set of the 
linear space. Usually, in order to fulfil these requirements it is enough to find on 
each edge a sufficiently large clique. Now suppose that we can show that each 
j-claw in the graph can be extended to a (j + 1)-claw in at least M ways for j < m, 
but that no (m + 1)-claws exist. (This information might come from counting, but 
also eigenvalue arguments work in cases where the graph is sufficiently regular.) 
It follows that the M points that can be added to an (m-—1)-claw must be 
mutually adjacent, and in this way we find the required large cliques. 

Somewhat stronger results have recently been obtained by Metsch. 


14. Packing and covering 


Corresponding to the problem of constructing any type of design, where there is 
“precisely A” in the definition, one has problems of constructing maximal 
packings and minimal coverings, with ‘at most A” and “at least A” in the 
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definitions. In particular, the packing and covering problems corresponding to the 
construction of Steiner systems S(t,k,v) are the problems of determining the 
maximum size D(t,k,v) of a collection of k-subsets of a v-set such that no 
t-subset is covered more than once, and the minimum size C(t,k,v) of a 
collection of k-subsets of a v-set such that each t-set is covered at least once. 
Clearly, one has C(t, k,v)=(°)/(*) and D(t, k,v) <(’)/(4), in both cases with 
equality if and only if a Steiner system S(z, k, v) exists. R6dl (1985) shows that for 
fixed ¢ and k one has 


lim C(t, k, v)(F)/(7) =lim DG, kv) = 1 


(see chapter 33). Slightly more precise bounds were given by Johnson (1962), 
Katona et al. (1964) and Schénheim (1964, 1966) who showed that 
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and 
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These bounds are rather tight; for example, when ¢ =2 and k, d are fixed, then 
the difference between left-hand and right-hand sides remains bounded as v tends 
to infinity. For packings the above bound can be sharpened slightly by not only 
applying D,(t, k, v) = (v/k)D,(t- 1, k — 1, uv —1), but also the occasionally useful 
D,(t, k, v) 2 (v/(v — k))D,(t, k, v — 1). Another small improvement is obtained by 
observing that the collection of ¢-sets that are not covered precisely A times is 
cither empty or covers at least ¢ points. For t= 2 this yields: if (k — 1)|A(v — 1) 
and Av(v —1)/(kK-1)=—1 (modk) then C,(2,k,v) 2 Av(v — 1)/k(k — 1) +1, 
and if (kK~—1)|A@—1) and Av(u—1)/(K-1)=1 (modk) then D,(2,k,v)< 
Av(v — 1)/k(K —1)—1. The bounds resulting from this observations hold with 
equality for (¢,k)=(2,3) (Fort and Hedlund 1958, Schénheim 1966, Hanani, 
1975) and for (t,k)=(2,4), v>19 (Mills 1972, 1973, Brouwer et al. 1977, 
Brouwer 1979, Billington et al. 1984, Hartman 1986, Assaf 1986, 1988b); for 
(t, kK) = (3, 4) equality holds for large v (Hartman et al. 1986). For (t, k) = (2,5), 
see Mills (1983), Lamken et al. (1987), Mills and Mullin (1988), Assaf et al. 
(1993). For small v one has bounds like 


tt +1-20 
~(@-1)e 


(provided the denominator is positive) for the packing numbers; Todorov (1985a) _ 
gives infinitely many cases with strict inequality for the covering numbers. 
Coverings using few sets are studied in Mills (1979), Todorov (1985b, 1986). See 
also Bate and Van Rees (1985), Todorov and Tonchev (1982). A recent survey is 
Mills and Mullin (1992). 

These problems are also studied in various other guises. For example, finding 


Dit, k, v) 
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D(t, k, v) is equivalent to finding the maximum size A(n,d,w) of a binary code 
with word length n, constant weight w and minimum distance d, when n =v, 
w=k and d=2(k —t+1)-tables for »<28 are given in; finding C(t, k,v) is 
equivalent to finding the Turdn number T(n,1,r), the minimum size of a 
collection of /-subsets of an 7-set such that every r-subset containst at least one of 
these, when n=v, /=u-—k, r=u-t. 

A variation on this theme has recently gained popularity. One studies the 
function g(t, A; v), the minimum size of an exact ¢-covering, that is, of a ¢-wise 
balanced design, with largest block size k. For a survey, see Stanton (1988). 

In this area one also encounters functions of interest to both mathematician and 
layman. The lottery number L(n,1,r,t) is the minimum size of a collection of 
I-subsets of an n-set with the property that each r-set meets one of these /-subsets 
in at least ¢ points. For suitable parameters n, J, r, ¢ this is the minimum number 
of lotto forms one has to fill in to be assured of winning a prize. In Holland and 
Germany the values of L(41,6,7,4) and L(49,6,6,3) (respectively) are of 
special interest. See Hanani et al. (1964), Oberschelp (1972), Brouwer and 
Voorhoeve (1979). One also has the football pool problem, where one wants to 
find the size of the smallest code in Q” with given covering radius 8. For 
(n, 6, |O|) = (13, 1,3) this can be interpreted (in Holland) as the minimum 
number of football toto forms one needs to complete to have a sure second prize. 
Cf. Kamps and Van Lint (1970). There is some recent progress here. 


15. Codes and designs 


There are close connections between coding theory and design theory. (See for 
example Cameron and Van Lint 1980, MacWilliams and Sloane 1977 (esp. chapter 
6) and Bridges et al. 1981.) The most direct correspondences work as follows. 
Given a code € of word length n, take the collection of supports of the code 
words of weight w. This yields a design with block size w on 7 points. (We already 
mentioned the Assmus—Mattson theorem (in section 6), which guarantees that 
nice codes yield nice designs.) Conversely, given a design (X, @) and a (usually 
finite) field F, consider the linear subspace of F” spanned by the characteristic 
vectors of the elements of &. This yields a (linear) code of word length v. Often it 
is easier to work with the code than with the design. Usually one tries to choose F 
_in such a way that the resulting code becomes self-orthogonal, since there are 
powerful results on self-orthogonal codes. (To give a very trivial example: one 
can immediately conclude that dim € <v/2). For a design with block size k and 
intersection numbers s,,..., 5, this means that one chooses the characteristic p 
such that k =s,=0 (mod p) for all j, if possible. Examples of the use of this 
technique are Bagchi’s attempted proof of the nonexistence of a design 3- 
(57, 12,2) (an extension of a biplane 2-(56, 11, 2), cf. section 5) by studying the 
ternary code generated by such a design, the work of Calderbank on quasi- 
symmetric designs, and the characterizations by Tonchev of designs related to the 
Witt designs (cf. section 16). 
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In this context it is very useful to know dim @, that is, to know the rank of the 
incidence matrix . of the design over the field F. For a BIBD, it follows from 
AA™=(r—A)I+AJ and det AA’ =kr(r—A)’"' that A has p-rank v or v—1 
unless p|(r—A), and p-rank v precisely when P HV kr(r — A). The p-rank of a 
projective plane of order n, with p|n, p’ nis (n? + n + 2)/2, see Sachar (1979), 
Klemm (1986). The most important (and most often rediscovered) result on 
p-ranks is that the p-rank of the incidence matrix of points and hyperplanes in 
PG(d, gq), where g=p”", equals (4+2-')”+1 (Goethals and Delsarte 1968, 
Hamada 1968, MacWilliams and Mann 1967, Smith 1969) and that the corre- 
sponding p-rank for AG(d, q) is one less (Hamada 1973), Hamada (1968, 1973) 
also gives the ranks for the designs of points and f-flats in projective and affine 
spaces. He conjectured that these ranks are minimal for these geometries among 
the BIBDs with the same parameters, and later “‘Hamada’s conjecture’’ was taken 
to mean that among several designs with the same parameters the “nicest”? one 
would have the smallest p-rank. However, this is only a heuristic, and there are 
counterexamples for most interpretations of “nicest”. For example, among the 
Steiner systems S(2, 4, 28) the one with the largest automorphism group is the 
classical unital, which has 2-rank 21, while the Ree unital has 2-rank 19. For some 
positive results, see Hamada and Ohmori (1975); for some more counterexam- 
ples, see Tonchev (1986b). Lander (1983) gives more detailed results, using 
instead of p-rank the finer tool of elementary divisors. 


16. Witt designs, Golay codes and Mathieu groups 


By far the most important structure in design theory is the Steiner system 
5(5, 8, 24). Together with the related designs S(4, 7, 23), S(5, 6, 12), S(4, 5, 11) it 
was first constructed by Witt (1938) and these four Steiner systems are known as 
the Witt designs. We shall see that they are uniquely determined by their 
parameters. These designs are very closed related to the Golay codes, and we 
shall discuss these first. 


16.1. The Golay codes 


Let B be a vector space over F, with fixed basis e,,...,e,. A code © is a subset 
of V. A linear code is a subspace of V. The vector with all coordinates equal to 
zero (resp. one) will be denoted by 0 (resp. 1). 

The Hamming distance d,,(u, v) between two vectors u, v € V is the number of 
coordinates where they differ: when u=)u,e,, v= ¥ ve, then d,(u,v) = 
|{é | u; #v,}|. the weight of a vector u is its number of nonzero coordinates, i.e., 

d,,(u, 0). 

The minimum distance d = d(€) of a code @ is min{d,,(u, v)|u,v€ 6, u # v}. 
The support of a vector is the set of coordinate positions where it has a nonzero 
coordinate. 
| There exist codes, unique up to isomorphism, with the indicated values of n, q, 
€| and d: 
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n q |¢| d name of 
(i) 23 2 4096 7 binary Golay code 
(ii) 24 2 4096 8 extended binary Golay code 
(iii) 11 3 729 5 ternary Golay code 
(iv) 12 3 729 6 extended ternary Golay code 


Codes with these parameters will be constructed in section 16.2. We shall show 
uniqueness in the binary case in section 16.4. For the ternary case, see Delsarte 
and Goethals (1975). Let us assume that the codes have been chosen such as to 
contain 0. Then each of these codes is linear (the dimensions are 12, 12, 6, 6). 

The codes (ii) and (iv) are self dual, i.e., with the standard inner product 
(u,v) = u,v, one has € = €* for these codes. 

The codes (i) and (iii) are perfect, i.e., the spheres with radius (d — 1)/2 around 
the code words partition the vector space. (Proof by counting: |sphere| = 1+ 
(73) + (3) + (3) = 2048 = 2"! in case (i), and |sphere| = 1 + 2(}1) + 4(') = 243 = 
3° in case (iii).) 


Remark. Except for the repetition codes (with |@| = q, d =n), there are no other 
perfect codes € with d>3, cf. Van Lint (1971, 1974), Tietavadinen (1973, 1974), 
Tietavdinen and Perko (1971), Reuvers (1977), Best (1982), Hong (1984). 


The weight enumerators A(x):= ¥ a,x', where a, is the number of code words 
of weight i, are: 

(i) 1+ 253x” + 506x* + 1288x'! + 1288x'? + 506x"* + 253x"° + x”, 

(ii) 1+ 759x® + 2576x"? + 759x'® + x74, 

(iii) 1+ 132x° + 132x° + 330x® + 110x° + 24x", 

(iv) 1+ 264x° + 440x° + 24x). 

(Proof: Cases (ii) and (iv) follow immediately from (i) and (iii) (cf. the first 
paragraph of the next section). In cases (i) and (iii) use the fact that the codes are 
perfect; e.g., in case (iii) the sphere around 0 covers the vectors of weight at most 
two. The 2°(1!) vectors of weight 3 must be covered by spheres around 
codewords of weight 5, so that a, = 2° -(4!)/(3) = 132; next a, = (2*- (3) - 132: 
(3) — 132+ (8) 2)/(4) = 132; ete.) 

The supports of the code words of minimal nonzero weight form Steiner 

systems S(4, 7, 23), S(5, 8, 24), S(4,5, 11) and S(5, 6,12), respectively. 
. For those who know what a near polygon is: the partial linear space with as 
points the vectors of the extended ternary Golay code and as lines the cosets of 
1-dimensional subspaces spanned by a vector of weight 12 is a near hexagon with 
s +1=3 points/line and ¢ + 1 = 12 lines/point and diagram (as distance transitive 
graph) as in fig. 16.1. It has quads (namely 3 X 3 grids GQ(2, 1)). 


16.2. The Golay codes — Constructions 


Given one of the extended codes one may puncture it by just deleting one 
coordinate position. This proces (i) and (iii) from (ii), (iv). Conversely, given (i) 
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O54 (On a at“) v= 729. 
2 


Figure 16.1. 


one may construct (ii) by extending it, i.e., adding a parity check bit such as to 
make the weight of all code words even; and given (iii) (normalized by 
multiplying certain coordinate positions by —1 such that the normalized code 
contains the all-one vector) one may construct (iv) by adding a check trit such as 
to make the sum of all coordinates a multiple of three (but this latter fact is not 
easy to see; cf. Delsarte and Goethals 1975). 


16.2.1. A construction of the extended binary Golay code 

This code is the lexicographically first code with word length n = 24 and minimum 
distance 8: write down the numbers 0, 1,...,27*—1 in binary and consider them 
as binary vectors of length 24. Cross out each vector that has distance less than 8 
to a not previously crossed-out vector. The 4096 vectors not crossed out form the 
extended binary Golay code. (Proof. Just do it. Some work may be saved by 
observing (M.R. Best) that any lexicographically minimal binary code with a 
number of vectors that is a power of two is linear so that all one needs are the 12 
base vectors. These turn out to be 


eocoocoocooocoocoe 
Oroocoocodnccoe 
corocoocooocooce 
ecocr oc coocooqc‘ce 
cooococorocoocjoeooccoeo 
oooocorocjooco 
SC:O'°O SO 'O :O.0 OC OO 
Bere rereee DOOCSO 
cooococroroocc;c“( 
cooococoorocoe 
cooooccooecooroceo 
See OOO = = OO 
cooococoococorso 
BrP OoOorr Creer Ore OS 
ep OF OF OFF CHK Oo 
SB OCF OF FB OF KF KE CO 
cooococooocooco- 
Or Ororrrr CoO 
RFP OOCOFRFR kK OF OF Oe 
CGoOorrr OCF Or K Oe 
Ore ke FR CO OrFR COR 
RPBrPePoococororos= = 
oOorcoorrr OCOr =S & 
Ccoorococr OP RP ee = 


) 


Remark. Deleting the columns with only one 1 and interchanging zeros and ones 
we find the incidence matrix of the unique symmetric group divisible design 
GD(5, 2, 2; 12) in Hanani’s notation — see below. 


16.2.2. Construction as quadratic residue codes 
For (n, g) = (11, 3) or (23, 2) consider the linear code generated over F, by the n 
vectors c,(1 <i <7) with coordinates 


_ fi if j—disanonzero square (mod n) , 
(c,), = ; 
0 otherwise . 
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This yields the ternary and binary Golay codes. (Proof. The only nontrivial thing 
to check is the minimum distance. One easily sees that the extended code has all 
weights divisible by 3 resp. 4 so that all that remains is to prove that its minimum 
distance is not 3 resp. 4 and that is easy. For explicit details see Van Lint (1982, 
§6.9) (but notice that some of the statements there are valid only in the binary 
case).) 


16.2.3. Constructions from the 2-(11,5,2) biplane and the icosahedron 

Let B be the incidence matrix of a design with point set Z,, and blocks 
{1,3,4,5,9} +i @EZ,,) (ie., the translates of the set of nonzero squares 
mod 11). (This design is a square block design 2-(11, 5, 2): any two points are on 
two blocks and dually.) Then the rows of the 12 x 24 matrix 


017 
I where K=J—B 
1K 


generate the extended binary Golay code. 

Let N be the adjacency matrix of the icosahedron (points: 12 vertices, adjacent: 
joined by an edge). Then the rows of the 12 x 24 matrix (I J — N) generate the 
extended binary Golay code. Conversely, given a generator matrix (J X) for the 
extended binary Golay code, either one of its rows has weight 12 and we are in 
the first situation, or all rows have weight 8 and X is the incidence matrix of the 
unique symmetric group divisible design GD(5, 2, 2; 12); by suitably ordering the 
rows and columns we may obtain X = N and we are in the second situation. 


16.2.4. A similar construction for the extended ternary Golay code 
Let S be the 5 <5 circulant matrix with first row (0 1 -—1 —1 1) (ie., the 
quadratic residue character mod 5). Then the rows of the 6 x 12 matrix 


0 1° 
I 
-1 S$ 
generate the extended ternary Golay code (over F,). (Cf. Cameron and Van Lint 
1980, chapter 13: Symmetry codes.) One checks easily that (up to permutating 


coordinate positions and multiplying columns by —1, i.e., up to monomial 
transformations) this is the only possibility for a generator matrix (I X). 


16.3. Parameters of the Witt designs 


For a t-(v, k, A) design, the number of blocks containing a point set X and disjoint 
from a point set Y (where X M Y = @) can be expressed in the parameters ¢, v, k, 
A, |X|, [Y| when |X U Y|c. Let us call these numbers 2(|X], |¥]). 

For S(5,8,24) we have: A,=1, A,=5, A,;=21, A,=77, A, = 253, Ay = 759. 
The “intersection” triangle here gives the numbers p(|X|,|Y|) with [XU Y| 
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constant in each row and |X| increasing in each row, where X U Y is contained in 
a block. 


7539 
253 506 
77 176 330 
21 56 120 210 
5 16 40 80 130 
1 4 12 28 52 78 
1 0 4 8 20 32 46 
1 0 0 4 4 16 16 30 
1 0 0 0 4 0 16 0 30 


Given a block B, of S(5,8, 24), let n; be the number of blocks B such that 
|B) B| =i. Then n, = 1, n, = 280, n, = 448, no = 30 and all other 7, are zero. 

For those who know what a near polygon is: the partial linear space with as 
points the 759 blocks of the Steiner system S(5, 8, 24) and as lines the partitions 
of the point set of the design in three pairwise disjoint blocks, is a near hexagon 
with s+1=3 points/line and ¢+1=15 lines/point and diagram (as distance 
transitive graph) as in fig. 16.2. It has quads (with 15 points and 15 lines: 
GQ(2,2), i.e., Sp(4,2) generalized quadrangles). A quad in the near polygon 
corresponds to a sextet in the design: a partition of the point set into six 4-sets 
such that the union of any two of them is a block. Distances 0, 1, 2, 3 in the near 
polygon corresponds to intersections of size 8, 0, 4, 2, respectively. 


ae (On Oa Bo) tL 


Figure 16.2. 


For S(5, 6,12) we have: A,=1, A,=4, A; =12, A, =30, A, = 66, Ay = 132. Our 
intersection triangle becomes 


132 
66 66 
30 36 30 
12 18 18 12 
4 8 10 8 4 
1 3 5 5 3 1 
1 0 3 2 3 0 1 


Given a block B, of S(5,6, 12), let n,; be the number of blocks B such that 
|B) OB| =i. Then 2,=1, 1,=45, n, = 40, n, =45, ny =1 (and n, =n, =0). In 
particular the complement of a block is again a block. 

Note that all the above information follows directly from the parameters of 
these Steiner systems (and may thus be used in uniqueness proofs). 
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There exist unique designs S(5,8,24), S(4,7,23), S(3,6,22), S(2,5,21), 
S(1, 4, 20), S(S, 6, 12), $(4, 5,11), S(3, 4, 10), S(2, 3, 9), S(1, 2, 8). (S(2, 5, 21) is 
the projective plane of order 4, S(2,3,9) the affine plane of order 3, S(3, 4, 10) 
the Mobius plane of order 3.) (In view of the derivation S(t, k, v)—> S(@t—1,k — 
1, v — 1) it suffices to construct S(5, 8, 24) and S(5, 6, 12), and we shall find these 
as the supports of the code words of minimal nonzero weight in the extended 
Golay codes; uniqueness will be shown as a corollary of the uniqueness of the 
Golay codes.) 


16.4. The uniqueness of the extended binary Golay code € 


For a good account, of the uniqueness of the Golay codes and associated Steiner 
systems see MacWilliams and Sloane 1977, chapter 20). There the uniqueness of 
S(5, 8,24) is proven ‘by hand” - examining its structure in detail, and the 
uniqueness of € follows rather easily by use of the linear programming bound. 
Here we follow the opposite way, getting the uniqueness of S(5, 8, 24) from that 
of €, and proving the latter directly, without recourse to the theory of association 
schemes. Instead, the uniqueness of € will come as a consequence of the 
uniqueness of the 2-(11,5, 2) biplane. 


Theorem. Let € be a binary code containing 0, with word length 24, minimum 
distance 8 and |€|>=2". Then € is the extended binary Golay code. 


Proof. If we delete a coordinate position we find a code € with word length 23, 
minimum distance (at least) 7 and |@,| = 2’. As we saw before, such a code must 
have |€,|=2'? and weight enumerator coefficients Q@y) = @2,=1, a, =a, = 253, 
a, = a, = 506, a, = @,, = 1288 (by the sphere-packing argument it follows that €, 
is perfect). Now if © contains a word of weight w not divisible by 4 then by 
suitably puncturing we would find a €, containing a word of weight w or w—1 
not 0 or —1 (mod 4), a contradiction. Hence € has weight enumerator coefficients 
Gy = Az, = 1, ag = a1, = 759, a,, = 2576. Giving an arbitrary vector in € the rdle of 
0 we see that all distances between code words are divisible by 4. If u, v€ @ then 
d,,(u, v) = wt(u) + wt(v) — 2(u, v) so the inner product (u, v) is even and it follows 
that @ is self-orthogonal. But €* is a linear subspace of dimension 24 — 
dim (€) <12 so that €* = and © is a linear code. Let u and & be two 
complementary weight 12 vectors in €. The code €, obtained from @ by throwing 
away all coordinate positions where u has a 1, has word length 12 and dimension 
11 and hence must be the even weight code (consisting of all vectors of even 
weight). This means that we can pick a basis for € consisting of 4 and 11 vectors 
v, with (u, v;)=2 so as to get a generator matrix of the form 


Cet - 
I 


where J is an identity matrix of order 11. A little reflection shows that J — K is the 
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incidence matrix of a 2-(11, 5, 2) biplane. This shows uniqueness of € given the 
uniqueness of the 2-(11, 5, 2) biplane, and the latter is easily verified by hand. O 


Theorem. There is a unique Steiner system S(5, 8, 24). 


Proof. (i) Existence: the words of weight 8 in € cover each 5-set at most once 
since d= 8, and exactly once since (24) = 759-(§). 

(ii) Uniqueness: Let # be such a system, and let €, be the linear code over F, 
spanned by its blocks. From the intersection numbers we know that @, is 
self-orthogonal (i.e., €,C €;) with all weights divisible by 4. In order to show 
that |€,| 22" fix three independent coordinate positions, say 1, 2, 3 and look at 
the subcode €, of €, consisting of the vectors u with u,=u,=u,. Then 
dim €, = (dim €,) + 2. Thus, in order to prove dim €, > 12 it suffices to show that 
the code generated by the blocks of S(5,824) containing 3 given points has 
dimension at least 10. In other words, we must show that the code generated, by 
the lines of the projective plane PG(2, 4) (which is nothing but S(2, 5, 21)) has 
dimension at least 10, but that is the result of the next theorem. 

The blocks of an S(5,8,24) assume all possible 0-1 patterns on sets of 
cardinality at most 5 so that €; has minimum weight at least 6. Since @, has all 
weights divisible by 4 and €,C @; it follows that d(¢,)=8. Now apply the 
previous theorem to see that @, is the extended binary Golay code, and & the set 
of its weight 8 vectors. O 


Theorem. The code over F, spanned by the lines of the projective plane PG(2, 4) 
has dimension 10. 


Proof. Let abcde be a line in PG(2, 4). The set of ten lines consisting of all 5 lines 
on a, 3 more lines on b, and one more line on each of c, d, is linearly 
independent, so the dimension is at least 10. But the previous proof (or a simple 
direct argument showing that the extended code cannot be self-dual) shows that it 
is at most 10. O 


16.5. Substructures of S(5,8, 24) 


Theorem. Let B, be a fixed octad (block of S(5,8,24)). The 30 octads disjoint 
from By form a self-complementary 3-(16, 8,3) design, namely the design of the 
points and affine hyperplanes in AG(4, 2), the 4-dimensional affine space over F,. 


Proof. Let & be the collection of octads disjoint from B,. We have seen already 
that || = 30. 
(i) The linear span of @ is a code of dimension 5 and weight enumerator 
1 + 30x° + x'®. (Proof. Having zeros at the positions of B, gives 7 restrictions, so 
this span has codimension 7 in the extended binary Golay code ©.) 
(ii) Each block B € 8 is disjoint from a unique B’ € @ and meets all other 
blocks in precisely 4 points. (Proof. Obvious from (i).) 
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(iii) @ is a 3-(16, 8,3) design. (Proof. Each triple is covered 30-(§)/('£) =3 
times on average, but no triple is covered 4 times.) 

(iv) We have AG(4, 2). (Proof. Invoke your favorite characterization of 
AG(4, 2) or PG(3, 2), say Dembowski-Wagner or Veblen and Young. An explicit 
construction of the vector space is also easy: choose a point 0¢ B, and regard it 
as origin. If x, y are nonzero points then the three blocks B,, B,, B, on 0, x, y 
have a fourth point z in common (for B, is the complement of B, + B, (i.e., 
B,AB,) hence contains B, B,)-—now write x+y=z. If x, y, z are three 
arbitrary nonzero points and B,, B,, B, are the blocks containing 0, x, y then 
unless z = x + y precisely one of the B,, say B,, also contains z. Now in order to 
check that (x + y)+z=x+(y+z) we can do all computations within B, (using 
the induced 3-(8, 4, 1) design) — but clearly the 3-(8, 4, 1) design is unique (the 
extension of the Fano plane), i.e., is AG(3,2) and addition is associative. This 
defines the vector space structure, and the blocks are the hyperplanes on 0 and 
their complements.) O 


Theorem. Let T, be a fixed tetrad (4-set). Then T, determines a unique sextet, i.e., 
partition of the 24-set into 6 tetrads T, such that T, UT, is a block for all i, j (ij). 


Proof. Since 4,=5 there are 5 blocks B, on Ty, (=1,2,3,4,5) and with 
T;:= BT, we have T, UT, = B, + B, (0 #i#j #0). Since A, = 1 the 6 tetrads T; 
are pairwise disjoint. 


Theorem. Let B, be a fixed octad, x © By, y E By, Z the complement of By, U {y}. 
Then there is a natural 1-1 correspondence between the (2) = 35 3-sets in By\{x} 
and the (2° +1)(27+2+1)=35 lines in the PG(3,2) defined on Z. Triples 
meeting in a singleton correspond to intersecting lines. 


Proof. A line in the PG(3, 2) on Z is a T\{y} where T is a 4-set such that 3 of 
the blocks on it are disjoint from B,. Of the remaining two blocks on T, precisely 
one contains the point x, and if B is this one then BM By\{x} is the triple 
corresponding to the given line. O 


Theorem. Let D, be a fixed dodecad (support of a vector of weight 12 in €). The 
132 octads meeting D, in six points form the blocks of a Steiner system S(5, 6, 12) 
on Dy. 


Proof. Each 5-set in D, is in a unique block of S(5, 8, 24), and this block must 
meet D, in 6 points. O 


Theorem. Let D, be a fixed dodecad and x € D,. The 22 octads meeting Dy in six 
points and containing x form the blocks of a Hadamard 3-design 3-(12, 6, 2). 
There is a natural 1-1 correspondence between the 4 - 132 = 66 pairs of disjoint 
blocks of the S(5,6,12) on D, and the (2) = 66 pairs of points not in Dg. 
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Proof. Given a pair of points x, y outside Dj, there are precisely two octads on 
{x, y} meeting D, in six points, and these give disjoint blocks in the S(5, 6, 12) 
(for: if these octads are B, B' then B'= B+ D,). Varying y we find 11 pairs of 
disjoint blocks, blocks from different pairs having precisely 3 points in 
common. L) 


16.6. The Mathieu group M,, 


M,, is by definition the automorphism group of the extended binary Golay code @ 
(or, what is the same, of the Witt design S(5,8,24)), i.e., the group of 
permutations of the 24 coordinate positions preserving the code. For a beautiful 
discussion of this and related groups, see Conway (1971). 


Theorem. M,, has order 24-23-22-21-20-16-3 and acts 5-transitively on the 24 
coordinate positions. 


Proof. Let N be the adjacency matrix of the icosahedron. Since Aut N is 
transitive on the 12 points, and N is nonsingular, so that if (J J—N) generates @ 
then also (J—N’ 17) for some N’ equivalent to N, it follows that M,, is 
transitive. (This immediately implies uniqueness of the binary Golay code — see 
next section.) 

The representation as quadratic residue code gives an automorphism with cycle 
structure 1+ 23, so M,, is 2-transitive. This same representation also gives 
1+1+11+11. The representation below exhibits an automorphism with cycle 
structure 1°7° so that M,, is 3-transitive. 

Let % be the extended binary Hamming code (with word length 8, dimension 
4) consisting of the 8 rows of (°° ry (where F = circ(0110100) is the incidence 
matrix of the Fano plane PG(2, 2y) and their complements. Let #* be the 
equivalent code obtained by replacing F by F* = circ(0001011). Then #N X* = 
{0,1}. Let €={(a@+x,b+x,a+b+x)|a,bEKH,x€H*}. Then € has word 
length 24, dimension 12 and minimum distance 8 as one easily checks. Hence @ is 
the extended binary Golay code. 

From automorphisms with cycle structure 1°7° and 1*5° (the latter is easily seen 
in the icosahedral representation) we see that M,, is 4-transitive. 

Both # and #* have automorphism group PSL(2, 7) acting on the coordinates 
numbered ©, 0, 1, 2, 3, 4, 5, 6. Elements in PGL(2, 7)\PSL(2, 7) interchange 7 
and #*. Any automorphism of % of shape 4° (for definiteness, say x+>2 — 1/(x + 
2)) yields an automorphism of @ of shape 4°. 

From automorphisms of shape 1*5° and 4° we see that M,, is transitive on 
5-sets. The stabilizer of a 5-set contains permutations of shape 1°5° and 1°2° (the 
latter, ¢.g., by interchanging the first and second groups of 8 coordinates in the 
representation given above) inducing 5 and 1°2 on the 5-set, but since (ABCDE) 
and (AB) generate the symmetric group Sym(5) on 5 symbols, this shows that 
M,, is 5-transitive. 
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Since M,, is transitive on 5-sets, and a 5-set determines a unique octad, M,, is 
transitive on octads. 

Let us give yet another representation of € (due to Conway). Consider the set 
of 4x6 matrices with entries 0 or 1 satisfying the following two restraints: 

(i) The six columns sums and the first row sum have the same parity. 

(ii) If r,; denotes the ith row (1<i=<4) and F,=({0,1,,@} and ¥ is the 
linear code (with word length 6, dimension 3 and minimum distance 4) over F, 
generated by the rows of the matrix 


1001.0 @ 
010310. 
001141 «21 


then r, t wr, + or, EF. 

It is almost trivial to verify that these matrices form a linear code with word 
length 24, dimension 12 and minimum distance 8 over F,, i.e., we have the 
extended binary Golay code again. 

Three octads are given in fig. 16.3. 

Some easy automorphisms (with cycle structure): 

© interchange rows r,, r3, 7, cyclically [3°1°] 

© interchange the last two rows and the last two columns [2°1°] 

Wigaaes lst and 2nd, 3rd and 4th, 5th and 6th column, and rows r, and r, 

This representation shows that the pointwise stabilizer of a 5-set is transitive on 
the remaining 3 points of the octad containing it. Next observe that if a is a 
permutation fixing a certain octad pointwise and g fixes this octad setwise then 
aw*®:= g ‘ag fixes the octad pointwise. It follows that if O is an orbit of the 
pointwise stabilizer of this octad, then gO is also an orbit. Since we can find g 
with shapes 47+ 4* and 1°5+ 15° and @ with shape 1°3+ 13° we see that the 
pointwise stabilizer of an octad is transitive on the remaining 16 points. 

Let H be the subgroup of M,, fixing a block B (setwise) and a point x # B. By 
the above |H| > 24-23 -22-21-20-16-3/759- 16 =48!. The code words in € that 
are zero on the positions of B U {x} form a subcode with codimension 8 in ©, i.¢., 
with dimension 4. No nonidentity element of H can act trivially on this subcode 
(no two coordinate positions are dependent, e.g., because this is the code 
spanned by the complements of the hyperplanes in PG(3,2) so |H|< 
|PGL(4, 2)| = 15.14.12.8=18!, Since equality must hold we have shown that 
|M,,| =24+23-22-21-16-3 and that H= Alt(8) =PGL(4,2). O 


A 


Figure 16.3. 


726 A.E. Brouwer 


Theorem. M,, is transitive on trios (partitions of the point set into 3 octads), 
sextets and dodecads (vectors in 6 of weight 12). 


Proof. (i) PGL(4, 2) is transitive on the hyperplanes of PG(3, 2). 
(ii) Any tetrad determines the sextet containing it, and M,, is 4-transitive. 
(iii) Writing the dodecad as B + B’ where B and B' are octads with |B N B’| = 
2 we see that it suffices to show that the pointwise stabilizer of B is transitive on 
the 16 blocks B’ meeting B in a given pair. But this stabilizer is the elementary 
abelian group 2‘ and if some translation fixed B' then there would be an affine 
hyperplane meeting B’ in 6 points — impossible. O 


16.7. More uniqueness results 


Theorem. Let €, be a binary code containing 0 with word length 23, minimum 
distance 7 and |€,|22'°. Then ©, is the (perfect) binary Golay code. 


Proof. Add a parity check to ©, to obtain €. Thus ©, is obtained from @ by 
suppressing some coordinate position, but all positions are equivalent since M,, is 
transitive. O 


Theorem. There is a unique Steiner system S(4, 7, 23). 


Proof. The proof is very similar to that of the uniqueness of S(5, 8, 24). Let ©, be 
the code spanned by the blocks and add a parity bit to obtain a self-orthogonal 
code © of word length 24. As before one identifies @ as the extended binary 
Golay code, then €, as the (perfect) binary Golay code, then the blocks of 
S(4, 7, 23) as the words of weight 7 in this code. O 


Theorem. There is a unique Steiner system S(3, 6, 22). 


Proof. Inspired by Lander (1983, esp. pp. 54 and 71), we first construct 9 as the 
extended linear code over F, spanned by the lines of PG(2, 4). Then 2 has word 
length 22, and we have seen already that dim = 10. @ is self-orthogonal and 
hence there are three codes Q, of dimension 11 such that 9C9,CB* (i= 
1, 2,3). But @ can be identified with the subcode of @ defined by u, =u, =u, 
and the three codes 9, are found as subcodes defined by u,=u,, u, =u, and 
u, = uy, respectively. (More precisely, our codes are obtained from the subcodes 
of © just mentioned by dropping the first three coordinate positions and adding a 
parity bit; note that 1 © Y.) Now 3-transitivity of M,, tells us that the three codes 
2, are equivalent; each has 77 words of weight 6. Given any Steiner system 
5(3, 6, 22), its blocks must span one of the codes 9,, and the blocks of the Steiner 
system are recovered as the supports of the code words of weight 6 in this 
code. O 


Theorem. (a) Let €“ be a binary code containing 0 with word length 24 —i, 
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minimum distance 8, and size at least 2'?"'. If 0<i<3 then 6 is the i times 
shortened extended binary Golay code. 

(b) Let € o be a binary code containing 0 with word length 23 — i, minimum 
distance 7, and size at least 2‘ If 0<i< 3 then €® is the i times shortened 
binary Golay code. 

The weight enumerators are (for i>0) given by 


i n dim 

1 23 11 1+ 506x® + 1288x'? + 253x'° 

2 22 10 1+ 330x° + 616x'* + 77x’® 

3 21 9 1+ 210x* + 280x? + 21x"° 

1 22 11 1+ 176x’ + 330x° + 672x"! + 616x'? + 176x!5 + 77x16 
2 21 10 1+ 120x’ + 210x° + 336x'! + 280x!? + S6x1° + 21x'¢ 
3 20 9 1+ 80x’ + 130x® + 160x!! + 120x!? + 16x!* + 5x! 


Adding a parity check bit to 6 we find €, and for i>O the latter is the even 
weight subcode of eo, 

(c) Let €g, be a binary self dual code with word length 22 and minimum distance 
6. Then Gg. is the once truncated binary Golay code. 


Proof (Sketch). Part (b) follows from part (a) since each of these codes has a 
group that acts transitively on the coordinate positions. For part (a), apply 
Delsarte’s linear programming bound (enhanced by addition of a few obvious 
inequalities) to obtain uniqueness of all weight enumerators given. (Cf. Best et al. 
1978.) The case i=0 has been treated earlier. 6° U(€@" +1) has minimum 
distance 7 hence is the binary Golay code. This settles the case i=1. €U 
(€ + 1) is self-orthogonal with minimum distance 6 and word length 22 with 2" 
words hence is linear. But according to Pless and Sloane (1975) the unique such 
code is the once truncated binary Golay code. This settles the case i = 2 and part 
(c). If we extend €° U(¢°’ +1) with a parity check bit we obtain a self- 
orthogonal code 2 with minimum distance 6 and word length 22; & is contained 
in a self-dual code with d = 6 and n = 22, necessarily €)). Removing the parity bit 
again we find that ¢“ is contained in the twice truncated binary Golay code 
which has weight enumerator (1 + x7!) + 21(x° + x") + 56(x° + .x'°) + 120(x’ + 
x") + 210(x* + x7?) + 280(x° + x'7) + 336(x1° + x1!). Clearly €© must consist of 
all vectors in this code with a weight divisible by 4, and hence is uniquely 
‘determined. 0 


(Dodunekov and Encheva 1988 show that linear codes €(4) and (5) are 
unique, and that there are precisely two linear codes ¢(6). The weight 
enumerators are 


x 
a 
3 


4 20 8 1+ 130x° + 120x!? + 5x?° 
5 19 7 1+ 78x° + 48x'? + x16 
6 18 6 14+ 46x° + 16x"? + x’ or 1 + 45x° 418%? ~~.) 
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Starting from S(5, 8, 24) and taking successive derived or residual designs we 
find designs with the following parameters: 


5-(24.8,1) 


3-(22,6,1) 3-(22,7,4) 3-(22,8,12) 
2-(21,5,1) 2-(21,6,4) 2-(21.7.12) 2-(21,8,28) 


Up to now we have seen uniqueness of the three largest Steiner systems (and 
used the uniqueness of S(2, 5,21) = PG(2,4)- an easy exercise). Such strong 
results are not available for the remaining 6 designs. (In fact, observe that a 
2-(21, 7,3) design exists —e.g., the residual of an SBIBD 2-(31, 10,3). Taking 4 
copies of such a design, independently permuting the point sets in each case, 
produces large numbers of nonisomorphic designs with parameters 2-(21, 7, 12), 
so this structure is certainly not determined by its parameters alone.) 

Tonchev (1986c) shows that there are unique quasi-symmetric designs 2- 
(22, 7, 16), 2-(21, 7,12) and 2-(21, 6, 4). (A design is called quasi-symmetric if it 
has only two distinct block intersection numbers; here 0, 2 in case of 2-(21, 6, 4) 
and 1, 3 for the other two designs. Note that 3-(22, 7,4) has A, = 16.) 

Tonchev (1986a) showed that there are unique designs 2-(23, 8,56), 2- 
(22, 8, 40), 2-(21, 8, 28) with intersection numbers 0, 2, 4. (Note that 4-(23, 8, 4) 
has A, = 56 and that 3-(22, 8, 12) has A, = 40. Tonchev is not quite explicit about 
the middle case — he assumes 3-(22, 8, 12) — but his methods also work when only 
2-(22, 8, 40) is given.) 

The proofs are always by generating a self-orthogonal code and using the 
classification of binary self-dual codes with n = 22. 

More information is contained in the following. Let @ be a collection of 
k-subsets of an n-set such that any two k-subsets have distance at least 8. Then for 
each of the cases listed below we have |9| <b, and when equality holds then the 
system is known to be unique, except in five cases. For (n, k, b) = (19, 5, 12) there 
are precisely two nonisomorphic systems, corresponding to the two Latin squares 
of order 4. For (n, k, b) = (18,5,9) there are precisely three nonisomorphic 
systems. For the three cases (n, k, b) = (19, 6, 28), (20,7, 80), (21,8, 210) no 
information is available. In all cases other than these three the block intersection 
numbers are as shown in table 16.1. 


Table 16.1 

k n intersections 
18 19 20 21 22 23 24 

5 9 12 16 21 1 

6 28 40 56 77 0,2 

7 80 120 176 253 1,3 

8 210 330 506 759 0,2,4 
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16.8. Related combinatorial structures 


Many important combinatorial or group-theoretical structures are closely linked 
to the Witt designs or Golay codes. We find among other things two sporadic 
SBIBDs. 

First of all, we have the Higman—Sims graph, the unique strongly regular graph 
with parameters (v, k, A, #) = (100, 22, 0, 6). It is constructed by taking a symbol 
x, the 22 points and 77 blocks of S(3,6,22) as the 100 vertices of the graph, 
joining ~ to the 22 points, cach point to the blocks containing it, and two blocks 
when they are disjoint. The automorphism group of this graph is HS - 2, where HS 
denotes the simple Higman—Sims group. In this graph, the collection of points at 
distance 2 from a given point induces the unique strongly regular graph with 
parameters (v,k, A, #) = (77, 16, 0, 4), and in this latter graph again the points at 
distance 2 from a given point carry the unique strongly regular graph with 
parameters (v, k, A, 4) = (56, 10, 0, 2). If we take the adjacency matrix of this last 
graph, and add in the diagonal, we obtain the incidence matrix of a biplane 
(SBIBD with A = 2) with parameters 2-(56, 11, 2). 

(Of course, more generally one obtains SBIBDs from strongly regular graphs 
with A= or A= yp — 2. In fact one obtains precisely those SBIBDs that have a 
polarity with all points absolute, or without absolute points.) 

Next, we have Higman’s design. Let x and y be two points in the point set of 
S(5, 8, 24). Construct a square design with as points the blocks containing x but 
not y and as blocks the blocks containing y but not x. A point B is incident with a 
block C when |BMC|€ {0,4}. This produces an SBIBD with parameters 2- 
(176, 50, 14) and with HS as (2-transitive) automorphism group Higman (1969), 
Smith (1976). 

Inside the Higman-—Sims graph we can find the Hoffman-Singleton graph, and 
using S(4, 7,23) we can build the McLaughlin graph and the unique regular 
two-graph on 276 points, see Brouwer and Van Lint (1984). These structures have 
automorphism groups PU,(5), McL:2 and Co-3, and conversely often provide 
the most convenient way to study these groups, just as one cannot study the 
Mathieu groups without knowledge of the Witt systems. Using S(5, 8, 24) we can 
build the Leech lattice, see Leech (1967) and Conway and Sloane (1988). Finally, 
the Leech lattice is the first ingredient for building the Monster group, FG. 


_ Appendix. The known SBIBDs 


In the following, q will denote a prime power; n:=k—A. 

(1) Hyperplanes in PG(d, q). v =(q**' — 1)/(q -1), k = (97 — 1)/(q—-1), A= 
(q7"' —1)/(q-1), n=q7"!. (Many other designs also have these parameters. 
Constructions with difference sets have been given by Singer 1938 and Gordon et 
al. 1962.) 

(2) Hadamard designs. v = 4n -— 1, k=2n—1, A=n-—1. These exist iff there 
exists a Hadamard matrix of order 4n. A Hadamard matrix is a matrix with +1 
entries whose rows are mutually orthogonal. It is a long-standing conjecture that 
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Hadamard matrices of order 4n exist for all m. They are known to exist for many 
orders, e.g., for all n < 166, n # 107, and for 4n = 2'm with m’ <2’, see Wallis et 
al. (1972), Wallis (1976), Geramita and Seberry (1979), Sawade (1985) and 
Seberry and Yamada (1992). Constructions with difference sets have been given 
when v is a prime power by Paley (1933) and Todd (1933) (take the set of 
quadratic residues as a difference set), and by Hall (1956) (using cubic and sextic 
residues); and in case v = q(q + 2) where both q and q + 2 are prime powers by 
Stanton and Sprott (1958). Many known construction techniques are discussed in 
the survey Hedayat and Wallis (1978). 

(3) Regular Hadamard matrices with constant diagonal, v = 4¢°, k=2r-+, 
A=f—t, n=, exist probably for all t; existence is known when or is the order 
ofa Hadamard matrix, if both 2 — 1 and 2¢ + 1 are prime powers, and for t =7; if 
existence is known for ¢, and ¢, then also for ¢ = 2¢,t,. (Cf. Menon 1962, Goethals 
and Seidel 1970, Wallis 1971, Wallis et al. 1972, section 5.3, Brouwer and Van Lint 
1984, section 8D.) 

(4) The adjacency matrix of the strongly regular graph with as vertices the 
elliptic or hyperbolic points off a quadric in PG(2m,3) where two points are 
adjacent when they are joined by an elliptic line is the incidence matrix of a 
symmetric design with parameters v = 3"(3" + €)/2, K=3" 1(3"—€)/2, A= 
37-13" — 6)/2, n=3°"") (€ = +1, m=2). 

(5) (Wallis 1971, McFarland 1973). v= eg —1)/(q-1)+1), k= 
q'(q""! — 1)(q-1), A= 44(q2 - I)K(q-1), n=4 

oO (Brouwer 1983). v = 2q(q° —1)(q-1)+1,k=q*,A=14q7 (q-1), n= 

(q+1) (q an odd prime power, d= 1). These designe are for d=2 
24 valent to equidistant binary codes with word length v and distance (uv — 1)/2, 
Stinson and Van Rees (1984), Van Lint (1984). Denniston (1982) has determined 
all 78 such designs with (d, q) = (2, 3). 

Ad) (Shrikhande and Singhi 1975, Van Leyenhorst 1980). v=A°+A4+1, k= 
A? +1, n=A?—A+1 whenever both A-1 and ? ~A+1 are prime powers. 

(8) (Mitchell 1979) v=q"*'-q+1, k=q", A=q""', n=q""'(q—-1) (h=2, 
q > 2) whenever there exists an affine plane AG(2, q — 1). 

(9) Other designs with a difference set listed in Baumert (1971). (See also Hall 
1974.) Examples are: . 

Biquadratic residues (Lehmer 1953, Hall 1956). v = 47° +1,k=2?,A=(t?-1)/ 
4, n=(3t? +1)/4 (¢ odd, v a prime power). Difference sets with these same 
parameters have been constructed by Whiteman 1962 for ¢ odd, v=pq where 
pown p and q are odd primes, p =1 (mod 4) and g =3p +2. 

Biquadratic residues with 0 (Lehmer 1953, Hall 1956). v = 4°49, k=2 +3, 
A=(t' + 3)/4, n =3(t? + 3)/4 (¢ odd, v a prime power). 

Octic residues, v= 8a? + 1 = 64b? + 9 with v prime and a, b both odd, k =a? 
A=)’, n=a’°-B’. 

Octic residues with 0, v = 8a” + 49 = 64b7 + 441 with v prime and a odd, b 
even, K=a° +6, A=b°+7,n=a'-b’~—1. 

Many other cyclotomic constructions exist; e.g., for 16th powers see Lehmer 
(1954), Whiteman (1957). v = 133, k =33, A=8, n=25. 
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(10) Spence et al. (1993) constructed an SBIBD with v = 160, k = 54, A= 18, 
n=36. Spence (1993) generalized this to v=(q+1)(q"—1)/(q-1), 
k=q" \(q+1)/2, A=q" "(q’ ~ 1/4, n= q""-7(q + 1)°/4 when q is of the form 

P—i, 
: Jungnickel and Pott further generalized this, and found v =p‘(q?”—1)/ 
(q-1), k=q’""'p'', A=p’'q’" *(p*'— 1)/(p-1) where p is prime and 
q =(p’ — 1)/(p — 1) is a prime power. 

(11) Some special constructions. 

v = 56,k =11,A=2,n=9 (Hall et al. 1970) , 

v=79,k=13,A=2,n=11 (Aschbacher 1971) , 

v=49,k=16,A=5,n=11 (Brouwer and Wilbrink 1984) , 

v=71,k=15,A=3,n=12 (Haemers 1980, Beker and Haemers 1980) , 

v=71,k=21,A=6,n=15 (Janko and Tran van Trung 1984) , 

v = 66,k =26,A=10,n=16 (Tran van Trung (1982), Bridges (1983), 

v= 70,k =24,A=8,n=16 (Janko and Tran van Trung 1984), 

v =78,k =22,rA=6,n=16 (Janko and Tran van Trung 1985, 
Tonchev 1987) 

v= 176,k =50,A=14,n=36 (Higman 1969). 
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1. Introduction 


Association schemes are perhaps the most important unifying concept in algebraic 
combinatorics. They provide a common view point for the treatment of problems 
in several fields, such as coding theory, design theory, algebraic graph theory and 
finite group theory. 

Roughly, an association scheme is a very regular partition of the edge-set of a 
complete graph. Each of the partition classes defines a graph, and the adjacency 
matrices of these graphs (together with the identity matrix) form the basis of an 
algebra (known as the Bose—Mesner algebra of the scheme). But since this basis 
consists of 0-1 matrices, we see that the algebra is not only closed under matrix 
multiplication, but also under componentwise (Hadamard, Schur) multiplication. 
The interplay between these two algebra structures on the Bose—Mesner algebra 
yields strong information on the structure and parameters of an association 
scheme. 

The relation of the theory of association schemes to the various fields varies. 
Finite group theory mainly serves as a source of examples— any generously 
transitive permutation representation of a group yields an association scheme — 
but also as a source of inspiration — many group-theoretic concepts and results 
can be mimicked in the (more general) setting of association schemes. On the 
other hand, in combinatorics association schemes form a tool. Using inequalities 
for the parameters of an association scheme, one finds upper bounds for the size 
of error-correcting codes, and lower bounds for the number of blocks of a 
t-design. Many uniqueness proofs for combinatorial structures depend crucially on 
the additional information one gets in case such inequalities hold with equality. 

The graphs of the partition classes of an association scheme are very special. 
For instance distance-regular graphs and Kneser graphs are of this type. For these 
graphs the spectrum of the adjacency matrix can be computed from the 
parameters of the association scheme. Thus results about the eigenvalues of 
graphs (cf. chapter 31) can be applied. 

An association scheme with just two classes is the same as a pair of 
complementary strongly regular graphs. Because strongly regular graphs are 
themselves important combinatorial objects and because their treatment forms a 
good introduction to the more general theory, we shall start with a section on 
strongly regular graphs. In the subsequent section we treat the elements of the 
general theory of association schemes: the adjacency matrices, the Bose-Mesner 
algebra, the eigenvalues, the Krein parameters, the absolute bound and Del- 
sarte’s inequality on subsets of an association scheme. The last section is devoted 
to some special association schemes and the significance to other fields of 
combinatorics, as indicated above. 

Association schemes were introduced by Bose and Shimamoto (1952) and Bose 
and Mesner (1959) defined the algebra that bears their name. Delsarte (1973) 
found the linear programming bound, studied P- and Q-polynomial schemes, and 
applied the resulting theory to codes and designs. Higman (1974) introduced the 
more general concept of coherent configuration to study permutation representa- 
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tions of finite groups. There is much literature on association schemes and related 
topics. Bannai and Ito (1984) is the first text book on association schemes. On the 
more special topic of distance-regular graphs a lot of material can be found in 
Biggs (1974) and in Brouwer et al. (1989). Some introductory papers on 
association schemes are Delsarte (1974), Goethals (1979), chapter 21 of MacWil- 
liams and Sloane (1977), Haemers (1979), chapter 17 of Cameron and Van Lint 
(1980) and Seidel (1983). 


2. Strongly regular graphs 


A simple graph of order vu is strongly regular with parameters v, k, A, w whenever 
it is not complete or edgeless and 
(i) each vertex is adjacent to k vertices, 

(ii) for each pair of adjacent vertices there are A vertices adjacent to both, 

(iii) for each pair of non-adjacent vertices there are y vertices adjacent to 
both. 

For example, the pentagon is strongly regular with parameters (uv, k, A, w) = 
(5, 2,0, 1). One easily verifies that a graph G is strongly regular with parameters 
(v,k, A, #) if and only if its complement G is strongly regular with parameters 
(v,u-—k-1,v—2k +p -—2,u—2k+ A). The line graph of the complete graph of 
order m, known as the triangular graph T(m), is strongly regular with parameters 

4m(m —1),2(m—2),m-—2,4). The complement of 7(5) has parameters 
(10, 3, 0, 1). This is the Petersen graph. 

A graph G satisfying condition (i) is called k-regular. It is well known and 
easily seen that the adjacency matrix of a k-regular graph has an eigenvalue k 
with eigenvector 1 (the all-one vector), and that every other eigenvalue p satisfies 
|p| <k (see chapter 31 or Biggs 1974). For convenience we call an eigenvalue 
restricted if it has an eigenvector perpendicular to 1. We let / and J denote the 
identity and all-one matrices, respectively. 


Theorem 2.1. For a simple graph G of order v, not complete or empty, with 
adjacency matrix A, the following are equivalent: 
(i) G is strongly regular with parameters (v, k, 4, «) for certain integers k, 2, w, 
(ii) A? =(A~ p)At(k—p)J + wd for certain reals k, 2, p, 
(iii) A has precisely two distinct restricted eigenvalues. 


Proof. The equation in (ii) can be rewritten as 
A =kI+AAt+p(J—-1-A). 
Now (i) (ii) is obvious. 


(ii) > (ili): Let p be a restricted eigenvalue, and u a corresponding eigenvector 
perpendicular to 1. Then Ju = 0. Multiplying the equation in (ii) on the right by uw 
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yields p?=(A-)p +(k—). This quadratic equation in p has two distinct 

solutions. (Indeed, (A — »)’ = 4(2 — k) is impossible since » <k and A<k~1.) 
(iii) > (ii): Let r and s be the restricted eigenvalues. Then (A — r1)(A — sI) = 

aJ for some real number a. So A? is a linear combination of A,lTandJ. O 


As an application, we show that quasi-symmetric block designs give rise to 
strongly regular graphs. A quasisymmetric design is a 2-(v, k, 4) design such that 
any two blocks meet in either x or y points, for certain fixed x, y. (Cf. chapter 
14.) Given this situation, we may define a graph G on the set of blocks, and call 
two blocks adjacent when they meet in x points. Then there exist coefficients 
a,,...,a@, such that NN’ =a,1 + a,J, NJ =a,J, IN=a,J, A= a,N™N +ait+ 
a,J, where A is the adjacency matrix of the graph G. (The a; can be readily 
expressed in terms of uv, k, A, x, y.) Then G is strongly regular by (ii) of the 
previous theorem. (Indeed, from the equations just given it follows straight- 
forwardly that A’ can be expressed as a linear combination of A, I and J)A 
large class of quasi-symmetric block designs is provided by the 2-(v, k, A) designs 
with A = 1 (also known as Steiner systems S(2, k, v)) - such designs have only two 
intersection numbers since no two blocks can meet in more than one point. This 
leads to a substantial family of strongly regular graphs, including the triangular 
graphs T(m) (derived from the trivial design consisting of all pairs out of an 
m-set). 

Another connection between strongly regular graphs and designs is found as 
follows: Let A be the adjacency matrix of a strongly regular graph with 
parameters (v,k,A, A) (i.e., with A=; such a graph is sometimes called a 
(v,k, A) graph). Then by Theorem 2.1 (ii) 


AA‘=A°=(k-A)I + AS, 


which reflects that A is the incidence matrix of a square (‘“‘symmetric’’) 2-(v, k, A) 
design. (And in this way one obtains precisely all square 2-designs possessing a 
polarity without absolute points.) For instance, the triangular graph 7(6) provides 
a Square 2-(15, 8, 4) design, the complementary design of the design of points and 
planes in the projective space PG(3, 2). Similarly, if A is the adjacency matrix of 
a strongly regular graph with parameters (v,k,A,A+2), then A+J is the 
incidence matrix of a square 2-(v,k, A+ 2) design (and in this way one obtains 
precisely all square 2-designs possessing a polarity with all points absolute). 


Theorem 2.2. Let G be a strongly regular graph and adjacency matrix A and 
parameters (vu, k, A, w). Let rand s (r >s) be the restricted eigenvalues of A and let 
f, g be their respective multiplicities. Then 

(i) K(k —-1-a)=p(v-k~-1), 

(ii) rs=p—k,r+s=A— Bp, 

Gil) f, g=4@—-14 (r +5) —- 1) 4 2k)/(r-5)), 

(iv) r and s are integers, except perhaps when f= g, (v,k, A, ») = (4t+ 1, 2¢, 
t—1,t) for some integer t. 
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Proof. (i) Fix a vertex x of G. Let F(x) and A(z) be the sets of vertices adjacent 
and non-adjacent to x, respectively. Counting in two ways the number of edges 
between I(x) and A(x) yields (i). The equations (ii) are direct consequences of 
Theorem 2.1 (ii), as we saw in the proof. Formula (iii) follows from f + g=»—-1 
and O=trA=kKt+frt+gs=k+i(rt+s\(f+e)t+i@—s)(f—g). Finally, when 
f *¥g then one can solve for r and s in (iii) (using (ii)) and find that r and s are 
rational, and hence integral. But f=g implies (4 —A)(v —1)=2k, which is 
possible only for 2 —A=1,»=2k +1. O 


These relations imply restrictions for the possible values of the parameters. 
Clearly, the righthand sides of (iii) must be positive integers. These are the 
so-called rationality conditions. As an example of the application of the rationality 
conditions we can derive the following result due to Hoffman and Singleton 
(1960). 


Theorem 2.3. Suppose (v,k,0,1) is the parameter set of a strongly regular graph. 
Then (v, k) = (5, 2), (10, 3), (50, 7) or (3250, 57). 


Proof. The rationality conditions imply that either f = g, which leads to (»,k) = 
(5,2), or r—s is an integer dividing (r + s)(v — 1) + 2k. By use of Theorem 2.2 
(i)-(ii) we have 


s=—r-1, kerertl,  vart4+2r° 43"? 4+2r42, 


and thus we obtainr=1,20r7. O 


The first three possibilities are uniquely realized by the pentagon, the Petersen 
graph and the Hoffman-Singleton graph. For the last case existence is unknown 
(but see Aschbacher 1971). 


2.1. Conference matrices 


Some important classes of strongly regular graphs are related to so-called 
conference matrices (the name stems from their application to networks for 
conference telephony). Since these matrices are not treated elsewhere in this 
Handbook, we shall elaborate a bit on them here. (See also Wallis et al. 1972). 

A conference matrix of order n is an m X nm matrix C with zero diagonal and 1 or 
—1 elsewhere, such that CC’ = (n — 1)/. Two conference matrices are equivalent 
if one can be obtained from the other by a permutation of rows and columns and 
multiplication of some rows and columns by —1. Delsarte et al. (1971) proved 
that every conference matrix C of order n is equivalent to one of the form 


(° r 

1 S/’ 

where S is skew-symmetric (i.e., S = —S") in case n = 2 or n=0 (mod 4), and S is 
symmetric in case n =2 (mod 4). We call S a core of C. If S is symmetric, then 
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4(J —S—1) is the adjacency matrix of a strongly regular graph with parameters 
(n—1, $(n — 2), 4(n — 6), 4(@ — 2)). We call these graphs conference graphs; they 
are characterized by f=g, so they are the only cases in which non-integral 
eigenvalues can occur. Conversely, one can reconstruct a conference matrix from 
a conference graph. Belevitch (1950) (see also Van Lint and Seidel 1966) proved 
that for a symmetric conference matrix n — 1 must be the sum of two squares. 
Thus we have a necessary condition for the existence of conference graphs, called 
the conference matrix condition. 

If a conference matrix C is symmetric and has constant column sums, then 
4(J-C-—1) is the adjacency matrix of a strongly regular graph with parameters 
(n = 4r? + 4r +2, 2r? + 2,r’—1,r’), or the complement. (Here n —1 is a square, 
so the conference matrix condition is automatically fulfilled.) For many values of 
n conference matrices are known; see for instance Goethals and Seidel (1967) and 
Mathon (1978). Paley (1933) defined a matrix § whose rows and columns are 
indexed by the elements of a finite field F, (q odd), by 


Sy =xX@-y), 


where y is the quadratic residue character (i.e., y(a) = 0, 1 or —1 according to a 
being zero, a non-zero square, or a non-square respectively). It follows that S is 
the core of a conference matrix (S is symmetric if g=1 (mod 4) and skew- 
symmetric otherwise). If g is a square, then C is equivalent to a conference matrix 
with constant row sum. Conference matrices are related to Hadamard matrices 
(see the appendix of chapter 14): if C is skew-symmetric, then C+/ is a 
Hadamard matrix, and if C is symmetric, then 


Coir c21) 
is one. 


2.2. Further restrictions 


Except for the rationality conditions and the conference matrix condition, a few 
other restrictions on the parameters are known. We mention two of them. The 
Krein conditions, due to Scott (1973), can be stated as follows: 


(r+ 1)\(k +r + 2rs)<(k +r)(s+1)’, 
(s+ 1)(k+s+2rs)<(k+s)(r+1)’. 


Seidel’s absolute bound (see Delsarte et al. 1975, Koornwinder 1976, Seidel 1979) 
reads 


vi f(f +3), 
v<3e(g+3). 


The Krein conditions and the absolute bound are special cases of general 
inequalities for association schemes — we will meet them again in the next section; 
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the conference matrix condition is the analogue of the Bruck-Chowla—Ryser 
theorem for square 2-designs. In Brouwer and Van Lint (1984) one may find a list 
of all known restrictions; this paper gives a survey of the recent results on strongly 
regular graphs. It is a sequel to Hubaut’s (1975) earlier survey of constructions. 
Seidel (1979) gives a good treatment of the theory. Some other references are 
Bose (1963), Cameron (1978) and Cameron and Van Lint (1980). 


3. Association schemes 


An association scheme with d classes is a finite set X together with d + 1 relations 
R, on X such that 
(i) {Ry, R,,-.-,Rg} is a partition of X x X; 

(ii) Ry = ((x, x) |x © X}; 

(iii) if C y) €R,, then also (y, ER, for all x, yEX andie€ {0,...,d}; 

(iv) for any (x, y) ER, the number Pi of zEX with (x, z) ER, and (2, y) ER, 
depends only on i, jand k. 

The numbers Py are called the intersection numbers of the association scheme. 
The above definition is the original definition of Bose and Shimamoto (1952); it is 
what Delsarte (1973) calls a symmetric association scheme. In Delsarte’s more 
general definition, (iii) is replaced by: 

(iii’) for each i € {0,...,d} there exists aj € {0,...,d} such that (x, y)ER,; 
implies (y, x) E R;, 

(iii”) P= Py for all i, j, KE {0,..., d}. 

Higman (1974, 1975, 1976) studied the even more general concept of a coherent 
configuration. He requires (i), (iii’), (iv) and 

(ii) {(x, x)|x © X} is a union of some R,. 

If (ii) holds, then the coherent configuration is called homogeneous. We shall 
not treat coherent configurations here, but content ourselves with the remark that 
a coherent configuration with at most five classes must be an association scheme 
(in the sense of Delsarte), see Higman (1975). 

Define n:=|X|, and n,:= p°. Clearly, for each i€ {l,. ..,@}, (X,R,) is a 
simple graph which is regular of degree n,. 


Theorem 3.1. The intersection numbers of an association scheme satisfy 
kook 
(i) Po = Bis P= 6,7;, Pi; = Pjis 
(ii) u, Dy= bs Dy n,n, 
(iii) pin, = Dials 
(iv) Dy py Per = Lt Pg Pr 
Proof. (i)—(iii) are straightforward. The expressions at both sides of (iv) count 


quadruples (w, x, y,z) with (w,x)ER,, (x, y)ER;, (y,z)ER,, for a fixed pair 
(w,zJER,. O 
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It is convenient to write the intersection numbers as entries of the so-called 
intersection matrices Ly,..., Lg: 


(Lia = Pi : 


Note that LZ, =J. From the definition it is clear that an association scheme with 
two classes is the same as a pair of complementary strongly regular graphs. (With 
Delsarte’s more general definition one also gets the skew-symmetric conference 
matrices.) If (X,R,) is strongly regular with parameters (v,k, A, 2), then the 
intersection matrices of the scheme are 


Ok 0 0 0 v-k-1 
L,=(1 A k-A-1], L,=|0 k-a-1 v—2k+2 . 
O nw k-p 1 k-p y-2k+p-—2 


We see that (iii) generalizes Theorem 2.2(i). 


3.1. The Bose—Mesner algebra 


The relations R; of an association scheme are described by their adjacency 
matrices A; of order n defined by 


(A), ={ 


In other words, A, is the adjacency matrix of the graph (X, R,). In terms of the 
adjacency matrices, the axioms (i)—(iv) become 
(@) Dy A= J, 

(iii) A; =A;, for all i€ {0,..., d}, 

(iv) A,A;= XL, piyA,, for all i, j, KE {0,..., d}. 

From (i) we see that the 0-1 matrices A, are linearly independent, and by use 
of (ii)-(iv) we see that they generate a commutative (d + 1)-dimensional algebra 
of symmetric matrices with constant diagonal. This algebra was first studied by 
Bose and Mesner (1959) and is called the Bose—Mesner algebra of the association 
scheme. 

Since the matrices A; commute, they can be diagonalized simultaneously (see 
Marcus and Minc 1965), that is, there exists a matrix S such that for each AE &, 
S~'AS is a diagonal matrix. Therefore sf is semi-simple and has a unique basis of 
minimal idempotents Ey,...,£, (see Burrow 1965). These are matrices satisfy- 
ing 


1 whenever (x, y)ER; , 
0 otherwise . 


The matrix (1/n)J is a minimal idempotent (idempotent is clear, and minimal 
follows since rk J=1). We shall take Ey=(1/n)J. Let P and (1/n)Q be the 
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matrices relating our two bases for sw: 


A, = PyEis 
i=0 
1 d 
E= 7, 2 OA, 
Then clearly 
PQ=QP=nl. 
It also follows that 
A, E, = PE, 


ii 


which shows that the P,, are the eigenvalues of A, and the columns of E; are the 
corresponding eigenvectors. Thus yp, ‘= rk E, is the multiplicity of the eigenvalue 
, of A; (provided that P,, # P,, for k #i). We see that My =1, Loj20 Hy =”, and 
B= tr E. =n(E;);, (indeed, E, has only eigenvalues 0 and 1, so rk E, equals the 
sum of the eigenvalues). 


Theorem 3.2. The numbers P, and Q ,, satisfy 
(i) ee Qio=1, Po =1;, Qo; = Mis 
i 
(ii) P ij Pix =Y. =0 Pi P if) 
(iii) BP iy =n, Qui yi: uP, P, ik 7 nnd ik? y, n; QQ = ny,6 ik? 
(iv) | P,| <n,, |Q | <u,- 


Proof. Part (i) follows easily from 0, E,=!= Ay, 4, A;=J=nEo, AJ =nJ, 
and tr E, = y,. 

Part (ii) follows from A|A, =X, DuAp 

The first equality in " (iii) follows from yin {0iP, = nO, =tr AA, = 

;#;P,,P.,, since P is non-singular; this also proves the second equality, and the 
last one follows since PQ = nl. 

The first inequality of (iv) holds because the P,, are cigenvalues of the 
n,-regular graphs (X, R,). The second inequality then follows by use of (iii). O 


Relations (iii) are often referred to as the orthogonality relations, since they 
State that the rows (and columns) of ? (and Q) are orthogonal with respect to a 
suitable weight function. 

If d=2, and (X,R,) is strongly regular with parameters (v,k, A, «), the 
matrices P and Q are 


1k v-k-1 1 f g 
r s 
p=|! 7 -r-l O= 1 PE 8] , 
r+1 stl 
1s -s-1 1 -f>-K-1 ~8o-k-1 
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where r, 5, f and g can be expressed in terms of v, k, A by use of Theorem 2.2. 
In general the matrices P and Q can be computed from the intersection 
numbers of the scheme, as follows from the following. 


Theorem 3.3. For i=0,...,d, the intersection matrix L has eigenvalues P, 
(O<i<d). 


Proof. Theorem 3.2(ii) yields 
P, 2 Pu (Po "Yam = > Pi(Lj)ig(P Yam ’ 
hence PL,P~' = diag(Py,,..-,P,;)- O 


Thanks to this theorem, it is relatively easy to compute P, Q (=(1/n)P~') and 
HM; (=Qo,). It is also possible to express P and Q in terms of the (common) 
eigenvectors of the L,. Indeed, PL,P = diag(P,,,..., P4;) implies that the rows 
of P are left eigenvectors and the columns of Q are right eigenvectors. In 
particular, 4, can be computed from the right eigenvector u; and the left 
eigenvector v,, normalized such that (u,)y=(v,))=1, by use of p,u;v, =n. 
Clearly, each yz, must be an integer. These are the rationality conditions for an 
association scheme. As we saw in the case of a strongly regular graph, these 
conditions can be very powerful. Godsil (see chapter 31) puts the rationality 
conditions in a more general form, which is not restricted to association schemes. 


3.2. The Krein parameters 


The Bose-Mesner algebra x is not only closed under ordinary matrix multiplica- 
tion, but also under componentwise (Hadamard, Schur) multiplication (denoted 
°). Clearly {A,,...,A,} is the basis of minimal idempotents with respect to this 
multiplication. 

Write 


The numbers qi thus defined are called the Krein parameters. (Our qi; are those 
of Delsarte, but differ from those of Seidel (1979) by a factor n.) As expected, we 
now have the analogue of Theorems 3.1 and 3.2. 


Theorem 3.4. The Krein parameters of an association scheme satisfy 
(i) qo; = Sin» qi — 5;;h;» qi; = Gyr 
Gi) Bay = ay Beam, 
(ili) Gish = Dig bys 
iv) 3 gi@7= i qn 
(iv) 1Fy Tk: y, Fei Tit > 
(v) 2 Qi = vi<0 Gin Qits 
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(vi) ni, |; = uy 1,0,,2,Q,- 


Proof. Let (A) denote the sum of all entries of the matrix A. Then JAJ = 
¥ (A)J, 0 (AeB)=tr AB" and ¥ (E,) = 0 if i#0; since then E,J = nE,E, =0. 
Now (i) follows by use of £,°E, = (1/n)E,, qi, = b (E,°E;) = tr E,E, = = 6,4, and 
E,° E, = E,° E,, respectively. Equation (iv) follows by evaluating E,°E,°E, in two 
ways, and (iii) follows from (iv) by taking m=0. Equation (v) follows from 
evaluating A,;°E,°E, in two bie and (vi) follows from (v), using the ortho- 
gonality relation ‘Yn N,Q imQ ne = Omg yt Finally, by use of (iii) we have 


py De Gi) = De Tham = HE, Ey) =D (Eu (Edu = ibe » 
i 
proving (ii). O 


The above results illustrate a dual behaviour between ordinary multiplication, 
the numbers Pi and the matrices A, and P on the one hand, and Schur 
multiplication, the numbers qi; and the matrices E; and Q on the other hand. If 
two association schemes have ‘the property that the intersection numbers of one 
are the Krein parameters of the other, then the converse is also true. Two such 
schemes are said to be (formally) dual to each other. One scheme may have 
several (formal) duals, or none at all (but when the scheme is invariant under a 
regular abelian group, there is a natural way to define a dual scheme, cf. Delsarte 
(1973). In fact usually the Krein parameters are not even integers. But they 
cannot be negative. These important restrictions, due to Scott (1973), are the 
so-called Krein conditions. 


Theorem 3.5. The Krein parameters of an association scheme satisfy qi; =0 for all 
i, j, KE{O,..., d}. 


Proof. The numbers (1/n)qi; (0<k<d) are the eigenvalues of E,°E, (since 
(E,;° E,)E, = (l/n)qiE, ). On the other hand, the Kronecker product £, '@E, is 
positive semidefinite, since each EF, is. But E,°£; is a principal submatrix of 
E,@E,, and therefore is positive ‘semidefinite as well, i.e., has no negative 
eigenvalue. O : 


The Krein parameters can be computed by use of Theorem 3.4(vi). This 
equation also shows that the Krein condition is equivalent to 


>» 1,0,,QyQ,20 for alli, j,k {0,...,d). 
t 


In case of a strongly regular graph we obtain 
2 3 3 
I = f (1 + = 5) > 0 ; 
Nake (Gerd) 


ah-£ (1+ 5-—& ol )e 0 
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(the other Krein conditions are trivially satisfied in this case), which is equivalent 
to the result mentioned in the previous section. 

Neumaier (1981) generalized Seidel’s absolute bound to association schemes, 
and obtained the following. 


Theorem 3.6. The multiplicities p,; (Q<i<d) of an association scheme with d 
classes satisfy 


MiB; ifixs, 
Zaetitwen gies 
5 ie; ie a as ie 


Proof. The left-hand side equals rk(E;° E;). But rk(E,° E,) <rk(E, @ E,) = rk E, x 
rk E, = p,p,. And if i=j, then rk(E; cE) < 5 (uw, +1). Indeed, if the rows of E, 
are linear combinations of y, rows, then the rows of £,°E, are linear combina- 
tions of the 4; + 3 4,(u; — 1) rows that are the elementwise products of any two of 
these »;, rows. O 


For strongly regular graphs with g}, =0 we obtain Seidel’s bound: v < ff + 
3). But in case Git > 0, Neumaier’s result states that the bound can be improved 
tov<$f(f +1). 


3.3. Subsets of association schemes 


The last subject of this section is a result of Delsarte (1973, Theorem 3.3, p. 26) 
on subsets in association schemes. For Y CX, Y ¥9, we define 


1 
a,= x TAX 
Fal 


where y is the characteristic vector of Y. In other words, a; is the average degree 
of the subgraph of (X, R;) induced by Y. Clearly a, =1, and )1,.. a, =|Y|. The 
vector @ = (a@,,...,@,) is called the inner distribution of Y. 


Theorem 3.7. The inner distribution a of a non-empty subset of an association 
scheme satisfies aQ = 0. 


d : sas 
Proof. |Y| 2. 4,0, = <0 x'Q,Aix=nx'E,x 20, since E, is positive 
‘semidefinite. O 


This inequality leads to Delsarte’s linear programming bound, as we shall see in 
the next section. 

As an application we have the following result (Delsarte 1973, Theorem 3.9, p. 
32). 


Theorem 3.8. Let {{0}, Z,, 1,} be a partition of {0,...,d}, and assume that y and 
Z are non-empty subsets of X such that the inner distribution b of Y satisfies b; = 0 
for i€1,, and the inner distribution c of Z satisfies c,=0 for i€I,, Then 
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|Y|-|Z|<|X|, and equality holds if and only if for all i #0 we have (bQ), =0 or 
(cQ); =9 


ses Define B; = uw; '|Z|"' Uj ¢,Q;;. Then B, = 1, 8, =0 for all i, and Y), B,Q,, = 
i; '|Z|"'ne,. Now we have 


I= Db, =60)05 2 bQu8= 2 Ooh=TZi- 0 


Let us investigate a special case of this situation somewhat closer. Let, for 
1C{0,...,d}, the /-sphere around the point x € X be the set {y EX | (x, y)ER, 
for some i€ J}. A non-empty subset Y of X is called perfect (more precisely, 
I-perfect) when the /-spheres around its points form a partition of X. 


Theorem 3.9 (‘“Lloyd’s theorem”’, cf. Lloyd 1957, Delsarte 1973, p. 63). Let Y be 
perfect, with inner distribution a. Then Yc, P,,=0 for all j AO such that 
(aQ), #0. 


Proof. Apply the previous theorem, with for Z an /-sphere. If c is the inner 
distribution of Z, then 


IZ]; (cQ), =H > n Pin y= 2 P Pak = >> i PiePin 
=(5 sy gig 


4. Applications 


In this section we discuss some special types of association scheme and their 
significance to other fields of combinatorics. 


4.1. Distance regular graphs 


Consider a connected simple graph with vertex set x of diameter d. Define 
R, CX by (x, y) € R, whenever x and y have graph distance /. If this defines an 
association scheme, then the graph (X,R,) is called distance-regular. The 
corresponding association scheme is called metric. By the triangle inequality, 
Py= =Oifi+j<k or |i—j|>k. Moreover, p/;’ >0. Conversely, if the intersection 
numbers of an association scheme satisfy these conditions, then (X, R,) is easily 
seen to be distance-regular. 

Many of the association schemes that play a r6le in combinatorics are metric. In 
fact, all the examples treated in this chapter are metric. Strongly regular graphs 
are obviously metric. The line graph of the Petersen graph and the Hoffman— 
Singleton graph are easy examples of distance-regular graphs that are not strongly 
regular. 
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Any k-regular graph of diameter d has at most 
L+k+k(k—1)4+---+k(k-1)77' 


vertices, as is easily seen. Graphs for which equality holds are called Moore 
graphs. Moore graphs are distance-regular, and those of diameter 2 were dealt 
with in Theorem 2.3. Using the rationality conditions Damerell (1977) and 
Bannai and Ito (1973) showed the following. 


Theorem 4.1. A Moore graph with diameter d =3 is a (2d + 1)-gon. 


A strong non-existence result of the same nature is the theorem of Feit and G. 
Higman (1964) about finite generalized polygons. A generalized m-gon is a 
point-line geometry such that the incidence graph is a connected, bipartite graph 
of diameter m and girth 2m. It is called regular of order (s, t) for certain (finite or 
infinite) cardinal numbers s, ¢ if each line is incident with s +1 points and each 
point is incident with ¢ + 1 lines. (It is not difficult to prove that if each point is on 
at least three lines, and each line has at least three points (and m<~), then the 
geometry is necessarily regular, and in fact s =¢ in case m is odd.) From such a 
regular generalized m-gon of order (s, t), where s and ¢ are finite and m>3, we 
can construct a distance-regular graph with valency s(t+1) and diameter d= 
[4m] by taking the collinearity graph on the points. 


Theorem 4.2. A finite generalized m-gon of order (s,t) with s > 1 and t> 1 satisfies 
mé€ {2, 3, 4, 6, 8}. 


Proofs of this theorem can be found in Feit and Higman (1964), Kilmoyer and 
Solomon (1973) and Roos (1980); again the rationality conditions do the job. The 
Krein conditions yield some additional information: 


Theorem 4.3. A finite regular generalized m-gon with s>1 and t> 1 satisfies s < tr 
and t<s* if m=4 or 8; it satisfies s<t° andt<s° if m=6. 


This result is due to Higman (1971, 1974) and Haemers and Roos (1981). For 
each m € {2, 3, 4, 6, 8} infinitely many generalized m-gons exist. (For m=2 we 
have trivial structures — the incidence graph is complete bipartite; for m =3 we 
have (generalized) projective planes; an example of a generalized 4-gon of order 
(2, 2) with collinearity graph 7(6) can be described as follows: the points are the 
pairs from a 6-set, and the lines are the partitions of the 6-set into three pairs, 
with obvious incidence.) 

Many association schemes have the important property that the eigenvalues P,, 
can be expressed in terms of orthogonal polynomials. An association scheme is 
called P-polynomial if there exist polynomials f, of degree k with real coefficients, 
and real numbers z, such that P,, =f,(z;). Clearly we may always take z; = P,,. 
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By the orthogonality relation Theorem 3.2(iii) we have 
D hE Mi) =D ePyPue = MB » 
which shows that the f, are orthogonal polynomials. 
Theorem 4.4. An association scheme is metric if and only if it is P-polynomial. 


Proof. Let the scheme be metric. Theorem 1.1 gives 
A,A, = pi; A;- 1 + pA; +P Ais, 


Since p\;' #0, A;,, can be expressed in terms of A,, A;_, and A;. Hence for 
each j there exists a polynomial f, of degree j such that 


A, = f(A.) : 


Using this we have P,,F, = A,E,;=f(A,)E, =f(AsE JE, =F(Pa)E,, hence P, 
GPia)- 
"Now suppose that the scheme is P-polynomial. Then the f, are orthogonal 


polynomials, and therefore they satisfy a 3-term recurrence relation (see Szegé 
1959, p. 42) 


O41 f41(2Z) = (CB; — 22) + y-1f-1@) - 
Hence 
P; Py = 01 P iy. + BP, ; +; P, 


j-1" y-1 


fori=0,...,d. 


Since P,P, = py PP and P is non-singular, it follows that Py =0 for |/—j|>1. 
Now the full metric property easily follows by induction. O 


This result is due to Delsarte (1973, Theorem 5.6, p. 61). There is also a result 
dual to this theorem, involving so-called Q- polynomial and cometric schemes. 
However, just as the mtersection numbers Py have a combinatorial interpretation 
while the Krein parameters qi; do not, the metric schemes have the combinatorial 
description of distance-regular graphs, while there is no combinatorial interpreta- 
tion for the cometric property. For more information on P- and Q-polynomial 
association schemes, see Delsarte (1973) and Bannai and Ito (1984); for distance- 
regular graphs, see the book by Brouwer et al. (1989). 


4.2. The Hamming scheme and the linear programming bound for 
error-correcting codes 


Let X = 2%, the set of all vectors of length d with entries in 2, where 2 is some 
set of size q. Define R,C X” by (x, y)ER, if the Hamming distance between x 
and y (i.e., the number of coordinates in which x and y differ) equals i. This 
defines an association scheme, the Hamming scheme H(d,q). The Hamming 


Association schemes 763 


scheme is easily seen to be metric, and hence by Theorem 4.4 P-polynomial. The 
orthogonal polynomials involved are the Kravéuk polynomials K,(x). 


Theorem. 4.5. For the Hamming scheme H(d, q) we have 
i ‘ ; 
perfi\f(n-i 
P,= y= KO= DCN - "GG aa) 
See Delsarte (1973, p. 38) or MacWilliams and Sloane (1977) for proofs. From 
Theorem 4.5 we see that P = Q, so the Hamming scheme is self-dual. 
A subset YCX of H(d,q), such that Y7OR,=@ for i=1,...,6—1 and 
y7n R, #9 (i.e., a subset Y such that the minimum Hamming distance between 
two vectors of Y equals 6), is nothing but an error-connecting code with 


parameters (d, |Y|, 5) over the alphabet 2 (cf. chapter 16). Let a= (a, ... , ag) 
be the inner distribution of Y. Then 1, a, = |Y|, a) =1, a, =": =a,_, =0, a=0 
(by definition), and aQ =0 by Theorem 3.7. 

Consider a,,...,a, aS variables and define 


d 
a*=1+max>, a; 
i=6 


d 
subject to K(1)+> aK(i)20, j=0,...,d, 
i=6 
a,20, i=4d,...,d. 


Then clearly |Y| <a*. So a* is an upper bound for the number of codewords with 
a given length and minimum distance. This bound, due to Delsarte (1973), is 
called the linear programming bound, since the value of a* can be computed by 
linear programming. Of course, the above-mentioned inequalities are not the only 
ones satisfied by the a;, and by adding extra inequalities to the system, one may 
obtain sharper bounds on |Y|. For details and more applications to coding theory, 
see Delsarte (1973), MacWilliams and Sloane (1977), Best et al. (1978), Best and 
Brouwer (1977), Brouwer (1993). 


4,3. The Johnson scheme and t-designs 


Let the set X consist of all subsets of size d of a set M, where |M| =m 2 2d. 
Define relation R, CX’ by (x, y) ER, if the Johnson distance between x and y 
(i.e., the cardinality of x\y) equals i. This defines an association scheme, the 
Johnson scheme J(m, d). Since the Johnson distance between x and y equals twice 
the Hamming distance between (the characteristic vectors of) x and y, it follows 
that also the Johnson distance satisfies the triangle inequality, so that the Johnson 
scheme is metric. Note that the graph (X, R,) is complete for d= 1, and is the 
triangular graph 7(m) for d = 2. The following result (due to Ogasawara 1965 and 
Yamamoto et al. 1965) gives some parameters. 
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Theorem 4.6. For the Johnson scheme J(m, d) the following hold: 


“ig 2,=8()= 3 1° (dz Ge 2) Garis 
-Lou@M EN): 
me (T)- GE). me G7). 


Here E,(x) is a so-called Eberlein polynomial. It has degree 2j in the indetermi- 
nate x, and degree j in the indeterminate x(m+1-— x). Since P,, = d(m—1)- 
im +1—i), E,(é) indeed has degree j in P,, as required by the definition of 
P-polynomial scheme. 

The graph (X, R,) of a Johnson scheme is called a Kneser graph. A subset Y of 
X such that any two elements of Y have non-empty intersection is a coclique 
(independent set) of the Kneser graph. By use of Theorem 3.7 and the above 
formulas, it can be deduced that 


vis (> 


the famous result of Erdés et al. (1961). 

A t-(m, d, A) design is a subset Y C X of the Johnson scheme J(m, d) such that 
each f-element subset of M is contained in precisely A elements of Y. Delsarte 
(1973, Theorem 4.7, p. 51) proved the following. 


P= 


4) 


Theorem 4.7. A non-empty subset Y of the Johnson scheme J(m,d) with inner 


distribution a =(ay,...,@,) is a t-(m, d, A) design if and only if 
d 
» 4,0,,=0 for j=1,...,¢. (*) 
s=0 


Just as we did in case of the Hamming scheme, we can define 


d 
a,=1+min > aq, 


1 


d 
subject to a,20fori=1,...,d, and } a,0,,=Oforj=1,...,%, and 
i=0 


d 

2» 4,0; 2=Qforj=r+l1,...,d, 
where the Q;, are found in Theorem 4.6. Now we have the linear programming 
(lower) bound for the number of blocks in a t-design; |Y|=a,. Using the simplex 
algorithm, Haemers and Weug (1974) showed that this inequality implies the 
non-existence of the designs with parameters 4-(17, 8, 5), 4-(23, 11, A), A=6, 12, 
4-(24, 12, 15), 6-(19, 9, A), AS10, 6-(20, 10, A), A=7, 14. In certain other cases, 
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such as 5-(19, 9, 7), Kohler (1985) ruled out the existence of t-designs by showing 
that no solution of the above system of inequalities corresponds to an actual 
design. 

If we replace the restrictions (aQ),; = 0 by restrictions a, = 0 (1 <i <48) in this 
system of inequalities, and maximize La,, we obtain upper bounds for the 
cardinality of codes with minimum distance 6 and constant weight d. 

We might also require both a,=0 and (aQ),=0 for suitable i, j, and obtain 
results for t-designs with restricted block intersections, such as quasi-symmetric 
block designs. By this method it follows for instance that a quasi-symmetric 
2-(29, 7, 12) design with block intersections 1 and 3 cannot exist (Haemers 1975). 
For details and more results we refer to Delsarte (1973), MacWilliams and Sloane 
(1977), Best et al. (1978), Cameron and Van Lint (1980), Calderbank (1988). We 
also point out that several results of section 3 of chapter 31 can also be obtained 
by use of the framework of Johnson schemes. 

Delsarte (1973) generalized the notion of t-designs to subsets of arbitrary 
association schemes satisfying (*). (Equivalently, a ¢t-design is a subset Y of X 
such that its characteristic vector y satisfies E,y =0 for j=1,...,¢.) He shows 
that a ¢-design in the Hamming scheme is what is known as an orthogonal array of 
strength ¢ (see chapter 14). Thus, we also have a linear programming bound for 
orthogonal arrays. More generally, one may give an interpretation of the classical 
concept of -design in terms of ranked posets in the obvious way, and then prove 
for each of the eight known infinite families of P- and Q-polynomial association 
schemes that a subset is a classical t-design if and only if it is a Delsarte f(t)-design 
(where usually f(¢) =1), see Delsarte (1976) and Stanton (1986). 


4.4, Imprimitive schemes 


In section 2 we saw how to associate to a quasi-symmetric 2-design an association 
scheme whose points are the blocks of the design. In the previous subsection we 
saw how ¢-designs give rise to interesting subsets of the Johnson scheme. Let us 
give one more example of a relation between designs and association schemes. 
Let N be the incidence matrix of a square 2-design. Then, defining A, =1, 


a a9 


and A,=J—I-—A,-—A,, we obtain a 3-class association scheme. It is imprimi- 
tive, that is, the union of some of the R; form a non-trivial equivalence relation 
(here R, is an equivalence relation). Another imprimitive association scheme we 
have seen in the line graph of the Petersen graph (where having maximal distance 
is an equivalence relation). 

Given an imprimitive association scheme one may produce new association 
schemes; on the one hand, there is a natural way to give the set of equivalence 
classes the structure of an association scheme (the “quotient scheme”), and on 
the other hand, each equivalence class together with the restrictions of the 
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original relations becomes an association scheme (a ‘‘subscheme”’ of the original 
scheme). See Brouwer et al. (1989, section 2.4) for details. 


4.5. The group case 


We very briefly discuss some relations between association schemes and finite 
permutation groups. 

Let G be a permutation group acting on a set X. Then @ has a natural action 
on X’, the orbits of which are called orbitals. Suppose @ acts generously 
transitive on X, that is, for any x, y@X there exists an element of ¢ 
interchanging x and y. Then the orbitals form an association scheme. 

(Without any requirements on &, the orbitals form a coherent configuration 
(see section 3). The coherent configuration is homogeneous if @ is transitive. We 
get an association scheme in the sense of Delsarte when the permutation 
character is multiplicity free.) 

For any x € X, the number of orbitals equals the number of orbits on X of & 
(the subgroup of $ of permutations fixing x). This number is called the rank of &. 
Thus, the number of classes in the association scheme is one less than the rank of 
@. We can also transfer other permutation group theoretic terminology and results 
to the theory of association schemes. For instance, the Bose—Mesner algebra is in 
the group case known as the centralizer algebra, and all standard results on this 
centralizer algebra (cf., e.g., Wielandt 1964) have their direct analogue for the 
Bose—Mesner algebra. 

The Hamming and Johnson schemes are derived from generously transitive 
permutation representations as discussed above; for instance, the Johnson scheme 
is derived from the representation of the symmetric group Sym (m) on the 
d-clement subsets of an m-set. 

If a metric association scheme belongs to the group case, then the corre- 
sponding distance-regular graph is called distance-transitive. In other words, a 
graph is distance-transitive when its group of automorphisms is transitive on pairs 
of vertices with a given distance. Distance-transitive strongly regular graphs are 
known as rank 3 graphs. A rank 3 permutation group is generously transitive if 
and only if it has even order; consequently every rank 3 permutation group of 
even order provides a strongly regular graph. All such strongly regular graphs 
have recently been classified, see Kantor and Liebler (1989), Liebeck 
(1985, 1987), and Liebeck and Saxl (1986). 


4.6. Euclidean representations of graphs and association schemes 


It is possible to represent a graph in a Euclidean space and obtain combinatorial | 
information from facts in Euclidean geometry. For example, let G be a graph 
with adjacency matrix A with smallest eigenvalue s. If m= —s, then the matrix 
A + ml is positive semidefinite, and hence we can write A + mI = Z"Z, for some 
teal f xn Ee Z (where f is the rank of A + mJ). This yields a representation 
:G>R! sending the vertex x of G to column x of Z. For this representation we 
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have for the inner products 


m if y=x 
(x,y)=(Atm),,= )1 ifyEerG), 
QO otherwise . 


The above shows that a lower bound on the smallest eigenvalue of G yields a 
Euclidean representation of G with known inner products. But conversely, 
suppose we have a Euclidean representation of G with inner products m, 1, 0 for 
equal, adjacent and non-adjacent vertices, respectively. Then the Gram matrix of 
this representation is m/ + A, and since Gram matrices are positive semi-definite, 
it follows that A has smallest eigenvalue = —m. 

As a very simple application, consider a graph G with smallest eigenvalue —1. 
We find a representation by vectors of norm 1 and it follows that having distance 
at most one is an equivalence relation on G, that is, G is a union of cliques. 

As a second application, consider graphs G with smallest eigenvalue = —2. Our 
representation (for m = 2) satisfies 


2 ify=x, 
wef if yET(x), 
0 otherwise 
and hence the Z-span of the image G is a root lattice. But all root lattices are 
known (they are direct sums of the lattices A,, D,, E,, E;, E,) and this makes it 
possible to classify all such graphs G. See Cameron et al. (1976). 

For an association scheme we have a natural supply of positive semi-definite 
matrices, namely the idempotents, and we can play the same game. Let (X, @) be 
an association scheme, E an idempotent, and consider the map :X—R” 
sending x EX to E.,, column x of E. (Then the dimension of the span of the 
image of X under this map is the rank of E.) For this representation we have for 
the inner products: 


(x, y) os E,.E., = ey ¥ 
In particular, if E=E,, then E=(1/n) ©, QA; and we find that 
(x,y)=Q,, whenever (x, y)ER;. 


' Suppose a graph G has a representation — in R/ satisfying 


p ify=x, 
(x, y)=9q if dG, y)=1, 
r if d(x, y)=2, 


where d(x, y) denotes the graph distance between the vertices x and y. Fix a 
vertex x of G, and define a representation of I(x) by ¥=y—(q/p)x. Then 
(¥, x) =0 for all y€ F(x), and we have a representation of I(x) in Ro 
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satisfying 
p-q'ip if y=x, 
(8,.9=549-Q lp ifd,y)=1, 
r—q’/p otherwise . 


In a number of cases the above techniques suffice to determine all graphs with 
given parameters. For example, let G be a distance-regular graph with the same 
parameters as the Johnson graph J(m, d) (with as vertices the d-subsets of a fixed 
m-sets, two d-subsets being adjacent when they meet in a (d — 1)-set). From 
Q,,=P,,m,/k,; and the formulas given in Theorem 4.6 above, it follows that 
Q,, =m —1—im(m — 1)/d(m — d), which is linear in i. It follows that every local 
graph I(x) has smallest eigenvalue =-—2, so that the theory developed in 
Cameron et al. (1976) applies. This enabled Neumaier (1985) and Terwilliger 
(1986) (independently) to completely classify such graphs G. (The result being 
that necessarily G = J(m, d), except in the case (m, d) = (8, 2), where there are 
three other graphs, known as the Chang graphs, with the same parameters as the 
triangular graph 7(8).) 

As a second example, due to Ivanov and Shpectorov (1984), consider a 
distance-regular graph G with the same parameters as the dual polar graph for 
U(2d,r) (that is, the graph with as vertices the maximal totally isotropic 
subspaces in PG(2d—1,q) provided with a non-degenerate hermitean form, 
where g=r’; two vertices are adjacent when they meet in a subspace of 
codimension one in both). The graph G is regular of valency k =rm, its local 
graphs I(x) are regular of valency A=r—1, and G has smallest eigenvalue 
6, = —m with multiplicity f=k +1—-m, where m =(q* — 1)/(q — 1). Using the 
idempotent E, we find a representation of G in R/, and then of F(x) in R’~', with 
inner products (%, y) = Q,, = c((-r)” ~ 1)/(—r—1) (for some positive constant c) 
whenever d(x, y) =i. Now the Gram matrix M of the representation of I(x) has 
eigenvalue 0 with multiplicity at least m, so that I(x) has eigenvalue A with 
multiplicity at least m, so that I(x) has at least m connected components. It 
follows that F(x) is a union of cliques, and that our graph G is the collinearity 
graph of a near polygon with classical parameters. If d >3 then every such G is 
known, and we find that G must be the U(2d,r) dual polar graph. 
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1. Introduction 


Error-correcting codes (sometimes called error-control codes) are used to protect 
digital data against errors that can occur when the data is transmitted over a 
so-called “‘noisy’’ communication channel. One of the modern applications is what 
is known as fault-tolerant computing. Here, a system of collaborating computers 
produces correct results, even though some of the intermediate results are 
communicated incorrectly from one computer to another. There are two other 
spectacularly successful applications of coding theory. The first are the pictures of 
several planets taken by satellites such as Voyager. The signals that the satellites 
send to earth are so distorted by noise that completely unintelligible pictures 
would result if one did not use error-correcting codes. The second application is 
the compact disc digital audio system that produces music of the finest quality, 
even if there are dust particles, scratches, etc. on the disc. Without Reed- 
Solomon codes (see section 5) this would not be the case. One of the reasons that 
the topic is increasing in importance is the fact that electronic circuits that realize 
the possibilities of the theory of error-control codes, have become sufficiently 
cheap to produce. 

The subject of coding theory (and information theory) began in 1948 when 
Shannon (1948) published two papers on ‘‘A mathematical theory of communica- 
tion” in the Bell System Technical Journal. As an introduction and in order to 
establish some terminology, we explain the idea of block codes. Consider a source 
that produces information in the form of a very long sequence of zeros and ones. 
We partition this sequence into blocks of Jength & and consider these blocks as 
elements of F5. The “encoder” is an injection from F5 to F3. The image of an 
information block is a block of length n, called a codeword c, and the set of 
codewords is called the code C. A particularly simple situation occurs when the 
first kK bits of ¢ are the original information block. The remaining bits are the 
so-called redundancy. \t is clear why the number k/n is called the information rate 
of C. (In general, if C is a binary code of length n, then R:=n ‘ log,|C| is called 
the information rate of C.) Examples of channels are the glass fibre through which 
digitized telephone messages are communicated as light pulses, or the track on a 
compact disc. On this track the ones are stored as pits (produced by the 
write-laser). The zeros are so-called lands. (Actually, the situation is slightly more 
complicated but this suffices to understand the idea.) The reading is done with a 
weaker laser by measuring the reflection, thus distinguishing between pits and 

‘lands. Errors can be caused by dust, scratches, fingerprints, etc. These result in 
the interpretation of a 0 as a 1 or vice versa. In our theory, we consider the 
received word r as a sum c +e in F5, where e is the so-called error-vector. The 
model we use for the channel is what is known as the binary symmetric channel 
(BSC). Here, each transmitted bit has a probability p of being received in error. 
The number of errors in a received word is equal to the number of ones in the 
vector e, which is called the weight w(e) of e (in general, the weight of a vector is 
the number of nonzero coordinates). A code C is called a t-error-correcting code 
if, for every c@C and every e with w(e)<z, the sum c+e=r uniquely 
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determines e and c. We use the following terminology. If x and y are vectors with 
n coordinates (in any vector space), then their distance d(x, y) is the number of 
places where they differ. Therefore, d(x, y)= w(x — y). In decoding a received 
message r, One usually chooses a codeword ¢ such that d(r, c) is minimal. If C is a 
code, then the minimal value of d(x, y) for all pairs of distinct codewords is called 
the minimum distance of C. If the distance is 2e + 1, then the code is clearly 
e-error-correcting. 

We mention without proof the following quite sensational theorem due to 
Shannon (1948). Assume that we are using a BSC with error probability p and a 
code C of length n for which every codeword has the same probability of 
occurring in a message. Decoding of received messages r is done by replacing r by 
a codeword ¢ such that w(r—c) is minimal. Let P,. be the probability that a 
received codeword is interpreted incorrectly. 


Theorem 1.1 (Shannon, 1948). Let e>0. Jf O<R<1+p log p+(1—p) log(l — 
p), then there is an n and a code C of length n and rate R, such that Po<e 
(logarithm to base 2). 


For a proof see Van Lint (1983). The proof is an existence proof and coding 
theorists are still trying to actually construct “good” codes that can be efficiently 
implemented in practice. 

In this chapter we shall ignore large parts of coding theory such as encoding and 
decoding algorithms, convolutional codes, etc. After an initial period of develop- 
ment of the theory, coding theorists and combinatorialists became aware of each 
other’s existence. Furthermore, it turned out that ideas from each of these areas 
could be used fruitfully in the other. After some introductory material, this 
chapter concentrates on those parts of coding theory that are strongly related to 
combinatorics (see also Assmus and Mattson 1974). As we saw above, the 
question of error-correction is essentially a question of packing spheres in a 
metric vector space. Connected to this there is the combinatorically interesting 
covering problem. If C is a code, then the covering radius of C is defined to be 
the smallest number p such that the spheres of radius p around the codewords 
cover the whole space. We shall consider this parameter in section 7. 

In the References, we list several textbooks on coding theory. The major 
references for this chapter are Cameron and Van Lint (1991), Van Lint (1982), 
and MacWilliams and Sloane (1977). 


2. Linear codes 


We choose some field F, as alphabet. Let F”? be the vector space F”. A subset C 
of F” is called a k-dimensional linear code of length n or [n, k] code if C is a 
k-dimensional linear subspace of F’”. The rate of an [n, k] code is k/n. If C has 
minimum distance d, then C is called an [n,k,d] code. Since d(x, y) = d(x — 
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y, 0) = w(x — y), the minimum distance of a linear code is equal to the minimum 
weight of the code. 

A generator matrix G for a linear code C is a k by » matrix for which the rows 
are a basis of C, i.e., C= {aG|a€ F* gi G is in standard form if G = (1, P), where 
I, is the k by k identity matrix. In this case, the first k symbols of a codeword are 
called information symbols (these can be chosen arbitrarily). The remaining 
symbols, so-called ‘‘check symbols”, are determined by the information symbols 
(and P). 

An example is the [4, 2] ternary Hamming code with 


a 


In this example, all nonzero words have weight 3. 

Two codes C, and C, (not necessarily linear) are called equivalent if C, is 
obtained from C, by a permutation of the positions in codewords (e.g., 
interchanging the first and second letters in all the codewords). Every linear code 
is equivalent to a code with a generator matrix in standard form. In general, a 
code C (not necessarily linear) is called systematic on k positions if |C| =q* and 
there is exactly one codeword for every possible choice of coordinates on these k 
positions. Since one can separate information symbols and redundant symbols, 
these codes are also called separable. 

If a code C has length n and distance d, then deleting d — 1 symbols from every 
codeword yields a code of length n — d + 1 for which no two words are the same. 
Therefore, |C| <q” “*'. This is known as the Singleton bound. It can also be 
expressed as follows. If |C|=gq*, then d<n—k +1. In the case of equality, the 
code C is called maximum distance separable (MDS). Such a code is systematic on 
any k-tuple of positions. Instead of writing a separate section on MDS codes, we 
have included the relevant information in appropriate places; we refer the reader 
to (i) the observation preceding Definition 2.2, and (ii) section 5.4 on RS codes. 

If C is an [n, k] code, we deiine the dual code C* by: 


t= {yEF|Vecl(x, ¥) =O}, 


where (x, y):= ae , x,y; is the usual inner product. C* is also a linear code, 
namely an [n,n — k] code. If C=C", then C is called self- dual. lf CCC, then 
C is called self-orthogonal. A generator matrix H of C* is called a parity- “check 
. Matrix for the code C. We have 


={xEF” |xH" =0}. 


If G =(/,P) is a generator matrix for C in standard form, then H =(—P'I,_,) is 
a parity-check matrix for C. 

Since the generator matrix G of the binary [4,2] Hamming code, defined 
above, satisfies GG‘ =0, the code is self-dual. 

For any x € F”? we call the vector xH" the syndrome of x. One of the standard 
methods of decoding linear codes is the so-called syndrome decoding. First, note 


778 J.H. van Lint 


that if x =c +e, when c is a codeword and e the error-vector, then xH ' =eH'". 
The space F™ is divided into cosets of the code C, and for each coset a word of 
minimum weight in the coset is chosen as ‘coset leader”. Every word in a coset 
has the same syndrome as the coset leader. Decoding is done by subtracting the 
coset leader from the received word x. So, the receiver only needs a list of 
syndromes with the corresponding coset leaders. 

Let C be an [n, k] code with parity-check matrix H. A codeword x corresponds 
to a linear combination of the columns of H equal to the 0 column. Therefore, the 
minimum distance of C equals the smallest number of columns of H that are 
linearly dependent. This observation leads to one of the best known classes of 
linear codes. We first observe that the columns of the / by n matrix H (where 
l:=n-—k) can be interpreted as points in PG(J—1,q). If these points are 
different, then the code C* is called projective and by the observation above, the 
code C has minimum distance at least 3. Now we fix / and then maximize n, 
keeping C~ projective. 


Definition 2.1. A linear code over F, that has an / by  parity-check matrix [where 
n= (q' —1)/(q—1)], which has all the points of PG(/—1, q) as its columns, is 
called a Hamming code. 


As we saw before, a Hamming code has minimum distance 3. For any x € F” 
the set {yEF | d(x, y)<e} is called a sphere of radius e around x. For a 
Hamming code, the spheres of radius 1 around codewords are clearly disjoint. 
Each sphere contains 1+(q-1)n=q' words and there are g‘=q""' such 
spheres. Therefore, these spheres partition the space F“. Such a code is called a 
perfect code. In the case of an e-error-correcting perfect code, every word in F @ 
has distance < e to exactly one codeword. 

Now consider an [n, k] code C that is MDS. Since C is systematic on any k 
positions, any k columns of the generator matrix of C are linearly independent. 
Therefore, C* has minimum distance > k+1=n-—(n—k)+1, ie., the dual 
code is also MDS. Note that if we consider the columns of a generator matrix of 
an [n, k,n —k + 1] code as points in PG(k — 1, q), then no & of these points lie in 
a hyperplane, i.e., we have an n-arc in PG(k — 1, q). 


Definition 2.2. If C is a code of length over F,, we define the extended code C 
by: 


n+l 
C= (eps e20 2216p Cues) Crs Car os VEC, > c= 0}. 
i=1 


If C is linear with generator matrix G and parity-check matrix H, then C has a 
generator matrix G obtained from G by adding a column such that the sum of all - 
columns is 0, and a parity-check matrix 


A= 0 


0 
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If C is binary code with odd minimum distance d, then C has minimum distance 
d+ 1. (In an extended binary code all weights and all distances are even.) 


Let C be a code of length n over F,. 


Definition 2.3. Let A; =|C|~'-|{(@, y) |x EC, yEC, d(x, y) = i}}. Then (Ay, Aj, 
, A,,) is called the distance distribution of C and A(z) = ae ~o A,z’ is called the 
distance enumerator of C. 


If C is linear, then A, equals the number of words of weight i in C and then 
A(z) is usually called the weight enumerator. If we change the origin to another 
codeword (i.e., replace C by C — ¢ for some codeword c), the weight enumerator 
of a linear code does not change. Codes that have this property are said to be 
distance invariant. 

One of the most useful theorems in the theory of linear codes is MacWilliams’s 
Theorem (MacWilliams 1963). We shall see that it is crucial in the proof of 
Theorem 7.8; it can be used to show nonexistence of codes (nonintegral 
coefficients). It is used heavily in the nonexistence proof of the projective plane of 
order 10. 


Theorem 2.4. Let C be an [n, k] code over F, with weight enumerator A(z) and let 
B(z) be the weight enumerator of C~ _ Then 


1- 
Bz)=q *“+(q- vey"Alae pe) ' 


Proof. For a proof we refer to Van Lint (1982) or several of the textbooks 
mentioned in section 1. The essence of the proof is present in the proof for the 
binary case. We interpret words in F> as elements of Z”. Then for any « we have 


g(u):= > (—1) er") = a ac 7 ala Un yds 4 +Uy 


very Wyrries 
=|] [1+ (-1)"z)=(1-2)"(1 +2)" "™, 
i=1 


The result then follows from 


yy gu) = > 2" DY (-1y) 


vers uEC 


Ic| > 2” =|C|BQ), 


vect 


because the inner term is 0 if u@C~ and it is |C| forveEC*. O 


It is sometimes more convenient to replace A(z) by a function of two variables. 
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For an arbitrary binary code C we shall also call 
n 
A(t, y= 2 Ax 'y! (2.5) 
jz 


the distance enumerator of C and 
1 . 
A'(x, y) = A(1, 1) AQ +y,x-y) (2.6) 


the MacWilliams transform of A(x, y). 

Note that for any binary code C, the number of codewords equals A(1, 1) and 
A, = 1. If C is linear, then A’(x, y) is the distance enumerator of C*. If A(x, y) is 
any polynomial as in (2.5) with A, =1, then A’(x, y) = A(x, y). 

There are several ways of obtaining new codes from old ones (not necessarily 
linear). We have already mentioned extension. The inverse of this procedure is 
called puncturing. If one replaces a code C by a subset of C, this is called 
expurgating the code. For example, the even-weight subcode of an [n, k] code, 
which has words of odd weight, is the [n, k — 1] code obtained by deleting all 
odd-weight words. The inverse process is called augmenting a code (for example, 
by adding an extra row to the generator matrix of a linear code). If one takes all 
the codewords of C starting with a fixed symbol (say 0) and then deletes this 
symbol, the resulting code is called a shortened code. The inverse process, 
lengthening, is best illustrated by a linear code. Here one increases both the 
length and the dimension by 1. 

We mention one more method that is often used: the so-called (u, u + v)- 
construction. Here one forms all words of the form (wu, # + v), where u is from a 
code C, of length n and distance d,, and v is from a code C, of the same length 
and distance d,, The new code has length 2n, |C,|-|C,| words, and distance 
min{2d,, d,}. 


3. Bounds on codes 


One of the central problems in combinatorial coding theory is the study of the 
numbers A(n, d) defined below. We assume that a fixed alphabet of g symbols has 
been chosen. A code C is called an (n, M, d) code if it has M words of length n 
and minimum distance d. 


Definition 3.1. A(n,d):=max{M|an (n,M,d) code exists}. A code C with 
|C| = A(n, d) is called optimal. 


In this section we shall treat some of the important theorems concerning 
A(n, d) and some related problems. We restrict ourselves to q = 2. For the case of 
arbitrary ¢ and for more details we refer to Van Lint (1982) and MacWilliams and 
Sloane (1977). If the proof of a theorem is no more than an exercise, we omit it. 
It is difficult to compare the bounds since the results depend heavily on the choice 
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of n and d. Usually, bound 3.4 is better than 3.3 and nearly always, the best 
results are obtained from bound 3.16. Clearly, bound 3.10 is stronger than 3.4. 
All these bounds have an asymptotic version, obtained by taking d = 5n and then 
considering a(S) :=lim sup, _,,.n ‘log A(n, 5n) (see Van Lint 1982). 


Theorem 3.2. 
A(n, 21 — 1) = A(n + 1, 22). 

Theorem 3.3 (Singleton bound, see section 2). 
A(n,d) <2" “*!. 

Theorem 3.4 (Hamming bound). 


A(n, 2e + 1<2"/> (") 
i=0 
Theorem 3.5 (Gilbert-Varshamov bound). (A lower bound!) 


A(n, d) 22/S (") 


Proof. If C is optimal, then every word x EF 
codeword. O 


™ has distance less than d to some 


Remark. Suppose that C is linear and 


d-) 
n 
Ic D (") <2". 
izo S# 

Then there is a word x € C with distance = d to every codeword. If we adjoin x 
to a basis of C, we find a new linear code of larger dimension that also has 
minimum distance > d. Therefore, the lower bound 3.5 also applies if we restrict 
ourselves to linear codes! 


Theorem 3.6 (Plotkin bound). if d>4n, then A(n, d) <2d/(2d — n). 


Proof. If C has M words, consider the M by n matrix that has the codewords of C 
‘as its rows. Let the column sums of this matrix be m, (1<i=<n). The sum of all 
distances of pairs of distinct codewords is at least d('4). On the other hand this 
sum equals ae ,™,(M —m;), which is at most 1nM’. From this the inequality 
follows. O 


We give one example to show how to use Theorem 3.6. We wish to find a bound 
for A(13,5). By Theorem 3.2 this equals A(14,6). If we shorten a (14, M, 6) 
code three times, we obtain an (11, M',6) code with M’ =2~°- M. By Theorem 
3.6 we have M’<12. Therefore, A(13,5)=<96. (Actually A(13, 5) = 64.) 
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Theorem 3.7 (Griesmer bound). For a binary [n,k,d] code we have 
n>=Li [d/2']. 


Proof. Let G be a generator matrix of the code. We may assume that the first row 
of G is the vector ec = (11---100---0) of weight d. Every linear combination x of 
the remaining rows of G must have at least [d/2] ones in the last n — d positions 
because either x or x +e has at most [d/2] ones in the first d positions. The 
theorem follows by induction. O 


Definition 3.8. A(n, d, w) denotes the maximum number of codewords in a binary 
code of length n and minimum distance d for which all codewords have weight w. 


Theorem 3.9. 
A(n, 21-1, w) = A(n, 2, Od ee 


Proof. Consider a shortened code and use induction. 0 


Theorem 3.10 (Johnson bound). 


5 (0) lets) (aed 
etl 


A(n, 2e + 1) arty 


imo St 


Proof. If there are N,,, words in F ™) with distance e + 1 to the (n, M, d) code C, 
then 


uS (ema 
i=0 
Consider an arbitrary codeword c; w.l.o.g. e=0. C has at most A(n, d, d) words 


of weight d, and each ot these has distance e to (7) words of weight e+ 1. 
Therefore, at least 


(054) (atm aa 


words have distance e + 1 to ¢ and to C. By varying ¢, the same word at distance 
e+1 to C can be counted at most |n/(e + 1)] times. The result follows. O 


From Theorem 3.9 we have 


({)acn, a, a)<(7)[ SI. 


Combining this with Theorem 3.10 yields the original form of the Johnson bound 
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(Johnson 1962): 


(*) 


5 yet) <2". (3.11) 
e+1 


Note that if e+ 1 divides n +1, then (3.11) becomes the Hamming bound. If 
e=1 and n is even, then (3.11) yields A(n, 3) <2”/(n +2). For example, A(2! - 
2,3)<27 '?. Since equality holds for shortened Hamming codes, these codes 
are optimal. 


icy > (7) + 


Definition 3.12. A code C for which equality holds in (3.11) is called nearly 
perfect. 


From the proof of Theorem 3.10 we see that if C is nearly perfect, then every 
word at distance e + 1 to C has distance e + 1 to exactly |[n/(e + 1)] codewords. 
Similarly, a word at distance e to C has distance e or e + 1 to the same number of 
codewords. 

Many of the best known bounds for A(n, d) known at present are based on a 
method that was developed by Delsarte (1972). It is known as the linear 
programming bound. Again, we restrict ourselves to q = 2. We assume that n is 
fixed. (Also see chapter 15.) 


Definition 3.13. The Krawtchouk polynomial K,(x) is defined by: 
k 
{X\{N—-xXx 
cio-S-w()(t2). 
= BOW) j 
where 


(7) -2- be) ene 


j } 
Lemma 3.14. if x EF} has weight i, then 


y = KG). 
yEFS 
we(y)ok 
Proof. Trivial by counting. O 


Lemma 3.15. If C is a binary code of length n with distance distribution 
(Ay, A},--.,A,), then 


> A,K,(i)20, kE{0,1,...,n}. 
i=0 
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Proof. Let |C| = _M. Then 


n 


Md AKG a> > Ss (“1 


=O(x,yeC? EFF 
d{x,y)=i wu(z)=k 


2 
= > (> 1") >0. ol 
zeF3 sEC 
wt(zj=k 


From Lemma 3.15 and the fact that M = ae A,, we see that we can bound 
A(n, d) as follows. 


Theorem 3.16. 


A(n, d)< max| >, A,|4y=1,A,=Az= "= Ay, =0,A,20(dSi sn), 
i=0 
2 A/K,(i) 20 for kE (0,1,. ny}. 


4. Cyclic codes 


If C is a code of length n, then the subgroup of S, consisting of those 
permutations of the n positions that map all codewords into codewords is called 
the automorphism group Aut(C) of the code. Many of the most interesting codes 
have the cyclic group of order m as subgroup of Aut(C). 


Definition 4.1. A linear code C is called cyclic if a cyclic shift of a codeword is 
always again a codeword, i.e., 


Vicg. Cpe, c, eclC,~1» Cos siaugtaa) SC]: 


In the theory of cyclic codes, one usually makes the convention (n, q)=1. In 
the following, this is assumed. 

In order to introduce more algebraic tools, we use the following isomorphism 
between F? and the additive group of the ring F,[x]/(x” — 1): 

(@9,4,,...,@,_,)@ayta,x+---+a,_,x" 

From Definition 4.1 it immediately follows that a cyclic code corresponds to an 
ideal in F,[x]/(x"—1). Since this is a principal ideal ring, a cyclic code is a 
principal ideal generated by a polynomial g(x) that divides x” — 1. This is called 
the generator polynomial. (We observe that replacing x” — 1 by x” + 1 leads to a 
similar theory. The corresponding ae are called negacyclic.) 

In the following, x" — 1=f,(x)f,(x)--- f(x) will be the factorization of x” — 1 
into its irreducible factors [no repeated factors because (n, q) = 1]. So, there are 
2’ possible cyclic codes of length n (where some may be equivalent). Clearly the 


Codes 785 


dimension of the cyclic code C generated by g(x) is n-degree g(x). If x" -1= 
g(x)h(x), then A(x) is called the check polynomial of C. If 


a(x) =Bo texte +g, x" * 


and 
A(x) =hy thyx tes thyx*, 
then 
&o &1 Sn~k 0 0 0 
0 80 8n-k 0 0 
G- 
0 80 8n-k 
and 
0 0 h, h, No 
0 0 h, h, 90 
H= 
0 
h, ae h, 0 --+ 0 


are a generator matrix, respectively parity-check matrix for C. Note that the code 
generated by A(x) is equivalent to the dual of C (symbols in reversed order). 

Consider the special case of a cyclic code generated by one of the irreducible 
factors f(x). This code, denoted by M;’, is called a maximal cyclic code. The code 
generated by (x” — 1)/f,(x), denoted by M , iS a minimal cyclic code, also called 
irreducible cyclic code. Clearly an irreducible cyclic code has no zero divisors and 
is therefore a field (isomorphic to F,x, where k = deg f(x) = dim M; ). Suppose 
that n =2* -1, q=2. Then M; consists of 0 and the 7 cyclic shifts ‘of the word 
g(x) = x" - Vif (x). Therefore, all the words of M; except 0 have the same 
weight. This weight must be 2“~' (because the sum of all the weights is n- jae 
This is an example of a so-called equidistant code (any two distinct words have the 
same distance). 

Note that any cyclic code C is the direct sum of the irreducible cyclic codes M; 
that are subcodes of C. This is an example of orthogonal decomposition (of an 
algebra) since the product of a word from M; and a word from M, (i #/) is zero. 

It is often useful to have the following alternative description of an irreducible 
cyclic code. 


Theorem 4.2. Let k be the multiplicative order of p mod n, q =p“, and let B bea 
primitive nth root of unity in F,. Then the set 


V= {e(€) =(Tr(€), Tr(€B),..., Tr(€8""") |E EF} 


786 J.H. van Lint 


is an |n, k) irreducible cyclic code over F,. Here 


Tr(a)=ata?+a? t---+ar 


is the trace mapping from F, to F,. 
Proof. Since Tr is a linear mapping, V is linear. Since e(&B “ty is a cyclic shift of 
c(€) the code is cyclic. Since B is in no subfield of F,, we know that 8 is a zero of 
an irreducible cyclic polynomial h(x) =h,+h,x+---+h,x“ of degree k. If 
c(€) = (Co, €1,--+5€,-1), then 


> ¢)h, = Tr( gh(B)) = Tr(0) = 0. 


It follows that x*h(x~') is the check polynomial for V, i.e., V is an irreducible 
cyclic code. O 


Now, let 8, be a zero of f(x) in some extension field of F,. The polynomial f(x) 
is determined by one of its zeros. Let R be a set that contains at least one zero of 
each of the factors f(x) of the generator of a cyclic code C. Then 


C= {e(x) € F,[x}/(” — 1) |Vecale(é) = 0). 


We say that C is defined by the zero-set R. R is complete if it consists of all the 
zeros of g(x). For the code M; we can take R = {6;}. 


Example 4.3. Let n =(q”™ — 1)/(q —1) and let 8 be a primitive nth root of unity 
in Fj. Furthermore, let (m,q—1)=1. The cyclic code 


C = {c(x)|c(B) = 0} 


is an [n,n — m] Hamming code over F,. 


In order to see this, consider a matrix H for which the columns are the 
representation of 1,8, B’,...,8" ' as vectors in F7. By definition, every 
codeword (cy,¢,,...,C,-,) has inner product 0 with every row of H. So H isa 
parity-check matrix for C. Since (n, q—1)=(m,q-—1)=1, the columns of H 
represent all the points of PG(m — 1, g). Therefore, C is a Hamming code. 

Note that if R contains more than one element, the same idea can be used to 
construct a ‘‘parity-check” matrix for the cyclic code (but it may have more rows 
than necessary). 

Every cyclic code has an idempotent element that generates the code. This is a 
consequence of the following theorem. 


Theorem 4.4. Let C be a cyclic code. Then there is a unique codeword that is an 
identity element for C. 
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Proof. Let x” — 1 = g(x)A(x), where g(x) generates C. Since (g(x), A(x)) = 1, there 
are polynomials a(x) and d(x) such that 


a(x)g(x) + b&)AQx) = 1 in F,[x]. 


Let c(x) = a(x)g(x) = 1 — b(x)h(). Clearly, c(x) is a codeword and since any 
codeword is a multiple of g(x), we see that c(x) is a unit for C (and therefore 
unique). O 


Definition 4.5. The idempotent of an irreducible cyclic code M, is called a 
primitive idempotent and denoted by 6,(x). 


Now, Suppose g = 2. Let @ be a primitive nth root of unity in some extension 
field of F,. If a" is a zero of f(x), then the zeros of f(x) are the elements a’, 
where b runs through the cyclotomic coset {a,2a,4a,...,2° 'a} and s is the 
minimal exponent such that (2°—1)a=0 (mod n). If (x) is an idempotent of 
some cyclic code, then c(a’) = 0 or 1. If c(x) is a primitive idempotent, then there 
is an irreducible factor f(x) of x” — 1 such that c(a’) = 1 if and only if f(a’) =0. 
Therefore, the nonzeros of c(x) are a”, where 6 runs through some cyclotomic 
coset. This information along with the following theorem gives us an easy way to 
construct and describe all cyclic codes. 


Theorem 4.6. If C, and C, are binary cyclic codes with idempotents c,(x) and 
C2(x), then C, VC, has idempotent c,(x)c,(x) and C, + C, has idempotent c,(x) + 
Co(x) + ¢,(x)c,(x). For the primitive idempotents we have 0,(x)0,(x) =0 if ij. 


The proof is an easy exercise. 


We introduce one more tool that is often useful when treating cyclic codes. It is 
a discrete analogue of the Fourier transform. In coding theory it is usually 
teferred to as the Mattson—Solomon polynomial. If F, is our alphabet, let B be a 
primitive mth root of unity in the extension field F “of F,. Let T be the set of 
polynomials over F of degree at most n — 1. We define a transform ®:T-T as 
follows. Let a(x) € T. Then A(X) = (@a)(X) is defined by: 


A(X) = 2 a(B/)x"™. (4.7) 
y=1 
If a=(ay,a,,...,a,_,) is a codeword in a cyclic code, we identify it with 


n-l 


a(x) =a, +a,x+---+a,_,x""" as usual, and call A(X) the Mattson—Solomon 
polynomial of a. 


Theorem 4.8. The inverse of ® is given by 


a(x) =n" \(PA)(x') (mod x” - 1). 
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Proof. 
noa-l na-l fn 
A(B*) => & a8 B “=> a, Bo =n. 
j=l i=0 i=0 0 j=l 


5. Important classes of linear codes 


We treat several classes of linear codes. The reader who wishes to compare these 
codes should consult table 1 at the end of this section. 


5.1. Lexicodes 


We define a lexicographically least binary code (lexicode) with distance d as 
follows. The word length depends on the number of codewords and is chosen such 
that there is no position where all codewords have a 0. Start with c,=0 and 
c,=(1,1,...,1,0,0,..., 0) of weight d. If ¢),¢,,...,¢€,_,; have been chosen as 
codewords, then c, is defined to be the first word in the lexicographic ordering 
with 1’s as far to the left as possible, such that d(e,,c,)=d for O<i</-1. (At 
each step the length may have to be adapted.) To show that one obtains linear 
codes in this way is somewhat tricky (cf. Van Lint 1982). The idea is to prove the 
following assertion by induction on k: “After 2“ codewords have been chosen, a 
lexicode is linear and during the process of choosing codewords the word length 
increased exactly after 2' steps for i<k.” 

Consider the special case d =3. Let C, be the linear code that is obtained after 
2* codewords have been chosen and let n, be the length. If C, is not a perfect 
code, then there is a word x of length n, that has distance 2 to C,. Therefore, 
(x, 1) is a possible choice for c,.. Therefore, we know that 2,,, =", +1. If, on 
the other hand, C, is perfect, we must have c,.=(1,0,0,...,0,1,1) and 
ny, =", +2. It is now possible to prove by induction on a: n, =2°+i for 
k=it+(2*-a-1) and 1<i<2°. This shows that the Hamming codes are 
lexicodes with d = 3 (take i=2°— 1). 

For applications to combinatorial games see chapter 43. 


5.2. Hamming codes 


These codes, introduced in Definition 2.1, turned out to be cyclic (in many cases) 
in Example 4.3 and appeared as lexicodes above. Since the codes are perfect, they 
are also the optimal solutions to the following combinatorial covering problem. 
Find a set C of minimal cardinality in £7} such that every word x has distance < e 
to some element of C. A well-known example is the so-called football-pool 
problem. Here, one has to forecast the outcome (win, lose, draw) of n football 
matches. The question is: how many forecasts of the results are necessary to 
guarantee winning at least the second prize (one result may be incorrect). This is 
the covering problem mentioned above with g =3 and e=1. If n= 403! -lj)a 
Hamming code provides an optimal solution. For more about football-pool 
problems see Blokhuis and Lam (1984), and Wille (1987). 
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5.3. BCH codes 


One of the most widely studied classes of cyclic codes is the class of so-called 
BCH codes. The codes were introduced by Bose and Ray-Chaudhuri (1960), and 
Hocquenghem (1959). Their practical importance is due to the fact that there is 
an efficient decoding algorithm for these codes (when used on a BSC, see section 


1). 


Definition 5.1. A cyclic code of length n over F, is called a BCH code of designed 
distance 6 if, for a suitable choice of a primitive nth root of unity 8, the complete 
zero-set of the code contains B', et ies a for some J. If 7= 1, the code is 
called a narrow-sense BCH code. If n =q”™ ~1 (i.e., B is a primitive element of 
Fm) the BCH code is called primitive. 


(Note that in Example 4.3 87~' is also primitive and the zero-set contains B 
and B%, i.e., consecutive powers of B‘~'. So, the designed distance is at least 3. 
In fact 3 is the distance of the code.) 

The expression “designed distance”’ is explained by the following theorem. 


Theorem 5.2 (BCH bound). The minimum distance of a BCH code with designed 
distance & is at least 6. 


Proof. Let C be the code of Definition 5.1 and let /=1. For any codeword a the 
Mattson—Solomon polynomial (4.7) has degree < mn — 6, and hence by Theorem 
4.8 a, =0 for a most 2 ~— 6 values of k. Since the only zeros of A(X) that play a 
role in this argument are nth roots of unity, working mod X” — 1 does not alter 
the argument, i.e., taking /=1 was no restriction. O 


For generalizations of this theorem, see Van Lint and Wilson (1986). By 
definition, a BCH code has the cyclic group of order n as subgroup of its 
automorphism group. We can show a lot more. 


Theorem 5.3. Every extended primitive BCH code of lengthn + 1 = q™ over F, has 
AGL(1, q”) as a group of automorphisms. 


Proof. Let C be the BCH code and (cy,¢,,...,¢,-,) a codeword. We identify 

the position of c, with the field element a’ in F,» and the position of c, with the 

zero element of Fn. Let P,,, be the element of AGL(1, q”) defined by: 
Pyy(X)=uX +0, “EF mn, VE Fm, UFO. 


Suppose P,,,, maps (cy,¢,,..-,¢€,) into (cg,cj,--.,¢,)- Then forOsk<d~—1, 
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we have 
> c(a*)' = z cua’ +v)* = bs C, > « ulate 1 


k 
= (ce c(a'y. 


Here the inner sum is 0 for O</<d —1. If />d, then />& and then (4) =0. It 


follows that the permuted codeword isin C. O 
Corollary 5.4. The minimum weight of a primitive binary BCH code is odd. 


Proof. Aut(C) is transitive on the positions. Therefore, there are codewords of 
minimum weight with a 0 in the check position. O 


5.4. Reed-Solomon codes 


The special case of Definition 5.1 in which n = q —1 is called a Reed-Solomon 
code (RS code). Usually one takes /=1. By Theorem 5.2 the minimum distance 
of the code is at least 6 =n —k + 1, where k is the dimension of the code. By the 
Singleton bound, the distance cannot be larger. Therefore, an RS code is MDS. 
Note that the dual of an RS code is again an RS code. If g is even, then the dual 
of the RS code with distance 6=4gq has the same zeros as the code and 
furthermore it has 1 as a zero. Therefore, the extension of this code is self-dual. 
Another interesting property of an RS code is that its extension is again an MDS 
code as the following theorem states. These codes are widely used (e.g., in the 
compact disc audio system) because there is a very fast decoding algorithm. 


Theorem 5.5. Let C be an [n,k,d] narrow-sense RS code over F,. Then C is an 
[n +1,k,d+1] code. 


Proof. By definition, x = q— 1 and C is generated by TI]! (x - a’), where @ is 
primitive in F,; k =n —d +1. If c(x) is a codeword, then the parity-check symbol 
that is added j is —c(1). If this is 0, then c(x) is in fact a multiple of IT. 9 (*%—-a’) 
and therefore has weight at least d+1 by Theorem 5.2. O 


The same argument works if we use —c(a“) as parity-check symbol, and by 
using both of them we find the following result. 


Theorem 5.6. An [n,k,d] narrow-sense RS code can be extended twice to an 
[1 +2,k,d + 2] code. 


Actually, it is possible to find cyclic codes with the same parameters (unless k is 
even ands q is odd). In order to see this, observe that if a is primitive in F, 2, then 
B =a" is a primitive (g + 1)th root of unity and then B’ and =p “are the 
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zeros of a polynomial of degree 2 in F,[x]. Therefore, it is possible to find a 
polynomial of degree g — k + 1 in this way, which has consecutive powers of 8 as 
its zeros (unless k is even and q is odd). 


Theorem 5.7. If 1<k <q +1, then there exists a cyclic [q+ 1,k, q—k +2} code 
[if q(k — 1) is even]. 


If g is odd and k is even, then one can replace cyclic by negacyclic in Theorem 
5.7. 

We consider a special example, namely the [8, 4, 5] extended RS code C over 
F,. AS we saw above, C is self-dual. For F, we take a primitive element a such 
that ait+a?tl ~ Then the normal basis (B;, Bz, 83) = (a, a, a*) has the 
property Tr(8,8,) = 6,,. If we represent each element of F, as a triple i in F3 using 
this basis, then C is nspped into a [24, 12] binary code G. It is easily seen that the 
special property of the basis ensures that G is also self-dual. By inspection of the 
generator of C, one also sees that all basis vectors of G have weight 8. Therefore, 
all codewords of G have weight divisible by 4 and since C has distance 5, it 
follows that G is a [24, 12,8] code. Therefore, G is the (unique) extended binary 
Golay code (see chapter 14). This construction is due to Pasquier (1980). 

The original representation (and encoding) of RS codes was as follows. A 
sequence of k information symbols ay,a,,...,@,., is used to define the 
polynomial: 


a(x) =a, +a,xt---+a,_,x*7! 


The corresponding codeword (cy,¢,,..-,¢,-1) has c;=a(f'), where B is 
primitive in F, and n = q — 1. Since the polynomial a(x) has at most k — 1 zeros, ¢ 
has weight at least n —k +1 (if c #0). To see that this defines the same code as 
above we set 


c(x)= cg tex +--+ te,_.x"7! 


and then the same proof as for Theorem 4.8 shows that c(8’) =0 for 1<j<d= 
n—k+1. The extended code has the extra symbol a(0). 


5.5. Reed-Muller codes 
Consider the points of the affine geometry AG(m, 2) as column vectors in F 2. We 
denote the standard basis by u,,u,,...,4,,-,- If an &,,2' (0<j <2”) is the 


. . . m-1 m 
binary representation of j, then we define x, = vo &4;. Letn = 2”. Then the m 


by n matrix E with x, as jth column represents the points of AG(m, 2). We define: 
A, = {x,; © AG(m, 2)| é,,=1}, ahyperplane , (5.8) 


v, =the ith row of E, i.e., the characteristic function of A,, 
considered as vector in F% . (5.9) 
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As usual we write 1 for the characteristic function of the space AG(mm, 2). 
If a=(a,,4,,...,4,-,) and b=(by, b,,...,6,_,) are words in F3, then we 
define: 


ab = (aybo,4,b,,...,4,_15,-1)- (5.10) 


Note that if i,,i,,...,é, are different, then uv, v,---v, is the characteristic 
function of the (m—s)-flat A, OA, 9°°°0A; Therefore, the weight of this 
word is a *. The characteristic function of the set {x,}, i.e., the jth basis vector 
of F” 


m-1 
e= IT @+0+é,)0), 
so the products u,v, ---v;,, (0<s<m) form a basis of F™. 
Definition 5.11. Let 0<r<m. The linear code of length n=2”™ that has the 
products vu, v,,---v, with s<r factors as basis is called the rth order binary 
Reed-Muller code [RM code; notation R(r, m)]. 
The special case R(0, m) is the repetition code. 
Theorem 5.12. R(r, m) has minimum distance 2”~". 
Proof. From the definition one immediately sees that R(r + 1, m+ 1) is obtained 
from R(r + 1, m) and R(r, m) by the (u, w+ v) construction of section 2 (consider 
products without, respectively with, the factor v,,_,). 0 
Theorem 5.13. The dual of R(r,m) is Rm —r—1,m). 
Proof. The dimensions of the two codes have sum 2”. If @ is a basis vector of 
R(r, m) and 6 a basis vector of R(m —r—1,m), then by Definition 5.11 ab is the 
product of at most mm — 1 factors v,; and hence the characteristic function of a flat 
of dimension at least 1, i.e., a set with an even number of points. Therefore, 
(a,b) =0. O 
Corollary 5.14. R(m — 2, m) is the [n,n —m— 1] extended Hamming code. 


Our definition allows us to consider elements of AGL(m, 2) as permutations of 
the positions of codewords in Reed-Muller codes. 


Theorem 5.15. AGL(m, 2) C Aut(R(;, 7)). 
Proof. Let a be the characteristic function of an /-flat A in AG(m, 2) and let v be 


a basis vector of the code R(/ — 1, m), i.e., v is the characteristic function of a flat 
V with dimension = m-—1+1. Therefore, VM A is either empty or it has 
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dimension at least 1. In both cases |V N A| is even and therefore (v,a) = 0. By 
Theorem 5.13 a is in R(m —/, m). So, a word is in R(7, m) if and only if it is a sum 
of characteristic functions of affine subspaces of dimension = m—r. Since 
AGL(m, 2) maps k-flats into k-flats (for every k) the proof is complete. O 


We now reorder the elements of AG(m, 2) as follows. Let a be a primitive 
element of F,.. We form the matrix E* with as columns the representations of 
1,a,a7,...,@" ' as elements of F;’. Finally, we add the 0 column to obtain a 
permutation of E. A cyclic shift of the columns of E* corresponds to multiplica- 
tion of elements of Fj,» by a. Since this is a linear mapping, our geometric 
description of Reed-Muller codes shows that, with this ordering of the coordi- 


nates, they are extensions of cyclic codes. 
Theorem 5.16. The codes R(r,m) are equivalent to extended cyclic codes. 


For more information about the connection between cyclic codes and finite 
geometries see Berlekamp (1984), MacWilliams and Sloane (1977), and Van Lint 
(1982). 


Remark. Consider the Hamamard matrix H of order n=2” obtained from 
(1 _!) by repeatedly applying the Kronecker product (see chapter 14). In the 
rows of H and —H we replace +1 by 0 and —1 by 1. We thus obtain the words of 
R(1, m). 


5.6. Quadratic residue codes 


In this subsection we restrict ourselves to binary codes. Much of the following is 
also valid over other alphabets (with slight modifications). 

Let n be a prime =+1 (mod 8). This ensures that 2 is a quadratic residue 
mod n. Let Ry and R, denote the quadratic residues mod and the nonresidues, 
Tespectively. We define A(x) = Le r,* - If @ is a primitive nth root of unity in an 
extension field of F, and i€ Ry, then 6(a')” = 6(a'), i.e., O(a‘) =0 or 1. In the 
Same way we see that O(a’) + O(a) = 1 if jE R,. We choose a@ in such a way that 
1 =0. Since 2ER, the polynomials galt) = i eee (x-—a@’) and g,(x)= 

rer, %~ @") are in F,[x] and x” — 1 = (x — 1)g,(x)g, (x). 


Definition 5.17. The binary cyclic codes of length m with generator g,(x), 
respectively (x — 1)g)(x), are both called quadratic residue codes (QR codes), The 
extended QR code is obtained by adding a parity check to the code with 
generator g,(x). 


Since @(x) is an idempotent polynomial with the same zeros as g,(x) if m=—1 
(mod 8), respectively as (x — 1)g,(x) if n= 1 (mod 8), it is the idempotent of the 
corresponding code. We now form a matrix G as follows: take all cyclic shifts of 
the vector 6, add a column of zeros, respectively ones, if »=1 (mod 8), 
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respectively n = —1 (mod 8), and finally add a row of ones. From the facts above, 
it follows that the rows of G span the extended QR code of length n +1. 
Replacing 0 by —1 yields a Hadamard matrix of Paley type (see chapter 14) for 
which it is known that it is invariant under the permutations of PSL(2, n). This 
proves the following theorem for which a direct proof can be found in MacWil- 
liams and Sloane (1977) or Van Lint (1971). 


Theorem 5.18. The automorphism group of the extended binary QR code contains 
PSL(2, 7). 


Note that since PSL(2, n) is transitive, the minimum weight of a binary QR 
code with generator g,(x) is odd. 


Theorem 5.19. The binary code with generator g,(x) has minimum distance d 
satisfying d? >n. 


Proof. Let c(x) be a codeword of weight d. Replacing x by x’ with j ER, yields a 
word ¢(x) of the same weight in the code with generator g,(x). Therefore, c(x)é(x) 
is a multiple of g,(x)g,(x) but not of (x —1) because d is odd. This implies 
c(x)é(x) =1+x+---+x""'. Since the product contains at most d? terms, the 
proof is complete. O 


Remark. In the case n= —1 (mod 8) we can take j = —1 in this proof. Then in 
the product c(x)¢(x) terms cancel four at a time and there are d terms equal to 1. 
So n=d’—d+1-—4a for some a. 


Example. Take n= 23. We find a [23,12] code with d=7. By the Hamming 
bound, d cannot be larger than 7 so d=7. This is the binary Golay code (see 
chapter 14). 


5.7. Symmetry codes over F, 


The following class of ternary codes was introduced by Pless (1972). Let C be a 
symmetric or skew conference matrix (cf. chapter 15) of order m=0 (mod 6). 


Definition 5.20. A symmetry code Sym(2m) is a [2m,m] ternary code with 
generator matrix G = (IC). 


(If m — 1 is a power of a prime, then we take a Paley matrix for C; see chapter 
14.) Since CC’ = —1,, (over F,) the code Sym(2m) is self-dual. 


Example. If we take m = 6 we find a ternary [12, 6] code. Since it is self-dual it 
has minimum distance divisible by 3. A trivial analysis of G shows that d =3 is 
impossible. So Sym(12) is a [12,6,6] code. Puncturing yields a perfect code 
known as the ternary Golay code (see chapter 14). For small values of m it is 
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Table 1 
Name q Length n Dimension k Distance d Remarks 
Hamming q (9 -)I)Ag@-1) n-l 3 Perfect 
Primitive BCH q q'-1 =n—mé 6<d<2%-1 
Reed-Solomon q q-1 k n—-k+1 MDS 
Reed-Muller Zz 3" > (7) Plas 
2-0 

Quadratic 2 prime = +1 i(n+1) a>n Includes binary 

residue (mod 8) Golay 
Symmetry 3 2m =0 m often large Includes ternary 

(mod 12) Golay 


relatively easy to find the minimum distance of Sym(2m) by analysing G, e.g., 
d= 12 for m=18. 

Table 1 shows the information on the parameters of the codes treated in this 
section. 


6. Some nonlinear codes 


In this section we mention a few classes of nonlinear codes that are combinatori- 
cally interesting. 


6.1. Hadamard codes 


Let H, be a Hadamard matrix of order n (cf. chapter 14). In H, and in —H, we 
teplace —1 by 0. In this way we find 2n rows which we consider as words in F3. 
Since any two rows differ in 3n positions or in all 7 positions, these words form 
an (n,2n, 4n) binary code. From the Plotkin bound 3.6 one easily finds that 
A(4m, 2m) < 8m. Therefore, every Hadamard matrix of order 47m thus yields an 
optimal code of length 4m. These codes are known as Hadamard codes. The same 
name is used for the (x — 1, 2n, 4-1) punctured codes and the (n —1,n, 3n) 
shortened codes. 


6.2. Conference matrix codes 


Let C be a symmetric conference matrix of order n (n =2 (mod 4), cf. chapter 
15). We assume C is normalized. By deleting the first row and column of C we 
obtain the matrix S (e.g., a Paley matrix). We now form a nonlinear code by 
taking as codewords the rows of 4(S +/+ J) and $(-S +/+ J). In this way we 
obtain an (n — 1, 2n, $(n — 2)) code. 


Example. Starting from the Paley matrix of order 9, we find a binary (9, 20, 4) 
code. Note that an interesting modification of the method of Theorem 3.16 shows 
that this code is optimal. Using Theorem 3.16 for n = 8, d=3, in combination 
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with the obvious inequalities A, <1, A, + 4A, <12, yields A(8, 3) <21. But if M 
is odd, then in the proof of Lemma 3.15 the final inequality can be strengthened 
and then it follows that M < 20. 


6.3. Kerdock codes 


Consider the binary Reed-Muller codes of Definition 5.11 for r= 1,2. If L(x) is 
the form ae a,x, + € (where « =0 or 1), then the codeword pa av, + €1 in 
R(1, m) is the characteristic function of the set {x € AG(m, 2)| L(x) = 1}. If we 
replace L(x) by Q(x) + L(x), where Q(x) = eee q,;X;X; is a quadratic form, 
then we obtain a word in R(2, m). Clearly, R(2, m) is a union of cosets of R(1, m), 
where the coset Q + R(1, m) is obtained by fixing Q and letting L run over all 
forms of degree 1. If m is even and Q is nonsingular, then an affine transforma- 
tion of coordinates in AG(m, 2) puts Q into the standard form x,x, + x3x,+-°-° 
+X,,-1Xm- Then it is easy to show that the distance of a word in R(1,m) and a 
word in Q + R(1, m) is at least 2” '~-2”/?"'. To construct a large code that is a 
union of cosets of R(1, m) and that has this minimum distance, one needs a set of 
quadratic forms Q; such that Q; — Q, is nonsingular if i #j. Clearly, such a set 
cannot have more than 2” ' elements {=the number of choices for 


(9y2> 9132--- > Fim) I- 


Definition 6.1. A Kerdock set is a set of 2”~' quadratic forms in x,,%2,... Xm 
such that the difference of any two distinct elements is a nonsingular quadratic 
form. 


Such sets were first constructed by Kerdock (1972). A construction of Kerdock 
sets depending on a strong connection between such sets, spreads on quadrics 
(orthogonal spreads) and translation planes (cf. chapter 13) was given by Kantor 
(1982). Also see Van Lint (1983) and MacWilliams and Sloane (1977). 


Definition 6.2. A Kerdock code is a (2”,2°7”,2” '—2™’?~') nonlinear code that 
is the union of 2”~' cosets Q, + R(1,m) of the first-order Reed-Muller code, 
where Q, runs through the quadratic forms of a Kerdock set. 


6.4. The Nordstrom—Robinson code 


Consider a representation of the (24, 12, 8] extended Golay code for which the 
word with eight ones followed by sixteen zeros is a codeword. There are 32 
codewords with x, =x, =---=x, =0, and similarly there are 32 codewords with 
x,=X,=1, x,=0 for 1<j<7 and j ¥i (1<i<7). Consider the union of these 
sets and subsequently delete the first eight positions. Clearly what remains is a 
(16, 256, 6) code. This nonlinear code is known as the Nordstrom—Robinson 
code. It can be shown that this code is unique (Snover 1973). Therefore, this code 
is the unique example of Definition 6.2 with m = 4. 
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6.5. Preparata codes 


Let m be odd (m 23), n =2” — 1. We describe a code P of length 2”*' =2n +2 
as follows. Number the elements of F,,. in some fixed order. If X and Y are 
subsets of Fy», we use the notation (X,Y) for the word in F°"*” that is the 
concatenation of the characteristic functions of the sets X and Y. 


Definition 6.3. The extended Preparata code P of length 2”*' consists of all words 
(X, Y) for which 
(i) |X| is evens |Y| is even, 

(8) Brent = Byer ms 

(iii) Diver ® + ( xex% Licey y. 

Clearly, a set X satisfying (i) can be chosen in 2” ways. Let M be a 2m by n 
(0, 1)-matrix in which the ith column is 
(") 
ay’ 
where these elements are represented as column vectors in (F,)” and a, runs 
through the nonzero elements of F,,,. If X is given, then Y\{0} can be found from 
(ii) and (iii) by solving a system of m linear equations of the form M y = c [where 
c is found form the left-hand sides of (ii) and (iii)]. These have full rank. 
Therefore, there are 2”"”” solutions for Y if X is given. Therefore, P has 27"~7" 
codewords. If (X,, Y,) and (X,, ¥Y,) are two codewords with X, = X,, then their 
distance is at least 6 by the BCH bound 5.2. The assumption |X,4X,| =|Y,AY,| = 
2 easily leads to an equation of degree three that has no solutions. It follows that 


P has distance 6. So, the Preparata code P is a (2”°'—1,2*,5) code with 
k=2"*!_ Am — 2. 


Remark. The distance distribution of the code P of length 2”*' is the MacWil- 
liams transform [cf. (2.6)] of the distance distribution of the Kerdock code of the 
same length. Since the codes are not linear, this seems to be a coincidence. 

An explanation of this fact was given in 1992 by A.R. Calderbank, R. 
Hammons, P. Vijay Kumar, N.J.A. Sloane, and P. Solé (see Calderbank et al. 
1993, and Hammons et al. 1994), 

They consider group codes over Z,. For these codes duality is defined in the 
usual way. Next, a so-called symmetrized weight enumerator is introduced which 
keeps track of the number of coordinates 0, 2, and 1 or 3 (the last two considered 
equivalent). They show that for a code C and its dual C~ a generalization of 
Theorem 2.4 holds. Codes of length nm over Z, are mapped to binary codes of 
length 2n by the mapping ¢ defined by 


$(0)=00, (1)=01, (2)=11, 4(3)=10. 
(Note that 1 and 3 lead to the same weight, namely 1). 
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Finally, certain extended cyclic codes and their duals (all over Z,) are 
introduced. It turns out that the mapping ¢ maps these codes into Kerdock codes 
and Preparata codes. A corresponding mapping sends the symmetrized weight 
enumerators of the codes over Z, to the weight enumerators of the corresponding 
binary codes and one finds that indeed, the MacWilliams transform of the weight 
enumerator of a Kerdock code is the weight enumerator of the Preparata code of 
the same length. 


7. Codes and designs 


Many combinatorial designs (e.g., designs) have a strong connection to coding 
theory, in fact several of these designs were found by using appropriate codes. 
Quite an interesting theory of these connections has been developed, much of it 
due to Delsarte (1973b). We treat part of this theory in this section. 

Let C be any binary code of length m with distance enumerator A(x, y). If 


A'(x, y) = yan a y rY 


is the MacWilliams transform of A(x, y) [cf. (2.6)], then 


Ane aus Se (Ne “i= ep Akl (7.1) 


(cf. Definition 3.13). So, by Lemma 3.15 the numbers Aj are nonnegative. 
Clearly, Aj = 1. 


Definition 7.2. The dual distance d’ of a code C is defined by: 
d’:=min{i=1|A‘#0}. 


Note that if C is linear, then d’ is the minimum distance of C*. We can now 
show a relation between binary codes and orthogonal arrays (cf. chapter 14). 


Theorem 7.3. Let C be a binary code of length n with |C| = M. Let [C] be the M 
by n array with the codewords of C as rows. Then if r<d', any set of r columns of 
[C] contains each r-tuple exactly M/2' times. 


Proof. Since A;,=0 for 1<k<d’, we have from (7.1) 
> (-1)*) =0 for every zEF3 with wt(z)=k. 


xEC 
Taking wt(z) = 1, we see that every column of [C] must have M/2 ones and M/2 
zeros. Then taking wt(z) = 2, we conclude that every pair of columns must contain 


each of the four possible pairs exactly M/4 times. Proceeding by induction the 
resuit follows. O 
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Therefore, (C] is an orthogonal array of strength d'—1. From the distance 
distribution (A,,A,,...,A,) of a code C and the MacWilliams transform 
(Aj, Aj,---;An), we find the four so-called fundamental parameters of C (cf. 
Delsarte 1973a): the minimum distance d, the number s of distinct nonzero 
distances between codewords (i.e., besides Ag, there are s nonzero numbers A,), 
the dual distance d’, and the number s’ of nonzero coefficients Aj with 1<i<n, 
The number s’ is called the external distance of C. 

We wish to show some combinatorial significance of the external distance. To 
do this we introduce the set N:= {i| A‘{#0} and the characteristic polynomial 


F(x) rape (1 -2) 


of the code. We cail 
Fo= py a, K(x) 
k=0 
the Krawtchouk expansion of F(x). 


Theorem 7.4. Let V=F3, x EV. Denote by B(x, k) the number of codewords at 
distance k from x. Then 


> a, Bx, k) =1, 
k=0 
and hence the covering radius of C is at most s'. 


Proof. To prove the assertion, we manipulate in the group algebra A = CV with * 
as multiplication. Any subset § of V is identified with the element yes s of A. 
Let Y, denote the set of words of weight k. The assertion of the theorem then can 
be stated as 
C*>, a,Y,=V. 
k=0 

To prove this assertion, we first observe that V is the only element in A with the 
property 


0 if w(w) #0 
—1 roa f ? 
2 CD 2" ifw=0. 


vEeVv 


By Lemmas 3.14 and 3.15, and (7.1) we have 


Tda FS Cyere= 3 Cy? E ako), 


eECc k=0 u,w(ayak 


where w is the weight of w. The first factor is 0 if w ZN, the second is 0 if wEN 
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(unless w = 0, in which case the expression is obviously 2"). This completes the 
proof. O 


In the following, A, ,A,,,.-.,A;, are there nonzero coefficients of A(x, y). 
An interesting situation arises when s<d’'. In that case consider A’(x, y) and 
substitute x = 1. We find 


. i IC| . ’ n-i i 
+ D Ayy =n AKL + yyy. (7.5) 
I= i= 


Since Aj =0 for 1<i<d’, the jth derivative of the right-hand side of (7.5) in the 
point y = 1 does not depend on the values of the nonzero A; for 0<j<s—1. By 
calculating these derivatives, we find s linearly independent equations for the 
numbers A, . Therefore, ps numbers depend only on the parameters n and |C| 
and the numbers i,,l,,...,é,. Now, take any codeword as origin and consider 
the weight enumerator of c Clearly, nonzero coefficients can only occur for 
indices 0 and i; (1<j<s). Also, the MacWilliams transform of this weight 
enumerator must have zero coefficients for 1<i<d’. Therefore, the same 
equations that we found for the A, apply. We have thus proved the following 
theorem. 


Theorem 7.6. if s<d’, then the code is distance invariant. 


By the same argument, it follows that if we know the values A, for O<i ss’, 
then A(x, y) is determined by (7.5) because there are only s’ unknowns on the 
right-hand side. 

If C is distance invariant, then A, =0 or A, =1. Let us assume that we know 
A,,. We shall show that one can obtain ¢-designs by considering the words of some 
fixed weight as blocks. We take t= d’ — 5, where 5 = 5s — A,,. Consider some fixed 
t-tuple of positions and denote by A; the number of codewords of weight i, that 
have ones in those ¢ positions. Next, ‘take O<j <j<s,r=t+j. We now count pairs 
(c, x), where x is a word of weight 7 with ones in the ¢ fixed positions and ¢ is a 
codeword with ones in all the positions of x. From Theorem 7.3 we see that there 


are 
M c 7 ‘ 
2 \ 4 

such pairs (where M = |C|). On the other hand, the number of pairs is 


5 oe) 
ae Ai, 


In this way we find § linearly independent equations for the § unknowns A,,. It 
follows that these numbers are independent of the choice of the t-tuple. So. we 
have the following theorem. 


Codes 801 


Theorem 7.7. if 5<d' and i 2d’ — 5, then the words of weight i in the code form a 
i-design with t=d' — S. 


An easy example is provided by R(1,m). This yields the expected 3-design. 
More interesting examples can be found from the Golay code and its derivatives 
(see chapter 14). From the Nordstrom-Robinson code we find three 3-designs 
(blocksize 6, 8 and 10). 

There are several other theorems similar to Theorem 7.6. For these we refer to 
MacWilliams and Sloane (1977). We give one more theorem on designs obtained 
from codewords of fixed weight in a linear code — the celebrated Assynus— Mattson 
theorem (Assmus and Mattson 1969). We define the support of a codeword to be 
the set of positions where the nonzero coordinates occur. For a linear code over 
F,, the q—1 multiples of a fixed codeword have the same support. In the 
following, C is an [n, k] code over F, with minimum distance d, C* is the dual 
code, d’ is the dual distance. 


Theorem 7.8. Let t be a positive integer less than d. If q >2 let vu, be the largest 
integer satisfying 


whereas v, = Ww, =n if q =2. Suppose the number of nonzero weights of C* that 
are less than or equal ton —tis <d—t. Then, for each weight v with d<v Sug, 
the set of supports of codewords of weight v in C form a t-design. Furthermore, for 
each weight w with d'<w <min{n — t, wo}, the supports of codewords of weight w 
in C~ form a t-design. 


Proof. By definition of uy, two words of C with weight <u, that have the same 
support are scalar multiples of each other. Similarly for C”. If we delete d—1 
columns from a generator matrix of C, we still have a matrix of rank k. Consider 
a f-subset T of the positions. The code A is obtained from C by puncturing on T 
and B is obtained from C~ by shortening on T. Then B = A“. The conditions of 
the theorem imply that we can solve the MacWilliams equations of Theorem 2.4. 
This implies that the weight enumerator of B does not depend on the choice of T 
and the same holds for A. The assertion of the theorem now follows from 
straightforward counting arguments. O 


Examples. (a) The symmetry code Sym(12/) defined in Definition 5.20 is a 
ternary (12I, 6/] self-dual code. Therefore, all weights are divisible by 3. If the 
minimum distance d is larger than 3/, then we can take ¢=5 and apply Theorem 
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7.8. This idea works for /<5 and many of the resulting 5-designs were first 
constructed in this way (cf. Pless 1972). 

(b) By the remark following Theorem 5.19, the extended OR code of length 48 
has minimum distance 12 (14 is excluded by Theorem 3.4). Since the code is 
self-dual and the generator matrix has rows of weight 12, all weights are =0 
(mod 4). Again, we can apply Theorem 7.8 with ¢=5. This yields four different 
5-designs and their complements. 


8. Perfect codes and uniformly packed codes 


Perfect codes and nearly perfect codes were defined in section 2 and Definition 
3.12. This concept was generalized by Semakov et al. (1971) and the theory was 
developed further by Goethals and Van Tilborg (1975) and Van Tilborg (1976). 
We shall give a brief survey. For details we refer to the papers and to Van Lint 
(1982), and MacWilliams and Sloane (1977). We restrict ourselves to codes in F7. 


Definition 8.1. An e-error-correcting code C is called uniformly packed with 
parameters A and yw if for any x with distance e to C, there are exactly A 
codewords with distance e + 1 to x, and for any x with distance = e + 1 to C there 
are exactly 4 codewords with distance e+ 1 to x. 


By counting pairs (x,c) with e&C and d(x,c)=e+ 1, we see that A<(n — 
e)(q — 1)/(e +1), and equality implies 4 =0 and C is perfect. Nearly perfect 
codes are binary uniformly packed codes with 


n+1 
waati=(22F], 


Note that if e +1 divides n +1, then the code is perfect. 

The statement of Definition 8.1 can be translated into the group algebra 
terminology of the proof of Theorem 7.4. We find that C is uniformly packed with 
parameters A and y if and only if 


e-1 
A 1 
c+{Dy+(1-4)v.+2y..,}-v (8.2) 
i=0 Bw B 


in A. An argument analogous to the proof of Theorem 7.4 (see Van Tilborg 1976) 
then shows that (8.2) is equivalent to 


F(x) = y K,(x) + (1 - *)K.) + 7 K..,@). (8.3) 


From (8.3) we see that for a uniformly packed code s’ = e + 1. Now assume that 
C has external distance e + 1. Then from Theorem 7.4 we can calculate B(x, e + 
1) if x has distance e to C or distance = e + 1 to C. Therefore, C is uniformly 
packed. 


Codes 803 


Theorem 8.4. A code C is uniformly packed if and only tf its external distance 
equals e+ 1 (where d=2e + 1 or 2e +2). 


A further consequence of this and section 7 is that for a uniformly packed code 
and any x EV, the numbers B(x, k) are completely determined by the distance 
from x to C. 

If e =1 and the code is also linear, then a lot more is known. It can be shown 
(cf. Van Lint 1982) that a linear code with covering radius 2 and d = 3 is uniformly 
packed if and only if C* is a code which has s=2 (a two-weight code) and 
i+j=n+1, where A; #0, A, #0. For a survey of the connections of these codes 
to finite geometries see Calderbank and Kantor (1986). 

Besides their interest as a packing problem in F/, the uniformly packed codes 
also produce designs. A q-ary t — (n, k, A)-design is a collection S of vectors of 
weight k in F? such that every vector of weight ¢ in F” is covered by exactly ¢ 
elements of S. 


Theorem 8.5. If C is a uniformly packed code in F’ that contains 0, then the 
codewords of fixed weight in C form a q-ary e-design if d=2e+1, and a 
q-ary(e + 1)-design if d=2e +2. 


Proof. For words of weight d this is a consequence of the definition. The proof is 
by induction on the weight using the fact that for any x © V the numbers B(x, k) 
are completely determined by the distance from x to the code. O 


We now give a few examples of uniformly packed codes. 


8.1. A Hadamard code 


Consider a (12, 24,6) Hadamard code (see section 6). Puncture to obtain an 
(11, 24, 5) code C. If there were a word z with distance 2 or 3 to four codewords, 
then w.l.o.g. the original Hadamard matrix would have had four rows of the form 


+ + + + + + + + + + + + 
+ + + 4+ + + = = = = = = 
tot + -— = = + + Ft FO 
- =- = + + + + + FF HF = 


There is no row in {+1}" that is orthogonal to these four. Furthermore, the 
words with distance >1 to C have an average of three codewords at distance 2 or 
3 (easy counting). So this number is always three and C is uniformly packed. 


8.2. Preparata codes 


The Preparata codes P of Definition 6.3 have n =2”*’—1 (m odd) and e=2. 
Therefore, they satisfy (3.11) with equality, i.e., they are nearly perfect. 
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8.3. Two-weight codes 


Several two-weight codes can be obtained by starting with a first-order RM code 
(hyperplanes) and restricting to the positions of a nondegenerate elliptic quadric 
(since the hyperplanes have intersections with this quadric of only two different 
cardinalities). For details we refer to Van Tilborg (1976) or Calderbank and 
Kantor (1986). 


8.4. Nonexistence theorems for perfect codes and uniformly packed codes 


It was shown by Tietavainen (1973) and Van Lint (1971) that the Golay codes are 
the only nontrivial e-error-correcting perfect codes with e > 1 over any alphabet Q 
for which |Q| is a prime power. For e>2, e #6, the restriction on Q can be 
dropped, as was shown by Best (1982). The case e =6 was handled by Hong 
(1984). Van Tilborg (1976) showed that e-error-correcting uniformly packed codes 
with e > 3 do not exist. For e <3 much is known about nonexistence. The proofs 
for the binary case and for uniformly packed codes with A= yz +1 are easy (see 
Van Lint 1982). The problem of the existence of perfect codes that correct 1 or 2 
errors for alphabets that are not fields looks hopeless at the moment. For a survey 
of perfect codes and several generalizations see Van Lint (1975). 


9. Codes, finite geometries and block designs 


The Reed-Muller codes of Definition 5.11 have of course a geometrical interpre- 
tation. Many ideas from the theory of affine spaces are used in treating, 
understanding and decoding these codes. Similarly, other finite geometries have 
led to codes with certain desirable properties. Many of these are in fact subcodes 
of generalizations of RM codes (in their cyclic representation). In this case the 
codes have not led to new insight into the combinatorics. For a treatment of 
geometric codes see Berlekamp (1984). 

On the other hand, the methods of coding theory have led to quite a number of 
interesting theorems on block designs, including projective planes. Usually these 
results were nonexistence theorems. The idea is usually to study the binary code 
generated by the rows of the incidence matrix of a design that is not known but is 
assumed to exist. The properties of the alleged design lead to relations for the 
parameters of the corresponding binary code. Quite often, theorems of the type 
treated in this chapter show that such a code does not exist and hence the 
combinatorial design does not exist either. Below, we shall give a few examples of 
these methods. There have been recent applications by Bridges et al. (1981) and 
Tonchev (1986a,b). Also see Hall (1986, chapter 17). 

Let A be the incidence matrix of a projective plane of order n. C is the code of . 
length nWtnt+l generated by the rows of C. If 2 is odd, the rows of A with a 1 
in a fixed position have as sum a word with 0 in that position and 1’s elsewhere. 
Therefore, C is the even-weight code. If 2 is even, we obtain more interesting 
results. 


Theorem 9.1. If n=2 (mod 4) the code C has dimension 4(n’? +n + 2). 
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Proof. Clearly C is self-orthogonal. Hence dim C < 1(n? +n+2). Let dim C=r, 
dim C* =n?+n+1—r=k. Let H be a parity-check matrix for C in the form 
H=(I,P), possibly after a permutation of the places. Define 


Nic [, P 
AO! AL 


Over @ we have det AN = det A =(n + 1)"""*”””, Since all entries in the first k 
columns of AN" are even (by definition of H), we see that det A is divisible by 
2*. It follows that k<1(n?+n), and hence r2=1(n?+n+2). O 


Note that Theorem 9.1 shows that C is self-dual. 


Theorem 9.2. C has minimum weight n +1 and furthermore: 
(i) the vectors of weight n +1 in C are the lines of the plane, 
(ii) the vectors of weight n +2 in C are the hyperovals of the plane. 


Proof. If ¢ is a codeword of weight d, then count the intersections of c and the 
lines of the plane. The assertions follow immediately. If S is a hyperoval in the 
plane, consider the sum of the lines through pairs of points of S$. This is clearly 
(the characteristic function of) S. O 


For an account of the attacks on the existence problem of the projective plane 
of order 10 (using similar methods) see chapter 13. 
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1. Introduction 


A conjecture both deep and profound 
Is whether a circle is round. 

In a paper of Erdés 

Written in Kurdish 

A counterexample is found. 


Leo Moser 


Although geometry has been studied for thousands of years, the term com- 
binatorial geometry is of quite recent origin. [We do not know exactly when it first 
appeared, but Levi used it in the introduction to his 1929 book on configurations 
as if it were already an established term (Levi 1929).] Besides our present 
meaning of geometrical questions involving combinatorics, it is often used to 
cover the theory of convex bodies and the related problems of covering, packing, 
enumeration, and geometric inequalities. Although we do not specifically exclude 
these other topics, we are mainly interested in, and will mainly discuss, extremal 
problems involving finite sets of points in Euclidean space of a highly com- 
binatorial nature. Within that context, we do not claim completeness and will 
naturally tend to discuss problems on which we ourselves have worked, and we 
will try to indicate the literature of related problems. We will give short sketches 
of proofs when we wish to illustrate the methods used to prove things. We refer 
the reader to the senior author’s numerous survey papers on this subject (Erdés 
1975, 1980, 1981, 1984, 1985a,b) and to Griinbaum (1972). See also Grinbaum 
(1967), Griinbaum and Shephard (1987), Gruber and Wills (1983), Croft (1970), 
and Edelsbrunner (1987). 

We also want to point out the delightful book of Hadwiger and Debrunner 
which appeared in German and French and was later translated into English by 
Klee who added much original material and also brought the book up to date 
(Hadwiger et al. 1964). Klee has also informed us that his book with Stan Wagon, 
Old and New Solved and Unsolved Problems from Plane Geometry and Number 
Theory (Klee and Wagon 1991) has appeared. 

Another source of problems and results is the 1986 edition of the collection 
‘‘Research problems in discrete geometry” (Moser and Pach 1986). The nucleus 
of this collection was a list of problems (Moser 1966) compiled by W. O. J. Moser 
in memory of Leo Moser. Further expanded editions appear from time to time. 

From 1954 until 1973, there was a section entitled ‘““Ungeléste Probleme” 
(unsolved problems) edited by Hadwiger, in the Swiss journal Elemente der 
Mathematik. A total of 56 problems appeared. Stimulating geometrical problems 
also appear in the solved and unsolved problem sections in the American 
Mathematical Monthly. 

Several journals have specialized in geometry: Geometriae Dedicata, published 
by Reidel, started in 1972; Discrete and Computational Geometry, published by 
Springer, started in 1986; and the Journal of Geometry, published by Birkhauser, 
started in 1971. 
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In preparing this chapter we received help from many of the same people 
whose names appear in the references. We should particularly like to thank B. 
Griinbaum, L. M. Kelly, V. Klee, J. Pach, J.-P. Roudneff, and T. Zaslavsky. We 
also thank the editors for their assistance. 


2. Sylvester—Gallai theorems 


2.1. Sylvester’s problem 


Sylvester’s problem derives its name from the fact that it was stated as a problem 
to be solved by the reader by J. J. Sylvester in a column of mathematical 
problems and solutions in the Educational Times, published in London, in 1893 
(Sylvester 1893). Shortly thereafter an incorrect proof submitted by one of the 
readers appeared in the same column. Erdés thinks that Sylvester had a correct 
proof, but Coxeter (1948) seemed to assume that he did not. 

The problem was to show that any configuration of n points in the plane, not all 
on a line, must determine a line containing exactly two of the points. 

In posing this problem, Sylvester was probably motivated by the nine points of 
inflection of a general cubic curve. These nine (complex) points have the 
interesting property that a line through any two of them passes through a 
third -for more details see section 4.1. 

Some 40 years later the problem was revived by Erdés and subsequently solved 
by the Hungarian mathematician T. Gallai, and also by several others. Gallai’s 
proof may be found in de Bruijn and Erdés (1948). We shall call a two-point line 
a Gailai line, although the term ordinary line, introduced by Motzkin (1951), is 
also used. 

Kelly’s minimum-altitude proof is probably the nicest. It appeared in Coxeter 
(1948) along with Coxeter’s own proof. Let the n points be given in the Euclidean 
plane, not all on a line, and form all of the connecting lines. Let the point P and 
the line L have the minimum positive distance that occurs among all point-line 
pairs. Then the claim is that 1 must be a Gallai line. To see this, let Q be the 
closest point on L and P. Suppose that ZL has at least three points on it. Then 
there will be two points P, and P, on the same closed half-line. Let us say that P, 
is between P, and Q, possibly coinciding with Q. Then it is elementary to see that 
the line PP, is closer to P, than P is to L, contrary to the original assumption, 
and so there is always a Gallai line. 

By forming the polar reciprocal, interchanging relevant points and lines, one 
may arrive at the following dual form of the Sylvester—Gallai theorem: given n 
lines in R’, not all concurrent, and not all parallel, there is an intersection point 
determined by the lines that is incident with exactly two lines—a simple — 
intersection point. 

Alternatively, one may use the standard embedding of R? into the real 
projective plane R?* and form the projective dual of a system of m noncollinear 
points. That will be an arrangement of n lines not forming a pencil, and the dual 
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of the Sylvester—-Gallai theorem would say that the lines determine a simple 
crossing point. 

The existence of a Gallai line leads to a simple induction proof of the fact that 
there are at least lines: the result is true for n =3; let 2 >3 and assume the 
result is true for n — 1 points. Let L be a Gallai line, and let P be one of the two 
points on L. Removing P results in a configuration of n — 1 points. If they are all 
collinear, then the original configuration was a very special one called a “near- 
collinear set” with n — 1 points collinear, having 1 points and n lines. If the n —1 
points are not all collinear, then there are at least n —1 lines by the induction 
hypothesis, and Z is a new line. The result follows. 


2.1.1. Conic sections 

Wilson and Wiseman (1988) proved the following Sylvester theorem for conic 
sections: given points in the plane, if no conic section contains them all, then 
there is a conic section containing exactly five of them. See section 5.10 for results 
on circles. 


2.1.2. Compact convex sets 

There have been a number of generalizations of Sylvester’s theorem to versions in 
which the n points are replaced by n compact sets. In one of the most definitive of 
these, Herzog and Kelly (1960) proved the following: 


The Herzog—Kelly theorem. Let there be given n pairwise disjoint compact sets in 
R*, not all contained in a line. Suppose that at least one of the sets contains 
infinitely many points. Then there is a line intersecting exactly two of them. 


They remark that the result is not true if compactness of the sets is not 
assumed, as shown by the example of n parallel lines in the plane. If all of the n 
sets are finite, then the result is also false. For example, the nine points of the 
planar Pappus configuration may be arranged in three sets such that a line cutting 
any two intersects a third. See Herzog and Kelly (1960) for this and also for a 
proof of a weaker result in the finite case. Under the same hypothesis as the 
Herzog-Kelly theorem, Edelstein et al. (1963) proved that there is a hyperplane 
intersecting exactly two of the sets, which is a stronger result. We refer the reader 
to the references in Herzog and Kelly (1960) and Edelstein et al. (1963). 


2.1.3. Infinite sets 

Borwein (1984) proved Sylvester’s theorem for compact countably infinite sets 5 
in the plane. That is to say, if the line joining any two points of S intersects a third 
point of S, then S is contained in a line. 


2.2. The Motzkin—Dirac conjecture on the number of Gallai lines 


Erdés and de Bruijn asked whether ¢,(n), the minimum number of Gallai lines, 
where the minimum is taken over all configurations of 7 points, tends to infinity 
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with . Subsequent to this, Motzkin (1951) showed that #,(7) is at least the square 
root of n. Dirac (1951) also asked the same question and they both conjectured 
that ¢,(”) [37]. 

Kelly and Moser (1958) proved that ¢,(2) 24”, with equality occurring for an 
example with 7 points. 

The following example due to Motzkin (1975) shows that ¢,()>4n is best 
possible, if true, when 7 is even. If n = 2m = 4k, take the regular m-gon together 
with the m points on the line L at infinity where the lines joining pairs of the 
m-gon intersect L. The m Gallai lines arise when each vertex V, of the m-gon is 
joined to the point at infinity corresponding to the direction determined by 
V,-Vi+1. If 2 =2n = 4k + 2, the take a regular 2k-gon and its center, together 
with 2k + 1 points on the line at infinity. 

Hansen (1981) ‘‘proves” that ¢,(n) = $n except when n =7 and n = 13, where 
t,(7) = 3 and t,(13) = 6. Recently Csima and Sawyer (1993) corrected his proof so 
that it really shows ¢,(n) =n. 


2.3. Higher dimensions 


In R* it must also be the case that n points not all collinear determine a Gallai 
line. This may be seen by projecting the points into the plane in one of the 
infinitely many directions which do not cause any image points to coincide or new 
collineations to occur, and then applying the result in the plane. 


2.3.1. The Motzkin conjectures 

What about planes? One might reasonably ask whether m points in Euclidean 
space, not all lying on a plane, always determine a plane with only three points —a 
so-called “elementary plane’. Motzkin (1951) showed that unfortunately this is 
false by providing two counterexamples, but he saved the situation by proving the 
following (we use his numbering). 


Theorem U,. If we define a Gallai plane to be one in which all but one of its points 
lie on a single line, then there is always a Gallai plane formed by n points in 
Euclidean three-space, which do not all lie on one plane. 


He observed that U, implies by an induction argument similar to the one for 
lines: 


Theorem T,. Given n points in Euclidean space, not all lying on a plane, they 
determine at least n planes. 


He made the more general conjecture: 
Conjecture U,. If we define a Gallai hyperplane to be one in which all but one of 


its points lie in a (d — 2)-dimensional flat, then n points in R* always determine a 
Gallai hyperplane, unless they all lie in one hyperplane. 
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From this conjecture would follow: 


Conjecture T,. Given n points in R*, not all lying in one hyperplane, they 
determine at least 1 hyperplanes. 


Motzkin was able to prove T, for d=4, 5,..., but not the apparently harder 
U, for d>3. Hansen (1965) proved Uy, for all d. 


2.3.2. The Purdy conjectures 

Let n points be given in R’, not all lying on a plane. The number of planes p is at 
least n, and the number of lines ¢, by projection into the plane, is at least n. Is it 
always the case that p =t? The example of two skew lines with k points on one 
line and n — k on the other gives p=n and t=2+ k(n — k), and the example of 
n—1 points in a plane in general position and one point off the plane gives 
p~t+n=2. We have shown (Erdés and Purdy 1975) that p—t+n=22 if no 
three points are collinear and n is sufficiently large. Purdy (1986) has also shown 
that p > ct, where c > 0, provided that the points do not all lie on two skew lines. 
In fact, there is always one point with that many planes through it. Purdy 
conjectures that p =r for n sufficiently large, except for the two infinite families 
of exceptions mentioned: the points lie on two skew lines, or all but one of the 
points lie on a plane. Grinbaum pointed out that the eight vertices of a cube form 
a counterexample, and also gave us an example with 15 points and another with 
16 points, but thinks perhaps the conjecture is true for n = 17. See Griinbaum and 
Shephard (1984), where these examples are discussed in the dual setting. 

In R¢ Purdy extends the conjecture as follows: we call a configuration of points 
irreducible if points cannot be covered by a set of r =2 nonempty flats F,,...,F,, 
such that the sum of the ranks rk(F,) + --- + rk(F,) is at most d + 1 = rk(R*). By 
“rank”, we mean one more than the dimension (like the matroid rank): points 
have rank one, lines have rank two, etc. Thus a configuration of points on two 
skew lines in R° is not irreducible, since the points are covered by two flats of 
rank two, and the configuration on n — 1 points on a plane and one point off the 
plane in R? is not irreducible, being covered by a flat of rank three and a flat of 
rank one. 

Purdy conjectures that if n > N, and the points form an irreducible configura- 
tion, then W, = W,_,, where W, denotes the number of flats of rank k (the kth 
Whitney number). The conjecture implies by projection that W, =W,_, 2:°:= 
W, = W,, which implies G. C. Rota’s unimodality conjecture in this context. The 
conjecture is false in the more general context of geometric lattices (or equiva- 
lently, simple matroids), as may be seen by looking at d-dimensional projective 
space over the Galois field Z,. 


2.3.3. The number of Gallai planes 

There are also results about the numbers of Gallai planes. Bonnice and Kelly 
(1971) proved that there are at least 4. Gallai planes in Euclidean space. In a 
later paper, Hansen (1980) improves this result to 27. It is not clear what the 
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minimum number is. Probably more than cn Gallai planes exist with c > 2. In the 
same paper, Hansen also gives further examples of configurations of points 
without elementary planes. As mentioned already, Motzkin (1951) observed that 
there are configurations in R’ without elementary planes: for example, two skew 
lines with at least three points on each. Further, he noted that the Desargues 
configuration in R’ with 10 points has none. Bonnice and Kelly (1971) furnish the 
following example: construct a vertical prism whose base is a regular m-gon for 
odd m. For the configuration take the 2m vertices of the prism, the m points on 
the plane at infinity where the (horizontal) sides of the parallel m-gon intersect it, 
and the single point on the plane at infinity where the p vertical edges of the 
prism intersect it. There are 3m + 1 vertices. The configuration has no elementary 
planes. If m=3 this is the Desargues configuration. Hansen’s examples are 
somewhat similar: the prism is based on a regular 4m-gon, and you can have a 
total of either 4m +3 or 10m + 2 points. 

Griinbaum and Shephard have studied these examples and many others in the 
dual setting where they give rise to simplicial arrangements of planes — all the cells 
are simplices. They also point out an error in one of Hansen’s examples 
(Griinbaum and Shephard 1984). 


3. Arrangements 


3.1. Regions in the projective plane 


Suppose that we have n lines in the real projective plane, not all passing through 
one point. Let 1, denote the number of points incident with exactly k of the lines 
for k =2. The lines will divide the real projective plane into polygonal regions. 
Let p, denote the number of regions having k sides. 

There are two very useful formulas relating these quantities: 


Hatt, +2to+---+pyt2pst 3pet--- 
and 
P3=44+2(t, +21, + 3t,+---)+pst+2pe+3p,+-°:. 


The first of these appeared in Melchior (1940). The second one appeared in 
Griinbaum (1972), and it implies the inequality 


ps 74+ p+ 2p,+3p,+---, 


with equality characterizing simple arrangements. See Griinbaum (1972) for 
sufficient conditions on the p, that an arrangement exists. This relates to. 
Eberhard’s theorem - see Griinbaum (1967, p. 253). 

We sketch a proof of the two identities. Let F denote the total number of 
regions. Let V=t, +t, +--+ be the total number of vertices. Then there is an 
underlying graph on V vertices which is embedded in the real projective plane, 
and let E denote the number of edges. The real projective plane may be derived 
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from the sphere by identifying opposite points. The geodesic great circles on the 
sphere then correspond to lines in the projective plane. On the sphere everything 
will occur twice. There will be V’=2V vertices, E'=2E edges, and F'=2F 
regions. Euler’s polyhedral formula gives V’-— E’+F’=2, so that in the 
projective plane V— E + F = 1. It is also easy to see that on the sphere, since the 
great circles are not all concurrent, there cannot be any digons (two-sided 
regions). Three nonconcurrent circles form only triangles, and the result follows 
by induction, since the addition of a circle to an arrangement having no digons 
will not introduce a digon. 

Returning to the projective plane, we see that 2E = 3p, + 4p, +--+ =2(2t, + 
3t, + 4t, +--+). Judicious substitution into the V- E+ F formula now gives us 
the two formulas. 

Since the p, are nonnegative, we also have Melchior’s inequality 


t,23+t,+2t,+3t,+---, 


which holds, by duality, for n points in the real projective plane, not all collinear, 
where ¢, is the number of lines incident with exactly i points. Obviously this also 
holds in the Euclidean plane. This implies that the number of Gallai lines is at 
least three, and so we have yet another solution to Sylvester’s problem! 

In section 4.1 we will discuss an inequality due to F. Hirzebruch which holds 
provided 0=¢, =t,_, =t,_2! 


tp +3t,2n+t,+3t,+5t,+---. 


Euler’s formula also gives us the number of regions F=1—-—V+E=1-t,+ 
2t, + 3t, + +++, so that the number of regions depends only on the ¢,, and not on 
the particular embedding of the arrangement into the projective plane. In the 
Euclidean plane, n lines always determine 2n unbounded regions. When the line 
at infinity is removed, n regions are split in two to form them. (We are assuming 
that the example of n parallel lines has been excluded on the grounds that they 
are all concurrent on the line at infinity.) Thus in R? the number of regions is 
l+n+t,+2t,+3t,+---. 

The maximum number of regions that can be formed by an arrangement of n 
lines in the projective plane is 1 + 4n(m — 1). This may be seen by induction on n 
and the fact that three lines determine four regions. For 7 lines in the Euclidean 
plane the maximum number is x more than this, namely 1+ n(n + 1). In both 
cases the maximum occurs if the lines are in general position, no three lines are 
concurrent. Purdy (1980b) discusses the minimum number of regions f, formed if 
at most n—k lines are concurrent. He shows that if n=4k?+k+1, then 
fh=(k+Da-k). 


3.2. Pseudolines 


Many problems and results about arrangements of lines in the real projective 
plane make sense even if the lines are not perfectly straight. An arrangement of 
pseudolines is a family of 2 closed curves in the real projective plane such that any 
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two meet at precisely one point, where they cross each other. There are 
arrangements of pseudolines with only nine points which cannot be drawn with 
straight lines. Grinbaum(1972) conjectured and Goodman and Pollack (1980a) 
proved that all arrangements of fewer than nine pseudolines can be drawn with 
straight lines. 

Pseudolines were introduced by Levi in a 1926 paper on arrangements of lines 
(Levi 1926). He establishes the following “enlargement” lemma: given an 
arrangement of n pseudolines L,,..., L,, and two points P and Q which do not 
both lie on any of them, there is an enlarged arrangement L,,..., L,,,, in which 
L,., contains both P and Q. A proof is also contained in a monograph 
(Griinbaum 1972). 

Melchior’s identity and Griinbaum’s identity above both hold for arrangements 
of pseudolines. Many results for points and lines, when changed to their dual 
formulations for lines and points, hold for arrangements of pseudolines as well. 
The result of Kelly and Moser (1958) mentioned in section 2.2 (that the number 
of Gallai lines t, is at least 3m) is one such result. The corresponding statement 
for pseudolines is that an arrangement of n pseudolines always has at least 3n 
simple crossings. This was proved by Kelly and Rottenberg (1972). Their proof 
made use of Levi’s enlargement lemma. Since then, many results have been ~ 
extended to pseudolines, and we will mention these results when they arise 
naturally. 

Kelly and Moser (1958) used Melchior’s identity to show that if no line contains 
more than n — k of the n points, where 27k” < 2n, then the total number of lines t 
is at least nk — 2k’. Their proof works for pseudolines. Around that time, Erdés 
conjectured that there was a constant c > 0 such that ¢ > ckn, no matter how large 
k was. Beck (1983), and simultaneously Szemerédi and Trotter (1983), proved 
this with very small c. Recently, c has improved (see section 3.6). Beck’s proof 
also holds for pseudoline arrangements. The result is equivalent to a weakened 
conjecture of Dirac (see section 3.9). 

Erdos asked the following question (Erdés 1983): if m lines are given in the 
projective plane, and t, = 0 for k > 3, does there always exist a ‘‘Gallai”’ triangle, 
i.e., three lines of the arrangement whose three intersection points are all simple 
(Gallai) points? The example called B, by Fiiredi and Palasti (see section 3.7) 
shows that the answer is no if is at least four and not divisible by nine (Fiiredi 
and Palasti 1984). Another problem on simple points can be found in the 
above-mentioned article (Erdés 1983). 


3.3, The Graham-—Newman problem 


Graham and Newman posed the following problem: given a finite set of points in. 
the plane, each colored either red or blue and not all collinear, must there be a 
“monochromatic line”? That is, must there be a line containing only points of one 
color? 

Motzkin (1967) and M. Rabin announced this result, but neither of them 
published a proof, and the first published proof is due to Chakerian. A shorter 
proof, which bears a resemblance to Kelly’s elegant proof of Sylvester’s theorem, 
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appears in some class notes of Griinbaum (1974), where it is attributed to 
Sherman K. Stein. Since Chakerian’s proof is nicely explained and readily 
available in Chakerian (1970), we present Stein’s proof, previously unpublished. 
[Remark by Purdy: it seems that if you find a really short elegant proof, 
somebody else will write it up for you, because Coxeter wrote up Kelly’s proof in 
Coxeter (1948) and Chakerian apparently wrote up Stein’s proof for him.] 


Theorem. Given n lines in the real projective plane, colored by two colors, and not 
forming a pencil, there is an intersection point incident with only lines of one color. 


Proof. Let n lines be given, not forming a pencil, and colored by two colors. We 
wish to prove that there is an intersection point incident with only lines of one 
color, so let us suppose that there are no such monochromatic points. If the 
intersection points were all simple, i.e., no three lines concurrent, then any two 
lines of the same color would intersect in a monochromatic points. Hence the 
arrangement is not simple and there is an intersection point P incident with at 
least three of the lines. 

Since the lines through P are not all of the same color, we may suppose that 
there are lines L, L', and L” among the lines through P such that L and L’ have 
color C, and L” has color C”. If the lines not incident with P were all of color C, 
then any one of them would have to intersect L in a monochromatic point of that 
color. Hence there is a line L* of color C” not incident with P. If we ignore the 
other 1 — 4 lines, then the two triangles LL”L* and L”L'L* share an edge which 
is part of the line L". These two triangles would become merged into one larger 
triangle upon the removal of the line L”. The larger triangle we call a characteris- 
tic triangle of the arrangement. Any other triangle of this form, where L and L” 
have one color and L” and L* have the other color, we also refer to as a 
characteristic triangle of the arrangement. 

Since there are only finitely many characteristic triangles, there must be one 
which is minimal in the sense that there is not another one completely contained 
in it. Suppose that L, L', L", and L* as already described form a minimal 
characteristic triangle LL’L* which is cut into two triangles by the line L”. We 
claim that the intersection point Q of the lines L” and L* is monochromatic. To 
see this, first Suppose not. Since L” and L* have color C’, there must be a line L* 
of color C passing through Q. The line L* will cut either triangle LL"L* or 
L"L'L*. Without loss of generality we may suppose that L* cuts LL"L*. But 
then the four lines L*, L, L”, and L* give rise to a characteristic triangle LL"L* 
which is contained in the supposedly smallest one LL’L*, which is a 
contradiction. O 


Remark. This proof and also Chakerian’s proof work for pseudolines without any 
changes. 


Can we insist that the monochromatic line always be red, say? Clearly not, 
since we can always place blue points so as to spoil the red lines. 
Can we insist that the monochromatic line by a Gallai line (a 2-point line)? The 
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example of n — 1 red points on a line and one blue point off the line shows that 
we cannot. 

Erdés asked the following question. Does there exist for every k a set of points 
in the plane so that if one colors the points by two colors in an arbitrary way, 
there always should be at least one line which contains at least k points, all of 
whose points have the same color? Graham and Selfridge gave an affirmative 
answer for k = 3, but the problem is still open for k >3. 

Is there a generalization of the Graham—Newman problem to three dimen- 
sions? The straightforward generalization is false: given n points in R® colored by 
two colors, not all lying on one plane, there is not always a monochromatic plane. 
On a plane M put r — 1 red points in general position, and place one red point off 
the plane. On every line formed by two or more red points in the plane M place a 
blue point. Clearly there are no monochromatic planes. Perhaps if neither color 
separately is entirely contained in a plane there must be a monochromatic plane. 
Perhaps if no three points are collinear. 

Borwein has generalized the original result, replacing finite sets by countable 
compact sets (Borwein 1984). 


3.4. The two-coloring of regions 


If we are given 7 lines in the Euclidean (but not the real projective) plane, then 
an easy induction argument shows that the regions can be colored with two colors 
so that no two regions that have a side in common with the same color. If n = 1, 
color the two sides of the line with opposite colors. Let n > 1, and suppose that 
the result is true for n — 1, Remove one line L, and use the induction hypothesis 
to color the resulting arrangement. Not put L back. Reverse the colors on one 
side of L and leave the colors alone on the other side. The result follows by 
induction. 

The coloring is unique for each arrangement of lines, for once one region is 
colored, the colors of its neighbors are determined. Let us color the regions using 
r red regions and g green regions, where g <r. If you draw some examples, you 
will see that in general g<r. Fejes-Toth (1975) asked how large r can be 
compared to g. Palasti (1976) investigated this question for simple arrangements, 
applying a result from Palasti (1975) to show that r/g <2 for n <9. A few years 
later, Griinbaum (1980), and simultaneously Simmons and Wetzel (1979), proved 
that r < 2g — 2 for any arrangement. Examples due to Simmons, Palasti and other 
show that equality occurs for a few small values of n, but no infinite family is 
known. The example which Firedi and Palasti call A, (see section 3.7) gives 
r=2g—n-2 if n=3m, which implies that lim sup r/g =2 as n>». 

The regions formed by an arrangement of  pseudolines in the projective plane 
cannot in general be two-colored. But we may define an arrangement of 
topological lines in R° in a manner similar to pseudolines. The above proofs can 
then be used with little modification to show that r < 2g — 2 for arrangements of 
topological lines. Harborth (1985a) has shown that for such lines 7 <jn(n + 1) ifn 
is odd, and r<{n(2n + 1) if 2 is even, with equality occurring infinitely often in 
both cases. 
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One can ask the same question in R’ and higher dimensions. In R* Purdy and 
Wetzel (1980) showed that lim sup r/g <4(d + 1) + 1/d if the hyperplanes are in 
general position (no d having a line in common). 


3.5. Which sequences { p,} are realizable? 


In Levi’s fundamental paper on pseudolines (Levi 1926) may be found a proof of 
the assertion that in an arrangement of pseudolines p, =n. Griinbaum (1972) 
conjectured that for lines equality only occurs for simple arrangements, and this 
was proved by Roudneff (1988). The result is false for pseudolines. There is a 
counterexample with n =9 due to Canham in Griinbaum (1972). 

Griinbaum (1972) conjectured that p, <4n(n — 1) if n is sufficiently large. This 
was established by Canham for simple arrangements, and by Strommer (1977) for 
simplicial arrangements, i.e., arrangements in which every region is a triangle. 
Harborth and Roudneff have constructed simple arrangements of pseudolines 
achieving equality for infinitely many values of nm (Harborth 1985a,b, Roudneff 
1986). It is still not known whether this can be done with (straight) lines, a 
problem originally posed by Griinbaum. The example that Firedi and Palasti 
(1984) call B, contains at least 4 + n(n — 3) triangles. This lower bound was also 
achieved by Strommer (1977) using a construction of Burr et al. (1974). If we 
insist on simple arrangements (no three lines concurrent), then Firedi and 
Palasti’s A, has at least }(n — 1) triangles. See section 3.6 and Fiiredi and Palasti 
(1984). 

Getting back to the conjecture itself, in 1980 Purdy established the general 
result 


P3 Sant —1)+4 whenn=>6, 


for all arrangements (Purdy 1980a). In 1987, Roudneff (1991) proved the full 
conjecture for n>10, it being false for smaller n. One interesting feature of 
Roudneff’s proof is that it perturbs the lines, changing them into pseudolines. 
This is the first case we know of where the proof cannot be carried out with 
(straight) lines alone. Unless another proof is found, pseudolines have now 
become indispensable. 

If the sequence p,, P;, Pg, P,,... Satisfies the equation 


p,=4+p,+2p.+3p,+---, 


then (Griinbaum 1972) there is a simple arrangement of (straight) lines realizing 
it, but it is not clear exactly what conditions p, must satisfy. 

Griinbaum had conjectured that p, must be nonzero for simple arrangements if 
n is large enough. For pseudolines this was disproved by Harborth (1985a) and 
also by Roudneff (1987), but the question is still open for (straight) line 
arrangements. 

The number p, of quadrilaterals satisfies the upper bound 


Py =3n(n — 3) 
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(Grinbaum 1972, Palasti 1975). 

Any discussion of upper bounds on p, is complicated by minor disagreements in 
the literature. We will only say that Grinbaum (1972) and Palasti (1975) agree to 
the extent that p,<4n(n—3) when n=8, with equality achieved by simple 
arrangements when n is odd. 

There are also mixed results. For example, Purdy (1980b) showed that if 
n= 106, then max{p,, p,} 2 2n — 8. Strommer (1977) showed that p, + p, > 0 if 
the arrangement is simple and n = 4. Roudneff (1987) showed that 4p, + Sp, = 3n 
under the same conditions. 


3.6. The maximum of t, 


In this section we discuss how large ¢, can be as a function of 2 and k. 
Croft and Erdés (Griinbaum 1972) constructed the following example, for 
which ¢, > cn7/k?. 


Example. For fixed & and m, take the m points P(x, y) with integer coordinates 
1<x, ym. The arrangement of lines will consist of all those lines that can be 
drawn through the points P in the k given directions. We shall choose k slopes so 
that as few lines as possible are needed to cover the m’ lattice points. Only m 
horizontal lines are needed to cover the points, and only m vertical lines. More 
lines of slope 1 are needed, but 2m will suffice, a they will for lines of slope —1. 
More lines still will be needed for slope 5, but 4m will suffice. Similarly 4 lines 
of slopes —4, 2, and —2 will cover the points. If i and j have no common factor, 
and 0<i<j, then 2jm lines of slopes i/j, —i/j, j/i, and —j/i will cover the m 
points. For fixed j, there are }(j) choices for 7. Let N be such that 


k=2+49(2) + 46(3) +---+40(N-1) +r, 
where r<4@(N). Then 
n=2m + 8{26(2) + 36(3) +--+ (N—- LI d(N-1)+rN}m 


lines can cover the m? points k times. The average value of Euler’s $(x) function 
is about 60% of x [actually 6x/° + o(x)], so asymptotically k = 24N7/x? and 
n=16N m/q? =2nk’?m/6??, m? = 54n7/ (wk). 


3.6.1. Upper bounds 

What about upper bounds? Let T, =¢, + t,,, +°-+. Simple counting arguments 
show that t,<T,<cn?/k’, and Erdés asked whether T, =O(n’/k’), i.e., 
whether the above Croft-Erdés construction is essentially optimal. Szemerédi and 
Trotter (1983) were able to show this if k*<m, with an enormous implied 
constant. Beck (1983) found the weaker result T, <cn’ /k?*4, where d>0, but 
his proof has the virtue that it works for pseudolines as well. Both results are very 
impressive. Both papers independently deduce proofs of Erdés’s conjecture that 
at least ckn lines are determined if no n—& points are collinear (discussed in 


Extremal problems in combinatorial geometry 823 


section 3.2), and then go on to deduce the weak form of Dirac’s conjecture 
(discussed in section 3.9). 

Szemerédi and Trotter deduced their upper bound on 7, from another 
inequality: let n points and ¢ lines be given in the plane, with no particular 
relationship between them, and let J(n, tf) denote the total number of incidences, 
counting multiplicity, occurring between them. Then they showed 


I(n, t) <n? OP? 


with an enormous constant c. Recently, Clarkson et al. (1990) used completely 
different methods to show that 


I(n, t) $3(6)'° Pn’? + 25n + 20, 


and their proof also works for pseudolines. The constant 3(6)'”* in the leading 


terms is approximately 5.45. Their inequality can be used to show that 
n? 
ke 
3.7. The orchard problem 


T,<16-3' 75 fk? <8-3° 55. 


In a problem in the Educational Times, Sylvester (1867) considered the following 
question: how large can ¢, be, given that no four points are collinear. He also 
gave a construction showing that t, > 4n” + O(n). The following simplified version 
of Sylvester’s construction, due to A. H. Stone, appears in Griinbaum (1972). 

The cubic curve y =x° cannot have four collinear points on it. If P(x) is the 
point (x, x°), then it is not hard to see that three points P(x,), P(x.) and P(x,) are 
collinear if and only if x, +x, +x, =0. If n=2m+1, then the points P(—m), 
P(—-m+1),..., P(0),..., P(m) provide at least 17n” collinear triples. If n = 2m, 
then leave out P(—m). This gives ¢, 21? 

Burr et al. (1979) improved this to 1, =1n” + O(n) using the Weierstrass elliptic 
function. (Sylvester claimed, without a published proof, this lower bound also.) 
They also given an account of the history of the ‘orchard problem”, starting with 
J. Jackson in 1821, they discuss the connection with Kirkman-Steiner triples, and 
they have an extensive bibliography. Their constructions have applications to 
several related problems. 

Subsequently, Firedi and Palasti (1984) obtained similar results by two purely 
geometrical constructions which they called A, and B,, and that is how we shall 
refer to them. A, is a simple arrangement (no three lines concurrent). See Fiiredi 
and Palasti (1984) for the relationship of A, and B, to the constructions in Burr 
et al. (1979). We now give their constructions. 

Let P(a@) denote the point (cos a, sina) in the plane, and let L(a) denote the 
line through the two points P(a) and P(m — 2a). If a = 1/3, so that the two points 
coincide, then L is the tangent to the circle x” + y= 1 at P(/3). Then A, is the 
arrangement of lines L(w/n), L(3m/n),...,L((2n—1)m/n), and B, is the 
arrangement of lines L(0), L(2m/n),..., L(@2—1)a/n). The lines L(a), L(B), 
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and L(y) are concurrent if and only if a + 8 + y is an even multiple of 27. Fiiredi 
and Palasti showed 1,(B,) =n —3+ €, where ¢ =0, 2, 2 if n=0, 1, 2 (mod 3), 
and ¢, =0 for k>3. The value of ¢, is then determined by counting pairs: 
t, + 3t, =4n(n — 1). It is easy to see that A, is a simple arrangement (no three 
lines concurrent), since a + 8 + y cannot be an even multiple of 27 if a, B and y 
are odd multiplies of 27. 

Erdés (1962) asked the more general question: how large can t, be, given that 
O= 0,4) =lea2 = °° =t,-1- For k >3 there are no nontrivial upper bounds. For 
example, t,<-/n” is trivial, and not even t,< 5(1—«)n’ is known. For k = 
(cn’’*] one could ask whether t,>c’n'’’/c is always possible, where c’ is 
independent of both n and c. Erdés conjectured that if t,=0, then 2, = 0(n’). 
Perhaps even t,<cn?”. 

Karteszi (1963) constructed an example for all k >3 for which ¢, >c,n logan 
and t,=0 for i>k. See section 5.6 for an explanation of Karteszi’s method. 
Subsequently a better construction was found by Griinbaum (1976); this gives an 
arrangement having t,>c,n*~’“"?). When k=4, it gives t,> cn”, which 
Karteszi had also conjectured. 

The same question can also be asked for arrangements of pseudolines, but the 
problem becomes much easier, as the following argument shows: take the dual of ° 
the Croft-Erddés construction, which will be an arrangement of n lines having at 
least cn?/k° k-fold crossings, and now turn this into an arrangement of pseu- 
dolines such that 0 =¢,,; =¢4, =*'' =4,_1, by the simple expedient of breaking 
up all the intersections of more than k lines. These actually occur on the line at 
infinity, due to the families of parallel lines. 


3.8. Some conjectures 


Let us return to ” points in the plane, not all on a line, and as usual ¢, denotes the 
number of lines incident with exactly i points. In Erdés and Purdy (1978) we find 
some new and perhaps unexpected properties of the family {t,}. Erdés has 
conjectured that for n > no, there is always a k such that ¢, =” — 1. For example, 
t, 2n—1 in the near collinear set. Krier and Straus pointed out counterexamples 
for n =6, 9, and 13. The conjecture is true for n = 25. In fact, max{t,, t;} =a—-1 
for n= 25, 

We also made several conjectures in our paper. We conjectured that if t, =, 
then t=cn’, where t is the total number of lines (unfortunately there is a misprint 
changing this to cn, which is trivial). It follows immediately from Melchior’s 
inequality that 2¢, + 1, 24. Our conjecture can therefore be stated in the stronger 


form: if t, > t,, then t, > cn’. We now have a proof of this conjecture (Erdés and 
Purdy 1990). 


3.9. Dirac’s problem 


Dirac’s problem: let n points be given in R’, not all lying on one line. Draw al] 
the connecting lines, i.e., lines containing two or more of the points. Dirac (1951) 
asked whether there is always one of the n points which is incident with at least 
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in of the connecting lines. He offered the example of two ante CUE, lines with 
the points split as evenly as possible between them to show that jn is best 
possible, if true. He also gave n'” as a trivial lower bound. In Grinbaum (1972, 
p. 25) there is a list of six exceptions to this strict formulation of Dirac’s question. 
If we define r(x) to be the minimum taken over all arrangements of 7 points, then 
Griinbaum gives the upper bounds in table 1 (the third column gives the 
example(s) by their designation in a previous publication Griinbaum 1971). 

Erd6és (1961) asked whether the weaker lower bound r(n) =cn for some c >0 
could be proved. (This question is in fact equivalent to Erdés’s conjecture in 
section 3.2.) Beck (1983), and independently Szemerédi and Totter (1983), 
proved this with very small constants. A result of Clarkson et al. (1990) (see 
section 3.6) leads to a somewhat improved constant. The result also holds for 
pseudolines: given an arrangement of n pseudolines, one of the lines always has at 
least cn different crossing points on it. 

What about higher dimensions? Given n points in R*, not all on a line, it 
follows by projection into the plane that there is a point incident with at least cn 
lines. If the points truly span R? » then Purdy (1981) showed that there is always a 
point incident with at least c a * lines, and the proof works for all simple 
matroids. (See section 4.2 for a discussion of simple matroids.) Purdy also showed 
that if nm points span R’, then one of them is always incident with at least c,n 
planes. The constants c, and c, are close to 1. 


3.10. Directions 


Scott (1970), and independently also R. W. Shannon, asked the following 
question: given n points in the plane, not all on a line, what is the minimum 
number D(n) of directions that the points can determine? If P, Q and R are 
collinear, then they determine one direction; if PQ and RS are parallel, then they 
determine one direction. Scott and Shannon were able to show 


(2n)'? < D(n) <2[4n], 


the upper bound coming from the regular n-gon or the regular n-gon and its 
center, depending on the parity of x. Burton and Purdy (1979) achieved the first 
linear lower bound, showing that, if 2 > 3, 


D(n)2in. 
Table 1 

n U, Example(s) r(2) =U, 
9 4 A,(9) 

15 6 A,(15) 

19 8 A,(19), A;(19) 

25 10 A,(25) 

31 12 A,(G31) 


37 16 A,(37) 
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Ungar (1982) showed that D(m) =2[jn]. It does not happen very often that a 
problem in this subject gets a complete solution. His proof uses a method of 
representing arrangements of pseudolines called allowable sequences, introduced 
by Goodman and Pollack (see section 4.4). 

Grinbaum has a related question about vectors: given # points in the plane, 
not all on a line, what is the minimum number V() of different vectors that are 
determined by the points? Clearly V() = D(m). Ungar’s result and the near- 
collinear set (2 — 1 points collinear) give therefore: 


2[ $2] <V(1) <2n -3. 


Define V'(n) to be the same function under the additional assumption that no 
three points are collinear, so that V'(m) =V(n). Danzer and Griinbaum (1967) 
showed that 


V’(m)<cn™ , where a = (log 3)/(log 2) = 1.58496 ..., 


by means of the following example. For simplicity let n=2* and take the n 
vertices of a cube in R*. There are only 2(3*) different vectors, and this remains 
true if we now project the configuration into the plane in one of the infinitely - 
many directions that will not cause three points to become collinear. J. Pach 
recently made the observation that the same construction shows that h(n) <cn™, 
where A(n) denotes the minimum number of different distances among x points in 
general position in the plane (see section 5.4). 

Jamison (1985) has written a survey of the directions problem, telling the 
history. Using Ungar’s result, Jamison has also proved the curious result that 
points in the plane not all collinear always determine a tree whose edges are line 
segments determined by the points having all different slopes, and he conjectures 
that there is always a path with n — 1 different slopes (Jamison 1987). It might be 
interesting to show that there was always a tree of small depth that covered most 
of the points. For example, the strong form of Dirac’s conjecture (see section 3.9) 
would imply that there was always a tree of depth one (a star) which covered at 
least half of the points. 

We have a related problem: let n red points be given in the plane, not all on a 
line, and form the ¢ connecting lines. What is the minimum number of blue points 
that would be needed, so that each of the ¢ lines has at least one blue point on it? 
Let the minimum number be denoted by f(n). The weak Dirac conjecture, now a 
theorem, states that there is a red point incident with at least cn red lines, and 
these would all need different blue points. Hence f(n) > cn, but with a very poor 
constant. The near-collinear set, with ¢=n, shows that f(n) <n. Clearly f(n) < 
D(n) = 2[3n]. Dean Hickerson constructed an example showing that f(n) <n — 2. 


3.11. Planes and hyperplanes 


An arrangement of hyperplanes in R* or P% is a finite collection of hyperplanes 
not all having a point in common. The hyperplanes divide the space into 
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d-dimensional regions called d-cells, and they also divide each k-dimensional flat 
into k-dimensional regions called k-cells. 

Let n points be given in R® such that no A,v lie in a hyperplane. Then Beck 
(1983) showed that the points determine at least cn“ hyperplanes. 

Let nm points and ¢ lines be given in the plane, without any special relation to 
each other. As mentioned previously, Szemerédi and Trotter showed that the 
number of incidences [(n,f) between them satisfies I(n,t)<cn’*r’’?. Many 
results quoted in this chapter follow from their inequality. 

It would be nice to find a three-dimensional version. If no three points are 
collinear, then it is not hard to show that the number of incidences between 7 
planes and n points is less than c(log n)n*'*7*"*. The factor log n can probably be 
eliminated. An example due to Noga Alon shows that the inequality is not true if 
we remove the restriction on collinear points. 

Edelsbrunner et al. (1990) proved that the number of incidences is 


O(n’? log 7) if the n points are points of intersection formed by the a planes. 


4. Other geometries 


4.1. Complex space 


Every nondegenerate homogeneous cubic equation with real coefficients can be 
put in the form 


xty+z>t+exyz=0, 


by a real homogeneous substitution. The nine points of inflection are then given 
by the scheme 


(0, -1,1), (0, —w",w), (0, —@, w’), 
(1,0,-1),  (w,0,-w’), = (w’, 0, -w), 
(-1, 1,0), (-w’, o, 0), (-oa, w’,0), 


where w is a primitive cubic root of unity. 

Three points of inflection are collinear if (a) they lie in the same row of the 
scheme, (b) they lie in the same column, or (c) no two of the three points lie 
either in the same row or in the same column. For more details see, e.g., Hilton 
(1932, chapter 14). 

The nine points of inflection of the general cubic have the interesting property 
that a line through any two of them contains a third. Presumably it was this 
example which prompted Sylvester to pose his now famous problem. 

At this stage the reader might reasonably conjecture that if n is sufficiently 
large, then there is always a Gaillai line. Such an expectation is contradicted by 
the following example in Motzkin (1951). 

For any n = 3, let w be a primitive complex nth root of unity. Let the 3n points 
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P;, Q;, and R, have the following homogeneous coordinates: 


P{w',-1,0) forlsi<n, 
Q (0, @’, -1) forl<j<n, 
R,(-1,0,*) forl<k<n. 


The three points P,, Q;, and R, are collinear if and only if n divides i+j+k 
exactly. This may be seen by observing that the three by three determinant 
formed from their coordinates, which is equal to —1+ wi*!** is zero precisely 
when i + j +k is a multiple of n. Hence every line formed from a P, and a Q, will 
have an R, on it, and so on. Of course P,,..., P, also lie on a line together, as 
do the Q’s and the R’s. Hence there are no Gallai lines, so long as n is at least 3. 


4.1.1. Some results from algebraic geometry 

Hirzebruch (1983) studied arrangements of lines in the complex projective plane 
by associating with each arrangement some algebraic surfaces, calculating their 
Chern numbers, and then applying the Miyaoka—Yau inequality (Miyaoka 1977, 
Yau 1977). Let t; denote the number of points incident with exactly i of the lines. . 
If n is the number of lines and t,=¢,_, =0, then the Miyaoka—-Yau inequality 
implies that 


a-l 


ttt, 2n +t, + 2t, + 3t, + 


Hirzebruch (1984) obtains the following stronger result: if 4, =¢,_, =¢,-2=9, 
then 


t, +t, 2nt t, + 3t, + St, + 


If we are given n points in the complex plane C’, with at most n — 2 of them on 
any line, and if ¢, is the number of lines containing exactly i of the points, then by 
the usual duality argument applied in the complex projective plane the weaker 
Hirzebruch inequality must hold. If we make the stronger assumption that at most 
n—3 of the points are collinear, then the stronger inequality holds. 

If the numbers #, arise from real points P,(x;, y;), then the same ¢, will result if 
the points P, are situated in C’. This means that Hirzebruch’s inequalities must 
also hold in R’. Erdés asks for an independent elementary proof of these 
inequalities in R’, since they do not seem to follow from Melchior’s inequality. 
Purdy asks what Hirzebruch’s methods would give if it were assumed that 
t, = boy Serre =O, 

4.1.2. A problem of Serre 

No one has ever observed a finite set of points spanning C’ that do not determine 
a Gallai line, and Serre (1966) asked whether such a set can exist. Kelly (1986) 
answered this question by showing that indeed n points spanning C” always 
determine a Gallai line. Here is a sketch of his proof, which uses Hirzebruch’s 
weaker inequality: let S be a finite set of points spanning R? without a Gallai line, 
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and let PES. The lines through P and other points of S intersect the plane 7 at 
infinity in a set of points S*. If the points of S* formed a Gallai line, this would 
imply the existence of a Gallai line in the original configuration S$. Hence ¢, =0 
for S*, and t, >0 even by Hirzebruch’s weaker inequality. The 3-point line in the 
plane 7 corresponds to three lines through P lying in a plane 2‘ and containing a 
smaller configuration of points also lacking a Gallai line. This very special 
configuration, consisting of a finite set of points confined to three lines in a plane, 
and the three lines passing through one of the points, and every two points 
collinear with a third point can be used to define a finite additive subgroup of the 
underlying field. This is absurd, since the field has characteristic zero. 


4.2. Sets of points as matroids 


4.2.1. The Erdés—de Bruijn theorem 

Erdés and de Bruijn proved the following general theorem (de Bruijn and Erdés 
1948): if S,,...,5,, are m>1 subsets of an n-element set S such that every pair 
of elements of S is in precisely one S,, then m =n. Moreover, equality can only 
occur if m — 1 elements are in one S; and two elements are in all the other S,, or if 
n=1+k(k—1) and |S,|=---=|S,,| =, and each element occurs in exactly k of 
the sets. The second example is called a finite projective plane. 


4.2.2. Kelly's conjecture 

Consider the following situation: let L,,..., L, be n pairwise skew lines in R* 
and let H,,...,H,, be the hyperplanes spanned by them. If S, is the set of lines 
L, contained in H,, then the Erdés—de Bruijn theorem applies and gives m > n. 
On the other hand, Kelly observed that the corresponding Sylvester—Gallai 
theorem is not true, i.e., there is not always a hyperplane incident with only two 
of the given lines, Kelly defines an S—G configuration (Sylvester—Gallai) to be a 
set of n points, for some n, not all collinear, such that the line joining any two 
contains at least one other (Kelly 1986), and he observed that the obvious 
one-to-one mapping between points of C’ and points of R*, together with duality 
in projective C** and R** provides a method of converting any S—G configuration 
in C’ into an analogous line arrangement in R*. Thus, if for some n there are n 
points in C’ not all on a line such that the line spanned by any two of them 
contains a third, then there are n pairwise skew lines in R* such that the 
hyperplane spanned by any two of them contains a third. There are several S-G 
configurations discussed in section 4.1 and they all produce analogous examples of 
pairwise skew lines. Kelly (1986) also conjectures, with a misprint that is 
corrected in Boros et al. (1989), that there is always a hyperplane incident with 
either two or three of the lines, and points out that if this were proved without 
using the Hirzebruch inequalities, then it would give an independent proof of the 
Sylvester—Gallai theorem (see section 4.1) for m points spanning C’ — the reverse 
implications is not true. Unfortunately, Kelly’s conjecture was disproved by 
Bokowski and Richter. 
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4.2.3. Simple matroids 
For the rudiments of matroid theory we refer the reader to chapter 9 or the book 
Welsh (1976). 

Given n points in R’ or C% (or the corresponding projective spaces, oF 
analogues over finite fields), we can define a simple matroid on these points in the 
following manner: if a subset S of these points lies in a k-dimensional flat, but not 
a (k — 1)-dimensional fiat, then S has rank &. In this matroid, the Ath Whitney 
number W, is the number of closed sets of rank & in the matroid, which will be 
the number of flats of dimension k — 1 that are determined by the 7 points. It 
follows from the Erdés—de Bruijn theorem that for any simple matroid on n 
elements W, =n. It is also known that the number of hyperplanes W,_, is at least 
n, see Welsh (1976). This is the generalization of Motzkin’s result that the number 
of hyperplanes determined by n points spanning R° is at least n (see section 2.3). 

The Sylvester—Gallai theorem does not hold in the context of simple matroids. 
For example, the projective plane over Z,, which forms a simple matroid of rank 
3 on n=p’+p-+1 points, has exactly n “lines, each containing p+ 1 points. 


4.3. The geometric lattice of flats 


Given a finite number of hyperplanes in P“ they form, together with their 
intersections, a geometric lattice ordered by reverse inclusion. The corresponding 
MGbius function becomes a useful tool for studying arrangements as the 
dimension increases. Hyperplanes in R* only form a semilattice, since two 
hyperplanes might be parallel. 

The flats of a simple matroid, when ordered by reverse inclusion, form a 
geometric lattice. All geometric lattices can be achieved in this way. 


4.3.1. “Facing up to arrangements” 
In his “Facing up to arrangements”, Zaslavsky (1975) examined the geometric 
lattice of intersections of the flats ordered by reverse inclusion and derived 
formulas for the number of k-dimensional cells involving the standard lattice 
functions, such as the Mébius function p(x, y). 

For example, he shows that the number of regions determined by an arrange- 
ment A of hyperplanes in real projective space P“ is c(A) = +(- 1)" (1), 
where y(y) is the characteristic polynomial 


x(V) =D p(0, hy" -*O | 


r(A) is the rank of the lattice of intersections associated with A, and k(f) is the . 
length of the longest chain in the lattice below ¢. He also derives formulas for the 
number of bounded and unbounded regions in R*. 

In addition to the application of his formulas that we will discuss below, they 
are used in the proof of a generalization of a theorem of Hadwiger due to 
Goodman and Pollack (1988), see section 6.5. The formulas also imply that the 
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number of cells of an arrangement depends only on the lattice of intersections of 
the flats. 

For a more geometrical treatment of formulas for the cell numbers and fiat 
numbers, albeit limited to spaces of three of fewer dimensions, see Alexanderson 
and Wetzel (1981). 


4.3.2. Slimmest arrangements of hyperplanes 

In his book Convex Polytopes, Grinbaum (1967) conjectured that the smallest 
possible number of k-cells of a projective d-arrangement of n hyperplanes is that 
achieved by the near pencil, consisting of n — d+ 1 hyperplanes concurrent in a 
coline and d — 1 more hyperplanes in general position to make the whole thing a 
d-arrangement. Shannon (1976) proved this conjecture, characterized all of the 
minimal arrangements, and showed that the same configuration also minimizes 
the number of &-flats. Independently of Shannon's solution, Zaslavsky had 
noticed that he could prove Griinbaum’s conjecture by using the formulas in 
Zaslavsky (1975) and the following result of Dowling and Wilson (1974): the 
minimum value of |w,| over all geometric lattices with rank r>4 and n atoms 
occurs when and only when the lattice is isomorphic to the direct product of a line 
and a free geometry. Here |w,| is the absolute value of w,, which is the Ath 
Whitney number of the first kind. The W, are sometimes called Whitney numbers 
of the second kind instead of merely Whitney numbers. The minimum of W, is 
also determined in Dowling and Wilson (1974), and this implies Shannon’s result 
concerning the minimum number of k-flats. 

Zaslavsky (1985) develops the necessary combinatorial machinery so that he 
can settle the analogue of Griinbaum’s conjecture for arrangements of hy- 
perplanes in R*. He shows that the minimum number of k-flats and k-cells is 
attained when the configuration is a Euclidean near pencil, which is more 
complicated then a projective near pencil. 


4.3.3. Back to P* 
Makai and Martini have determined the minimum number of d-cells when the 
hyperplanes are assumed to be in general position. They showed (Makai and 
Martini 1989) the following. Let there be given an arrangement of n hyperplanes 
in real projective d-space, where no three of them have a (d — 2)-dimensional 
intersection. Then the number of projective d-cells is smallest when the n 
hyperplanes are arranged as follows. Consider in a 2-flat an arrangement of 
n—d+2 lines L,,...,L, 4,2, no three of them passing through one point. Say 
this 2-flat is spanned by the origin O and the first two unit vectors e, and e,. Take 
now the third unit vector e, and consider all 2-flats spanned by L’s and e,, and the 
one spanned by O, e,, and e,. Then n — d +3 2-flats are obtained. Consider the 
3-flats spanned by them and the fourth unit vector e,, and the one spanned by O, 
€,, €2, and e,. Repeat this until m (d — 1)-flats are constructed. 

Perhaps more general results on geometric lattices could be proved which 
would imply the above result of Makai and Martini. 
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4.3.4. Matroids minimizing i, 
At the end of a paper containing more significant results, Elliott (1967) gave a 
simple proof that if 1 points are given in the plane, then the number of distinct 
noncollinear triples that they determine is smallest when n — 1 of the points are 
collinear. Purdy (1982a) generalized this result to matroids, showing that the 
number i, of independent sets of size X on a simple matroid of rank d+ 1 onn 
points is a minimum when n—d +1 points lie on one line and the other d-1 
points are in general position, with a similar result for arbitrary matroids. This 
result was simultaneously proved by Bjérner (1980b), who applied some of his 
results in Bjorner (1980a) on shellable posets to prove the more general result 
that if a matroid has no circuits with fewer than c elements, then i, is minimized if 
as many as possible of the points lie in a flat of rank c, and the rest are in general 
position (see chapter 34). 

BjOrner’s result for c =3 implies that for n points in R*, with no three points 
collinear, the number of triangles i, is minimized if all but one of the points are 
on a plane in general position. The same configuration minimizes i,. 


4.3.5. The unimodality of W, and i, 
G. C. Rota conjectured that the Whitney numbers are unimodal. That is, - 
W, > min{W,, W,} if i<j <k. Welsh (1976) conjectures that the Whitney numbers 
are log concave, i.e., W7>W,_,W,,,. For simple matroids of rank four, this 
would say that Pr 2=nn, where t is the number of lines, n is the number of points, 
and 7 is the number of planes. Mason (1972) has conjectured that t? > 3n(n — 
1)a/2(m — 2), with equality only when every plane contains exactly three points, 
and this has come to be known as the ‘“‘points—lines—planes conjecture”. Seymour 
(1982) proved it in the special case that no five points are collinear. His paper also 
contains a very simple proof of the weaker t’ 2 nz if no 4 points are collinear. Of 
course, both of these results apply to the simple matroid formed by 7 points 
spanning R°. In that special context Purdy (1986) showed that 1? = cnz, without 
any other restrictions. 
Mason (1972) also conjectured that 


2 . . 
i, 2t,iiy,, for O<k<n, 


and this was proved by Dowling (1980) for k <7. 


4.4. Allowable sequences 


The allowable n-sequences of Goodman and Pollack are very closely related to 
arrangements of pseudolines in the real projective plane. We give here a brief 
account, and we refer the reader to the papers, e.g., the expository paper 
(Goodman and Pollack 1981), for more details. . 

Let P,,...,P, be n points in the plane, and let L be a line. If L is 
perpendicular to the line P,P,, then the orthogonal projections of P, and P, onto L 
will coincide. If L is not perpendicular to any of the lines P,P,, 1<i<j=< < n, then 
the projected points will be distinct and will occur in some order on L, defining a 
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permutation of the indices 1, 2,...,. If L rotates counterclockwise, the 
permutation changes every time L passes through a direction perpendicular to a 
line of the form P;P,, When L has turned through an angle 7, the permutation 


will be n, n—1,...,3, 2, 1. The line L is to be imagined rotating forever, 
producing a periodic sequence of permutations I7,, 77,,.... 
An example of a sequence arising in this way is ..., 1(23)4(56), 13(246)5, 


(136)425, 63(14)25, 634(125), 6(345)21, (65)4(23)1, .... We have put parenthe- 
ses around the blocks that are to be reversed in the next move. 

Goodman and Pollack noticed that the infinite periodic sequence of permuta- 
tions f1,, f1,,... arising in this way has the following properties: 

(1) The move from IJ, to IT,,, consists of reversing one or more disjoint 
blocks of indices. 

(2) If (é, 7) and (r,s) are two pairs of indices, then the pair (r,s) will reverse 
exactly once between every two reversals of (i, j), unless (r,s) and (i, j) are 
reversed together, in which case they will always be reversed together. 

Any periodic sequence & of permutations of 1, 2,..., satisfying (1) and (2) 
are called by Goodman and Pollack an allowable sequence of permutations, or an 
allowable n-sequence. One of their properties is that permutations IT and II' a 
half-period apart are the reverses of each other. 

Goodman and Pollack noticed that one could determine several geometric 
properties of the points P,,...,P, by applying rules to the allowable n-se- 
quences: 

(a) If P, is on the line P,P,, then the indices i, j and k will be reversed together 
in the same block, and not otherwise. 

(b) If the two lines P,P, and P,P, are parallel, then the pairs (r, s) and (i, j) will 
be reversed in different blocks, but at the same time, i.e., in the same move from 
HH, to II, ,,, and conversely. 

They also noticed that two properties involving convexity could be determined 
from the allowable n-sequences. Let C be the convex closure of P,,..., P,. Then 

(c) P, is an extreme point of C if and only if i is the first index in some term of 
> (and therefore the last in some other term). 

(d) P, is in the convex closure of P; ,...,P;, if and only if there is no term of 
& in which i precedes all of i,,...,i,. 

An allowable sequence can also be associated with an arrangement A of lines 
{L,,...,L,} in the real projective plane. Consider the intersection points to be 
in the finite part of the plane, and choose a direction to be horizontal which is not 
parallel to any of the lines. Let L be a horizontal line just above the highest 
intersection point, and write down the order (left to right) in which the lines L, 
intersect L, and write down the different orders as L moves down parallel to 
itself. Whenever L passes an intersection point P the lines going into P will come 
out in the reverse order. When L has passed the bottom intersection point, one 
half-period of an allowable sequence 2 will have been traversed, and this is 
enough to define 3. 

It is easy to reverse the correspondence and assign an arrangement of 
pseudolines to every allowable sequence, getting a sort of “wiring diagram” with 
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wires crossing whenever blocks are reversed. An example of an allowable 
sequence that is not realizable with straight lines (and therefore by duality not 
realizable by points either) is ..., (12)345, 21(345), 214(35), 2(14)53, (24)(15)3, 
4(25)(13), (45)(23)1, 54321,.... 

Goodman and Pollack (1984b) show that one can determine whether two 
arrangements of pseudolines A and A’ are isomorphic by looking at their 
associated allowable n-sequences. They also proved Helly’s theorem, Radon’s 
theorem, and Caratheodory’s theorem, and some others in the context of 
allowable sequences (Goodman and Pollack 1980b,c). Peter Ungar completely 
solved the problem on the minimum number of directions determined by 7 points 
by using allowable sequences (see section 3.10). 


5. Metric problems 


5.1. Repeated distances 


Let d,(n) be the maximum number of times that the distance 1 can occur among n 
points in R*. In 1946, Erdés showed that 


/log t } 
n*? og 8" < di(n)<cn® 2 


and conjectured that the lower bound was the correct order of magnitude (Erdés 
1946). 

The lower bound is given by an m by m portion of the integer lattice, where m 
is approximately n'’?, and the upper bound can be proved (although it was not in 
the original paper) by the following graph argument: 

Let G be the unit-distance graph, i.e., the graph whose vertices are the n 
points, joined by an edge if and only if they are at distance 1. Then G cannot 
contain a complete bipartite Kuratowski K,, subgraph, since this would require 
that there exist points X,, X,, Y,, Y,, Y, in the plane such that the six distances 
d(X,, Y,)= 1, which is impossible in the plane. 

A theorem on extremal graph theory by K6vari et al. (1954) states that such a 
graph can have at most cn*’” edges. The general graph theorem is that a graph G 
not containing a Kuratowski K, , subgraph with r<s has at most C,n?"'"” edges, 
where C, is a positive constant depending only on s; see chapter 23. 

Erdos also conjectured that if X,,...,X, are any distinct points in the plane 
there is always an X, that has fewer than niliee len points equidistant from it. The 
lattice points show that if true this is best possible. More generally, denote by 
h(X;) the largest number of points equidistant from X,. Perhaps 


h(X,) fesay h(X,) <niteflos loga : 
In 1975, Jozsa and Szemerédi showed that d,(n) =0(n*’”). Beck and Spencer 


(1984) improved this to d,(n)<n'?””, and Spencer et al. (1984) improved this to 
d,(n)<cn"™. 
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Erd6s (1960) showed that in R’, 


ni? 5/3 


<d,(n)<cn 

The upper bound can be proved by a similar graph argument, although it was 
proved differently in the original paper: the unit-distance graph G cannot have a 
complete bipartite Kuratowski K, , subgraph, and so G has at most cn*’* edges by 
the same result of K6vari et al. (1954). 

The lower bound comes as follows: take the points (x, y, z) such that O< x, y, 
z<n'*. If d is a distance that does occur, then d” is an integer of the form 
u?+v?+w’, where O<u, v, w<n'”, Hence d? <3n?”*. Since every pair formed 
from the n points must have a distance, clearly some distance must occur at least 

n> times. Erdés (1960) points out that the lower bound can be improved by a 
factor of log logm using deep number-theoretic methods. 

Beck (1983) improved the upper bound, so that 


cn*” log log n<d;(n)<n°?*, wherec>0. 


In the same paper, Beck mentioned that he can show that 


d,(n)<n'?/*** forn>ng(e) (13/8 = 1.625) . 


8/5 


Chung (1989) improved this to d,(n)<cn"”. She achieved this by showin that 
. number oF incidences: between A points and t spheres is at most c(n*/°r4’> + 
n'’? + n), provided no three of the spheres are collinear. She has Sei this 
ail to higher-dimensional spheres. (See also section 5.10.) 
In R* things change drastically, thanks to a construction due to H. Lenz: let 


P.=(u;,v,;,0,0) forl<i<[}n] 
and 
Q,=(0,0,u,,0,) forl<j<[jn], 


where (u,, u,) are distinct solutions of the equation uty = +. If necessary, the 
points P, can be placed arbitrarily close together, as can the points Q;. Every 
distance d(P,, Q;) is unity. The construction shows that d,(n) =[jn 1. 

For k = 4, Erdés (1960) showed that as n tends to infinity 


d,(n) =1),)° — n?{2[1Lk]} 7! +o(n’). 


Here is a rough idea of the proof for even dimensions. The implied lower bound 
is obtained by a generalization of the Lenz example. To see the upper bound, let 
a configuration of n points in R’” be given and construct the graph G with these 
points as vertices and joined by an edge if they are at distance 1. By elementary 
geometry you cannot have sets S,,...,5,,,, of three points each such that the 
nine distances between the points of S, and the points of S, are unity for every i 
and j, since this would force the existence of m+ 1 mutually orthogonal planes, 
an impossibility in R*”. It is not even possible if S, has only two elements. 
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Erdés and Stone (1946) then have a result stating that such a graph G cannot 
have more than the requisite number of edges. Specifically they showed that every 
graph on v vertices having at least 


1 
iy? (1 -3) +n? 


edges contains a complete (d + 1)-partite graph with vertices in each part for any 
fixed ¢. Subsequent work by Bollobas, Erdés, Simonovits, Szemerédi, and ChvAtal 
has shown the result to be true even if ¢ increases with n, provided t<c’' logn, 
where c’ depends on c. We refer the reader to chapter 2.3. 

Moser (1966) conjectured that among 7 points on the unit sphere in R’, no 
distance D can occur more than cn times. This was recently disproved in (Erdés 
et al. 1990a), where it is shown that for any distance D, 0< D <2, there are n 
points on the unit sphere such that each one is at distance D from at least c log*n 
others. Here log*m denotes the minimum integer r such that, starting with n, if 
one iterates the logarithm function r times one obtains a value <1. This result 
implies that the distance D occurs at least 3c(n — 1)log*n times. 


5.1.1. Nearly equal distances 
Erdés et al. (1990b) prove that for a sufficiently large configuration of n points in 
the plane with minimum distance at least 1, the number of pairs whose distance is 
between ¢ and ¢ + 1 cannot exceed [17’] for any real t. Their method generalizes 
readily to higher dimensions. 


5.2. Maximum and minimum distances 


5.2.1. Large distances 

Let M,(n) be the maximum number of times that the maximum distance can 
occur between n points in R*. Pannwitz and Hopf (1934) and Sutherland (1935) 
showed that M,(n) =n. A. Vaszonyi conjectured that /,(”) = 2n — 2, and this was 
proved independently by Grinbaum, Heppes, and Straszewicz. Lenz’s construc- 
tion done carefully shows that M,(n) =[1n’]. (The points on each circle must be 
kept close together.) 

For which sets does equality occur? Let us call a set of n points in R* (n, d)-best 
if it has diameter 1 and the distance 1 occurs M,() times. In R’ the (7, 2)-best 
sets consist of points on the boundary of a Reuleaux m-gon. Neaderhouser and 
Purdy (1982) give a proof of this, and two infinite families of (7, 3)-best sets. It 
seems to be a difficult problem to characterize the (n,3)-best sets. 

Let n, denote the maximum number of times that the kth largest distance _ 
occurs among n points in the plane. Then 1, = M,(n). Vesztergombi (1985) has 
shown that 1, + 2n,<3n and n,<n+n,. In the same paper she showed that 
n, <kn. 

Let d, >d,>d,>--- denote the different distances occurring among n points 
in the plane. Consider the graph G, whose vertices are the n points, and two are 
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joined by an edge if their distance is at least d,. Erd6s et al. (1989) show that the 
chromatic number x(G,) of G, is at most 7 if n=ck’, and 7 is best possible. 


5.2.2. Farthest neighbors 

Recently, Avis et al. (1988) have studied an interesting problem called the 
farthest neighbor problem. Given a set S= {X,,...,X,} of n points in R*, for 
each point X; let f(X,) be the number of points of S at maximal distance from X,, 
the farthest neighbors of X,. Then define F,(n) to be the maximum of f(X,) + - 

+ f(X,) over all possible sets 5S in R*. The problem i is to estimate F,(n). 

In 1984, Avis showed that F,(n) = 3n — 3 if n is even (Avis 1984). Edelsbrunner 
and Skiena (1989) have shown that F,(n)=3n-—4 if n is odd. For k24 
preliminary results suggest that the bounds for F,(n) are similar to those for d,(n), 
where the generalized Lenz construction provides the maximum. 

However, F,(n) achieves its maximum value on an entirely different set: divide 
the 7 points evenly between a circle C and a line L through the center of C and 
orthogonal to it. They prove: 


dn? +13n<F,(n)<1in?+3n+255. 


8/5 


Chung (1989) showed that the number of farthest neighbors is at most cn" if no 


three points are collinear. 


5.2.3. Minimum distance 

Let m,() be the maximum number of times that the minimum distance can occur 
between 7 points in R*. That m,(n) <3n follows from the observation that six 
equal circular discs may be placed around another disc of the same size so that the 
central one is touched by all others but no two overlap and that it is not possible 
to place seven discs in this way. Erdés showed that 


3n — ¢,n''? <m,(n)<3n-c,n'? 


and conjectured that 
m,(3n’ +3n+1)=9n? + 6n, 


which comes from considering a large hexagonal chunk of a hexagonal packing of 
circles. Harborth (1974) proved this striking conjecture and showed that 


m,(n) = [3n — (12m - 3)'7] foralln. 


In an analogous three-dimensional situation around a given ball it is possible to 
place 12 balls of equal size, all touching the first one but not overlapping it or 
each other. It appears that with judicious positioning it might be possible to 
squeeze in a 13th sphere. This dilemma between 12 and 13 was a subject of a 
long-standing argument between Sir Isaac Newton and David Gregory starting in 
1694. Newton believed 12 to be correct, but this was not proved until 180 years 
later. Among Gregory’s unpublished papers H. W. Turnbull found notes dated 
1694 of an actual conversation that took place between the two men. They had 
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been discussing the distribution of stars of different magnitudes when the problem 
came up. It was R. Hoppe who eventually proved that the 52-year-old Newton 
had been right —- see Bender (1874). There have been several proofs since then, of 
which the simplest is probably that due to Leech (1956). 

Hoppe’s result gives m,(n) <6n, and in fact m,(n) = 6n + O(n’’*). For d>3 
only two ‘touching numbers” r, are known exactly (7, = 240 and r,, = 196 560). 
For more details see Coxeter’s (1963) survey. See also Fejes-Toth’s excellent and 
more recent survey on packing and covering (Fejes-Toth 1983). 


5.3. Different distances 


A related but different question is to ask: given n points in R“, what is the 
minimum number of different distances that must occur. Let D,(n) denote the 
minimum taken over all configurations of n points in R*. The first result was due 
to Erdos: 


cn'!? <D,(n) <cn/(logn)'’? . 


The lower bound can be proved as follows. Let P be one of the n points. If there 
are at least (n — 1)'”” distances from the point P, then the result follows. We may 
therefore assume that there are not. Then there must be a circle around P 
containing m2>(n-1)'” points. Let Q be one of those points, and let 
Q,,---,Q,,-, be the others. There must be at least } (rm — 1) different distances 
d(Q, Q;), and the result follows. 

The upper bound is obtained from an m by m portion of the integer lattice, 
where m’<n<(m+1)’. If d isa distance occurring between two of the points, 
then d’ is an integer of the form u” + v”, and d? does not extend 2n. It is an old 
result of Landau (1909) that the number of such integers is less than cn/(log n)'’”, 
and the result follows. 

Leo Moser improved the lower bound to cn*’” (Moser 1952), showing that in 
fact there is always one point from which there are at least that many distances (as 
the Erdés }:roof above also shows that cn'’” distances occur from one point). In 
1982, Chung showed that D,(n)>cn°’’ (Chung 1984), but not necessarily from 
one point. Beck (1987) was able to show that there is always a point from which 
there are more than n’’*** different distances. In fact, he showed that if a > n,(e) 
then there is always a point from which there are at least n°*’*'~* different 
distances. Since 58/81 >5/7, this is also an improvement on Chung’s result. 
Chung, Szemerédi and Trotter have recently improved the result to cn*’° 

Erdés (1957, 1970) asked the following question: what is the smallest number 
f(k, d) so that f(k, d) points in R“ always contain a set of k points whose distances 
are all different? He conjectured that f(k, 1) = {1 + o(1)}k’, and pointed out that 
a result in Erdés and Turan (1941) showed that f(k, 1) = {1+ o(1)}k?. 

Erdés and Guy (1970) show that out of the n’ points on the n by n integer 
lattice, a set of k points exists all of whose distances are distinct. Here k >n?’> ‘ 
for any e >0, for all n >n(e), 
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5.3.1. Two-distance sets 
A set of n points in R? is a two-distance set if there are two distinct distances a 
and § such that the distance between any pair of points in the set is either a or 8B 
and both distances are realized. Kelly (1947) posed the problem of determining 
f(d), the largest possible cardinality of a two-distance set in R*, and he showed 
that f(2) =5, realized by the regular pentagon. Croft (1962) shows that f(3) = 6, 
realized by the mid-points of the edges of a regular tetrahedron. The }d(d + 1) 
mid-points of the edges of a teular simplex show that f(d) =4d(d + 1). Larman 
et al. (1977) show that f(d@) <4(d+1)(d +4). They also show that f(d)<2d + 3 
unless a”/B” is the ratio of two consecutive integers. They use a result by Wilson 
(1975) on the existence of balanced incomplete block designs to show that 
f(a) >4d(d + 1)[1 — 4/d] if d is sufficiently large. Their construction gives a = 1 
and B =2'’”. Harborth and Piepmeyer (1990) show that if a=1 and B> {2+ 
3'/2)12 then the set has at most d +1 points. See also section 5.11. Blokhuis 
(1984) modified the proof in Larman et al. (1977) to show f(d) <4(d + 1)(d + 2). 
Let f(s, d) be the largest possible cardinality of an s-distance set in R°. Bannai 
and Bannai (1981) showed that 


d+s (° +s- : 
fs,dy<(***)4 (4804). 
Subsequently, the Bannais and Dennis Stanton (Bannai et al. 1983) eliminated 
the second term, showing: 


fls, ay<(***), 


5.4. General position 


Many of the questions about the number of distances, etc. can be asked with the 
restriction of general position: no three points collinear, and sometimes also no 
four points on a circle. 

Let h(n) be the minimum number of distances determined by 7 points in the 
plane if not three points are collinear. Perhaps A(n)/n tends to infinity as n tends 
to infinity, but we cannot even show that for n>), h(n) =n. 

Erdés had asked many times whether h(d) = 0(d’), and recently J. Pach has 
proved this by showing that 


h(n)<cn* , where a = (log 2)/log 3 = 1.58496... 


The example he used to obtain this result is in fact the same one that Danzer 
and Grinbaum (1967) had used (see section 3.10) to obtain an upper bound on 
the number of different vectors V’(m) determined by n points in general position. 
The Danzer-Griinbaum result implies Pach’s result, since 2h(n)<V'(n)<cn", 
but we had not noticed this fact. 

Let there be given 7 points in the plane which form a convex polygon. Erdés 
conjectured that these points determine at least [47] distinct distances. The 
regular polygon shows that if true this is best possible. Altman (1963) proved this. 
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See also Altman (1972). Erdés also conjectured that there is always one point so 
that the number of distinct distances from this point is also [4], and this is still 
open. Erdés also conjectured that there is one point that has no three points 
equidistant from it. L. Danzer constructed a convex 9-gon disproving this. The 
example is now published in Erd6és (1985b). Erdés then conjectured that if 
X,,...,X,, are the vertices of a convex polygon, then there is always an X; that 
has no four vertices equidistant from it. This has neither been proved nor 
disproved. 

Let d,>d,>d,>--- denote the different distances occurring among the 
vertices of a convex n-gon. Consider the graph G, whose vertices are the n 
points, and two are joined by an edge if their distance is at least d,. Erdds et al. 
(1989) show that the chromatic number x(G,) of G, is at most 3 if n = ck’. Even 
if m is not assumed to be large, G, has a vertex of degree at most 3k — 1 and 
therefore y(G,) < 3k. They eonecture that y(G,) < 2k + 1. The regular (2k + 1)- 
gon would show that this is best possible. It is possible that G, always has a vertex 
of degree <2k. 

The conjecture that y(G,) <2k + 1 would imply Altman’s result that there are 
at least [57] different distances as a special case. To see this, let k be the number 
of different distances, so that G, is the complete graph and ¢ = y(G,) < 2k + 1. 
Then k =4(n—1), which implies k >[}5n], since k is an integer. 

Erd6s once conjectured that if X,,...,X, are n points that determine only 
o(m) distinct distances, then there always is a line which for n > n)(k) contains at 
least kK points. Szemerédi proved this and then went on to conjecture that if 
X,,...,X, is such that no three of the points are on a line, then X,,...,X, 
determine at least [47] distinct distances, i.e., Altman’s theorem has nothing to 
do with convexity (but only no three on a line). Unfortunately Szemerédi’s proof 
only gives [5] instead of [4] and his conjecture remains open. For the proof, 
see Erdos (1975). 

In Erdés and Purdy (1976) we used Karteszi’s method to show the following. 
Let f(n) denote the maximum number of times that unit distance can occur among 
n points in the plane if no three points lie on a line. Then f(7) = cn log n, where c 
is approximately 2{3 log 3}~'. See section 5.6 or Karteszi (1963) for an explana- 
tion of Karteszi’s method. 


5.5. Borsuk’s problem 


In 1932, Borsuk proved the following result: if the d-dimensional solid ball of 
diameter 1 is partitioned into d parts, then at least one of the parts has diameter 
1. This is a consequence of the fact that if the sphere (the boundary of the solid 
ball) is covered by d closed sets, then at least one of the sets contains a pair of 
antipodal (i.e., diametrically opposite) points of the sphere. 

The following year, Borsuk published his notorious problem: prove (or 
disprove) it is possible to partition every bounded set in R% into d +1 sets of 
smaller diameter. Clearly it is enough to consider closed sets of diameter 1. Since 
the convex hull of a set has the same diameter as the set itself, it would also be 
sufficient to solve the problem for closed convex sets. 


Extremal problems in combinatorial geometry 841 


In its generality, the problem has only been solved for d=2 and d=3. The 
most geometrically satisfying solution for the plane is probably the follow- 
ing. Every set of diameter 1 can be covered by a regular hexagon of edge length 
1/3 and the hexagon can be split into three pieces of diameter 1V3 = 0.866.... 
There is a similar proof in three dimensions using a polyhedron obtained by a 
triple truncation of the regular octahedron. No one has found such a covering for 
sets in R*. 

For d=4 only special cases of Borsuk’s problem have been solved. See 
Griinbaum’s (1963) survey article for details and references. See also the survey 
article in the same volume by Danzer et al. (1963) on Helly’s theorem and related 
questions. 

The finite version of Borsuk’s problem is particularly tantalizing: given n points 
in R‘, can they always be partitioned into d + 1 sets of smaller diameter? In other 
words, we assume that the given set in Borsuk’s problem is finite. Whereas a 
negative answer to this question would of course imply a negative one for the 
original question, the reverse implication has never been established. 

A simple induction argument shows that the finite Borsuk’s problem has an 
affirmative answer in the plane. Let n points be given in R’. The result is true if 
n=3. Let n>3 and suppose the result is true for n—1. Since M,(n)=n, it 
follows that the unit-distance graph G has at most n edges, so the average degree 
is <2. Hence there is a vertex of degree <2. That is, there is a point P which is 
not at distance 1 from more than two other vertices. With P removed, the 
remaining n — 1 points can be divided into three sets of diameter less than 1, by 
the induction hypothesis. Put P in one of the three sets that does not contain any 
points at distance 1 from it. The result follows by induction. 

A similar proof works for d= 3. Since M,(n) = 2n —-2, G has at most 2n —2 
edges and average degree less than 4. Hence there is a vertex of degree <3 which 
will be a point having at most three points at distance 1 from it, and by induction 
the points can be divided into four sets of smaller diameter. 

This approach fails at d= 4, since M,(n) =[4n’], and the problem is still open 
for d= 4. 


5.6. The Hadwiger—Nelson problem 


The problem of how many colors it takes to color the plane so that no two points 
of the same color are distance 1 apart has stimulated work on the chromatic 
number of unit-distance graphs and the related work on the dimension and 
faithful dimension of a graph. 

More generally, E. Nelson and J. R. Isbell, and independently P. Erdés and H. 
Hadwiger, asked for the smallest number A(d) such that the points of R* can be 
colored by A(d) colors so that no two points of the same color are distance 1 
apart. Thus #(d) = y(G(R*)), i.e., the chromatic number of the unit-distance 
graph for the entire space. Clearly the number is the same if the distance 1 is 
replaced by any other distance. We could have just as well defined h(d) to be the 
maximum of y(G(S)) over all finite subsets § of R*. This is a consequence of the 
following theorem. 
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Compactness theorem (de Bruijn and Erdés 1951). Suppose that every finite 
subgraph of an infinite graph G has chromatic number <k. Then G itself has 
chromatic number at most k. 


The original proof used a theorem of Richard Rado on infinite matroids. All 
known proofs use the axiom of choice. The theorem is also a consequence of 
Gédel’s compactness theorem. 


5.6.1. In the plane 

Hadwiger (1961) showed that 4<h(2)<7. Erd6s thinks that almost certainly 
h(2) > 4. The following seven-point example due to Leo Moser gives the simplest 
proof that h(2)>4. Let O, P and Q be the vertices of an equilateral triangle of 
side 1. Let R be the reflection of O in the side PQ. Any 3-coloring must color O 
and R the same. Rotate the figure about O to obtain O, P’, Q', and R’ so that R 
and R’ are at distance 1. Four colors are necessary to avoid a conflict between R 
and R’. 

S. B. Stechkin has produced a decomposition of the plane into six sets, each of 
which fails to realize some distance, but unfortunately each distance is realized by 
some set, so it does not show that h(2)<7 (see Larman and Rogers 1972). 

The Moser graph has girth (smallest circuit size) 3. Wormald (1970) has a larger 
example with girth 5 that needs four colors. 


5.6.2. In higher dimensions 
Larman and Rogers (1972) used methods and results from the geometry of 
numbers to show that 


cn? <h(d) < {3 + o(1)}4 


After seeing their proof of the lower bound, Erddés made a conjecture about 
finite sets (there are at most (2 — €)” subsets of an n-element set such that no two 
have an intersection of cardinality r, where cn<r<c’n<in) which implied 
h(a) >(1 + c)*. By proving a slightly different extremal-set-theory result, Frankl 
and Wilson (1981) were able to show that 


h(d) = {1+ 1/5 + o(1)}*. 


Thus we know that h(d) grows exponentially, but we do not know whether 
h(d)‘"* tends to a limit as d tends to infinity. Frankl and Rédl (1987) did later 
prove Erd6s’s original set-theory conjecture. (See chapter 24.) 


5.6.3. The dimension of a graph 

Erdos et al. (1965) define the dimension of a graph G to be the smallest d such 
that the vertices of G can be represented as points in R‘ so that points joining by 
an edge are distance 1 apart. Every graph has finite dimension, since the complete 
graph K, on nm vertices can be embedded in R*' as a regular simplex. They 
establish several facts: 
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(1) For any graph G, dim G <2y(G). 

(2) There exist graphs with arbitrarily high girth and chromatic number. 

(3) If G has n vertices and girth >c log n for c sufficiently large, then x(G) <3, 
and therefore dim G <6. 

We make the remark that the above result of Larman and Rogers implies that 
dim G >c log x(G) for some c > 0. Hence, (2) implies that there exist graphs of 
arbitrarily high dimension and girth. 

Erdés and Simonovits (1980) show that dim G < A(G) + 2, where A(G) denotes 
the maximum degree of all the vertices of G. They also define the faithful 
dimension of a graph G to be the smallest d such that the vertices of G can be 
represented as points in R* so that two points are joined by an edge if and only if 
they are at distance 1. They proved that the faithful dimension is bounded above 
by 24(G) + 1. On the other hand, the faithful dimension is unbounded on the 
class of 2-chromatic graphs. 


5.6.4. Regular graphs of faithful dimension 2 

Karteszi’s method has come to be a convenient term for the method used by 
Karteszi (1963) to construct a geometrical example, see section 3.7. We shall 
explain the method by applying it on the following problem of determining the 
smallest regular graphs of degree k that have faithful dimension 2. 

That is, we define f(k) to be the minimum number n so that there exists a set of 
n points in the plane so that the unit-distance graph G on these n points is regular 
of degree k, i.e., so that each point is at distance 1 from exactly & others. 

Let S be a finite set of points such that the unit-distance graph G(S) (in which 
points are joined by an edge when they are at distance 1) is regular of degree k. 
Let S+a denote the set obtained by translating S by the unit vector a. Then 
every point x in S is at unit distance from its corresponding point x +a in S+a, 
so that one could almost say that the 27-element set 7, obtained by combining the 
two sets § and S$ + a, has an associated unit-distance graph G(T) which is regular 
of degree k +1. Unfortunately it might happen that a point x in S is at unit 
distance from a point y + a in S$ + a. Fortunately the simple procedure of rotating 
S can avoid this problem, since there are infinitely many possible rotated positions 
and only finitely many unit-distance occurrences to be avoided. This is essentially 
Karteszi’s method, and it shows in this case that f(A + 1) < 2f(k). Since f(2) =3, 
this gives f(3) <6, f(4) <12, and more generally f(k) <3-2"° 

In 1974, Erd6s asked whether f(k)=3-2*~*, and Purdy, with the aid of a 
computer, produced an example G, showing that f(4) <9. The regularity of G, as 
a line drawing makes it very beautiful, and it appears on the cover of at least one 
book (Bollobads 1986). After seeing G,, which is the Cartesian product of two 
equilateral triangles, we realized that duplication can be replaced by triplication 
to show that f(k + 2) <3k. More generally, by slightly perturbing the Cartesian 
product of a graph of degree r and one of degree k, we see that f(k +r) <f(k) f(r), 
which shows the full power of Karteszi’s method. Harborth (1985d) showed that 
the above constructions are optimal when k <5. See also Harborth (1983). 
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5.6.5. Minimal forbidden unit-distance graphs 

The computer search for small, regular unit-distance graphs leads naturally to the 
consideration of minimal graphs G that have dimension greater than 2. That is, if 
you remove an edge from G, it has dimension 2, The closely related problem of 
classifying the critical graphs of dimension 3 was raised in the paper Erdés et al. 
(1965). 

Purdy and Purdy (1988), with the help of a CRAY computer, determined these 
minimal forbidden graphs for n <7 and work on n = 8. They also prove (Purdy 
and Purdy 1988) that there are infinitely many minimal forbidden graphs. The 
work naturally generates interesting graphs, e.g., graphs which have dimension 2 
but not faithful dimension 2, or forbidden graphs which do not contain the 
forbidden subgraphs K, or K; 3. 


5.7. Triangles of unit area 


In 1967, A. Oppenheim asked us the following question: given n points in the 
plane, how many different triangles can have the same nonzero area A. Without 
loss of generality, of course, A= 1. Let A,(m) denote the maximum number of 
triangles of unit area, taken over all configurations of m points in R*. We were 
able to show (Erdés and Purdy 1971): 


cn’ log logn <A,(n)<cn*”?. 


The lower bound is provided by points (i, /) with positive integer coordinates 
i<a=[(logn)'’7] and j<n/a. It is easy to see that the area 1a! occurs cn” times 
from triangles with horizontal bases of lengths 1, 2,...,@. When the triangles 
with sloping bases are also counted, then estimates on the average behavior of 
Euler’s totient function gives cn” loga triangles of that area. To see the upper 
bound, we fix a point P and show that at most cn*”? triangles PXX’ can have unit 
area. We take the graph G whose vertices are the remaining m — 1 points, with X 
and X’ joined by an edge if triangle PX.X' has unit area. Then G cannot have a 
K, , subgraph, and so has at most n°” edges. 

Recently, Pach and Sharir (1990) have improved the exponent on the upper 
bound, showing that A,(n) = O(n’). 

In R® we were able to show A,(n)<cn*?. In R° a generalized Lenz 
construction gives us 3m points P,, Q,, etc. such that m? triangles are congruent 
and therefore have equal area. This left R° in doubt. Purdy (1974) showed that 
A,(n) <en*~*, where d was a small positive constant. Using algebraic geometry, 
he showed that there could not exist sets S, T, and U in R° containing N points 
each, such that the N° triangles obtained by taking one vertex from each set all 
had unit area, where N was very large, but constant. The following extremal 
theorem on hypergraphs, due to Erdés (1964), then implies the result: given a 
collection of ¢ triples of elements chosen from an n-element set, if ->n°"™, 
where f(k) = k~’, then there are sets S, T, and U of size k so that all k® possible 
triples formed from them are in the original collection. 
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5.8. Triangles of different areas 


One can also ask for the minimum number of different areas. Given n points in 

the plane, form all of the triangles. What is the minimum number of different 

areas that must occur if the points do not all lie on a straight line? (Degenerate 

triangles have zero area, of course). Let 7,(n) denote the minimum number. 
We showed 


cn! <T,(n)<en. 


(Erdés and Purdy 1976): The lower bound was subsequently improved by Burton 
and Purdy (1979) to: 


cn<T,(n). 


Only the correct constant remains to be found. 
5.9. Angles 


5.9.1. Repeated angles 
Given n points in R*, let B,(n, a) be the maximum number of times that the 
angle a@ can occur among the ordered triples of points, and let B,(n) be the 
maximum of that for all a, O0<a<v7. 

Conway et al. (1979) proved that 


B,(n)<en?'?. 


Purdy (1988) showed that 


3-1/25 


B,(n, a)<cen except when a = 37. 


Surprisingly, he showed that 
B,(3m, 41) >m°* 
by means of the following example. Let u,, uv; be m solutions of u* + v?=1. Let 
Xx; = (u,, v,, 0, 0) forl<ism, 
Y, = (0,0, 7,0) forl<jxm, 
Z, =(-1,0,0,k) forl<k<m. 
Then (Y,—X,)-(Z, — X,) = uj — 1+? =0, so that the m® angles Y,X,Z, are all 
right angles. , 
A further extension of the Lenz example shows that B,(3m,a)2m° for all 
acute angles a, but the magnitude of B,(m,a@) is unclear for obtuse angles a. 
Another example shows that B,(3m, a) =m? for all angles a. 


Purdy proved the upper bound on B,(n, a), for a not a right angle, by showing 
that in R" there cannot be 15 points X,, Y,, and Z, such that the 125 angles Y,X,Z, 
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all equal a. The hypergraph theorem mentioned in section 5.7 then implies the 
following. 

Let H be the subset of the triples on a set S of n points in R* defined by saying 
that XYZ is in H if and only if the angle XYZ equals a, and suppose that H 
contains more than n*''”° triples. Then there are three 5-element subsets S,, S,, 
and S, of S, so that every triple X,X,X, with X, in S, is in H, and every angle 
X,X,X, equals a. 

Recently, Pach and Sharir (1990) have shown that cn’ logn < B,(n) < 
c'n’ logn. They also showed that for infinitely many angles a, there is a constant 
c(a) depending on @ such that 


B,(n, a) >c(a)n’ logn . 


5.9.2. Different angles 

Corradi, Erdés and Hajnal asked: is it true that n points in the plane not all ona 
line determined at least 2 — 2 different angles? This is trivial if not three points 
are on a line, but seems to present curious difficulties in general. The weak Dirac 
conjecture (see section 3.9), now a theorem, states that there is always a point 
through which cn different lines pass, with a very small constant c. There is 
therefore a fixed vertex with that many different angles involving it. 

A related question is: is it true that the smallest (nonzero) angle determined by 
our points is <a/n? We have equality for the regular polygon. This is again trivial 
if no three of the points are on a line, but in the general case we only get c/n, by 
the weak Dirac conjecture. 


5.9.3. Large angles 

Erdés and Szekeres (1935) proved that given » points in the plane, if n = 2‘, then 
three of the points form an angle ¢ satisfying {1—1/k}4<@<7m. They gave 
examples showing that this could not be improved. 

Barany (1987) showed that under the right conditions there are many large 
angles: for any ¢ > 0 there is an n(e) such that every finite set S of points in the 
plane contains a subset 7 of cardinality at most n(e) such that every point in § — T 
is the vertex of an angle @ formed with rf, and ¢, in T such that w— € <0 <7. If 
n(e) is the smallest n that works, then 


c,/e <log,n(e) < {c,/e} log, {c,/e} - 


5.10. Problems involving circles 


Motzkin (1951) solved Syivester’s problem for circles. If n points are given in the 
plane, not all on a line, and not all on a circle, he showed that there is either a 
circle or a line containing exactly three of the points. Just invert the plane around 
one of the points and apply the Sylvester—Gallai theorem. Erdés asked: what is 
the minimum number c of circles that are determined by the points. Elliott (1967) 
showed that, if n is sufficiently large, then c 24(n—1)(n -2), i-e., the configura- 
tion of m points with m —1 collinear does indeed give the minimum. Serre had 
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already pointed out that this was not the minimum for eight points—the eight 
vertices of two concentric squares determine fewer than 21 circles. 

Let there be given 7 points in the plane such that no circle or line contains 
more than n—k of them. Beck (1983) showed that the number of circles 
containing at least three and at most A of the points exceeds ckn’. In fact, it is 
not hard to prove the same result with A replaced by 4. In the same paper he 
showed that the number of circles containing k or more points is at most cn*/k*, 
provided k<n'’?. 

Chung (1989) shows that the total number of incidences occurring between n 
points and 1 circles is at most c(n°*1°4 +n +1). 

Erdés and Straus observed that if n>1k° and X,,...,X, are in general 
position (no three on a line, no four on a circle): then there are always k of them 
so that the ik(k-—1)(k-—2) circles they determine all have different radii. 
Probably n can be much smaller. 


5.10.1. Unit circles 

A few years ago, Erdés asked how large is f(n), where f(n) is the largest number 
for which there are n points X,,... , X,, in the plane so that there are f(7) distinct 
circles of unit radius passing through pairs of them. Erd6és observed 


3 <fln) <n(n-1). 


The upper bound comes from observing that each pair of points determines at 
most two circles of unit radius. 
Elekes (1984) used the following clever argument to show that 


f(n)>cn?”. 

Let V,,...,V,, be unit vectors in general position in the sense that the ¢= 
Lm(m - 1)\(n — 2) sums of the form V,+V,+¥, are all distinct. It follows from 
this that the 2 = 5m(m — 1) sums V, +V, are also all distinct. A unit circle with 
center V, + V, +V, will contain the three points V,+V,, V,+ V,, and y, +V,. This 
pives. us ¢ unit cities containing at least three of the n points, so that f(gm(n - 
1) =£m(n — 1)(m — 2), and the result follows. 

Erdés conjectures that f(n)/n’ tends to zero. It might be possible to find an 
asymptotic formula, or even an exact formula for f(). Harborth and Mengersen 
have determined exact values of f(n) for n <8, see table 2 (Harborth 1985c, 
Harborth and Mengersen 1986). 

J. Beck proved the following in 1982. Let C,,..., C, ben circles of unit radius 
so that any two intersect. Then the number of distinct intersection points is at 
least cn’. The best value of c is not known. 


5.11. Integer distances 


Anning and Erdés (1945) showed that for every n there are n points spanning the 
plane such that all of their distances are integers, but that there is no infinite set of 
such points, i.e., not all on a line, all integral distances. 
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Table 2 


fn) 
1 

4 

4 

8 

12 
16 


SOrXNAA MAW] 


Graham et al. (1974) asked whether there are d+2 points in R“ whose 
distances are all odd integers. They showed that there are such points if and only 
if d is of the form 16k — 2. 

They point out that their examples all need three distances and ask whether 
there are sets of nm points in R“ realizing only two distances, D and D’, both 
integers. Their methods show that the answer is no if D and D’ are both odd. 
Harborth and Piepmeyer (1991) show that this is only possible if n <d +2. They 
also show that if D’ > cD, then n <d + 1, where c= {2+ 3'7}'?=1.93.... See 
also section 5.3.1. 


6. Helly-type theorems 


6.0. Introduction 


No discussion of combinatorial geometry would be complete without a discussion 
of Helly’s theorem and its generalizations. It has been 30 years since the last two 
surveys were written, following a period of great activity. Xeroxed frequently, the 
encyclopedic survey article by Danzer et al. (1963) is one, and the delightful book 
of Hadwiger and Debrunner, translated and expanded by Klee (Hadwiger et al. 
1964), now out of print, is the other. Our hope is that the present survey will shed 
some light on the relative ‘“‘dark age” that followed these surveys and also make 
the reader aware of the recent ‘‘renascence’’. Nonetheless, one seven-year gap 
remains in the references, that from 1969 to 1975. 

We have limited the discussion to finite collections of convex sets or to arbitrary 
collections of compact convex sets in R*. For a discussion of the intersection 
properties of infinite collections of arbitrary convex sets in R% we refer the reader 
to the first section of Klee (1963). 

For the results about Helly families of sets and their relation to Helly’s theorem 
we refer the reader to Bollobds’s (1986) book. 


6.1. Helly’s theorem 


Helly’s theorem (Helly 1923) says that if n =d+1 convex sets in E“ have the 
property that any d+1 of them have a nonempty intersection, then there is a 
point common to all of them. 
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On the line (i.e., d= 1) the theorem would say that if m =2 line segments are 
given and any two of them intersect, then they all do. This is certainly true! 

In the plane Helly’s theorem says that if n =3 convex sets are given such that 
any three intersect, then they all do. The example of n hyperplanes in general 
position in E* shows that d +1 cannot be replaced by d in Helly’s theorem. 

We shall deduce Helly’s theorem from Radon’s theorem, which asserts that 
n>=d+2 points in R‘ can always be split into two sets A and B whose convex 
closures have a nonempty intersection. 

The result is obvious if m= d+ 1. We shall use induction on n. Suppose that 
n>d +1, and consider a family {S,,...,S,} of convex sets in R* each d +1 of 
which intersect. The induction hypothesis may be applied to each of the smaller 
collections {S,,...,5,} — {S,} to obtain the existence of a point P, common to all 
of the sets except possibly S,, We now apply Radon’s theorem to P,,..., P,. 
There is a partition of {P,,..., P,} into two sets A and B whose convex closures 
intersect at a point Q. Helly’s theorem now follows by observing that Q belongs 
to all n sets of the family. 

To prove Radon’s theorem, let x,,...,x,, be the vectors representing n = d + 2 
points in R*. Since n >d +1, there is a nontrivial solution (y,,..., y,) to the 
system of d + 1 homogeneous linear equations 


ye tooo t+ y,X, =O, 
yp terety, =0. 


Some of the y’s will be positive, some will be negative, and perhaps some will be 

zero. We may suppose that the points have been numbered so that y,,..., y, are 

nonnegative and y,,,,---, y, are negative, for some k, 1<k<n. Thenc=y,+ 
>-+y, >0. The point 


Q =(y,/c)x, ++ + (y,/c)x, 


is in the convex closure of A = {x,,...,x,}, and the same point 


OQ = (Yess! O)Xp4, + °° + (y,/-Ox, 


is in the convex closure of B = {x,,,,-..,x,}, and Radon’s theorem is proved. 
We remark that if n =d+2 points are in general position, then the partition 
{A, B} is unique and is referred to as a minimal Radon partition. Tverberg (1966) 
extended Radon’s theorem, showing that any set of m(d + 1) — d or more points 
in R* can be partitioned into m subsets whose convex closures all intersect. 


6.1.1. Caratheodory’s theorem 

Helly’s theorem may also be viewed as the dual of a theorem due to Caratheod- 
ory, which states the following: if S is a subset of R“ and x is in the convex closure 
of S, then there are points x,,...,x,,, ES such that x=Ajx, +°+* + Aga Xaa1s 
where all A; = 0. See, e.g., Eggleston (1958). 
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6.1.2. Krasnoselskii’s theorem 

Krasnoselskii (alternative spelling Krasnosselsky) (Krasnosselsky 1946) proved 
the following: if S is a compact subset of R? containing at least d + 1 points such 
that for each d+ 1 points there is a point from which all d + 1 are visible, then S$ 
is star-shaped. The point P is visible from Q if the segment PQ lies entirely in S. 
The set S is star-shaped if there is a point P from which every point of S is visible. 
Krasnoselskii’s theorem has been regarded as a consequence of Helly’s theorem 
for a long time, but Borwein (1977) showed that one can also derive Helly’s 
theorem from Krasnoselskii’s theorem. 

In the plane the theorem may be described in the following picturesque 
manner: if a modern art galley is so shaped that for every three paintings there is 
a place where you can stand and see those three, then there is a place where you 
can stand and see all of the paintings. 

Krasnoselskii’s theorem is false for subsets of R“ that are not compact, and 
Peterson (1982) asked whether one could find a number f, so that if any f, + 1 
points are visible from some point, then the set must be finitely starlike, which he 
defined as follows. 


Definition. A set is finitely starlike if for any finite number of points in the set 
there is a point in the set from which they are all visible. 


Peterson’s question has not been settled, but there are some partial results. 
Breen (1988) finds such numbers for sets S in the plane, with certain restrictions 
on the interior of § and the closure of S restricted to S. See also Breen (1985) for 
a survey of theorems of the Krasnoselskii type. 


6.1.3. Some consequences of Helly’s theorem 
From Helly’s theorem one can immediately deduce the following: 

(a) If a convex set A can be translated so that it is contained in the intersection 
of any d + 1 of n=d +1 given convex sets, then it can also be translated so that it 
is contained in all n of the convex sets. We shall deduce this from Helly’s theorem 
to illustrate a technique: let B,,...,B, be the given convex sets. In any 
coordinate system let points be represented by vectors and for the set S let S+a 
denote the set of translates {x + a|x€S}. We define C, to be the set of vectors ¢ 
such that A + ¢ is contained in B,. The result follows by applying Helly’s theorem 
to C,,...,C,, which are also convex. 

(b) If a convex set A can be translated so that it intersects the intersection of 
any d+ 1 of n>d +1 given convex sets, then it can also be translated so that it 
intersects the intersection of all n of the convex sets. To prove this, let _ 
B,,...,B, be the given sets, and let C, be the set of vectors ¢ such that A + ¢ has 
a nonempty intersection with B,. Apply Helly’s theorem to the convex sets 
Cis ste Cos 

(c) If a convex set A can be translated so that it contains the intersection of any 
d+1ofn2d+1 given convex sets, then it can be translated so that it contains 
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the intersection of all m of the convex sets. To prove this, define C, to be the set 
of vectors c such that A + ¢ contains B,. 

The above results were published in Vincensini (1939) and Klee (1953). 
Assertions (a), (b), and (c) all become false if A is allowed to be rotated as well as 
translated. To see this, let the family of convex sets of C, of section 6.5, and let A 
be a line segment of length 2. 


6.1.4. Compact sets 

In the above arguments, we did not assume that the convex sets were compact, 
but sometimes it is convenient to do so. Helly’s theorem for a finite family of 
compact convex sets implies the same result for a finite family of arbitrary convex 
sets by the following standard argument, which applies to essentially all the 
generalizations of Helly’s theorem and will be taken as obvious from now on. Let 
S,,.--,5, be arbitrary convex sets such that any d+1 of them have an 
intersection point P = P(i,,...,%,,,) depending of course on the particular d + 1 
sets chosen. Let H be the polytope that is the convex closures of all such points P. 
For i=1,...,n let T; be the (compact, convex) intersection of H with the 
(topological) closure of S;. Applying Helly’s theorem for compact convex sets, we 
deduce that the intersection of the closures of all the §;, which is the closure of 
their intersection, is nonempty. Helly’s theorem then follows by observing that 
only a nonempty set can have a nonempty closure. 


6.1.5. Infinite families 

In Helly’s theorem for compact sets there is no need to assume that the family of 
sets is finite. The same remark applies to virtually all (compact) generalizations of 
Helly’s theorem. Suppose that we have a countably infinite family of compact 
convex sets in R® such that any d + 1 have a nonempty intersection. Let S, be the 
intersection of the first n sets. Then by the finite version of Helly’s theorem, S,, is 
nonempty. Let x, be the point of S, closest to the origin. The sequence {x,} is 
coritained in the compact set S, and therefore has a limit point x*, which must be 
in S*, the intersection of the S,. Hence x* is in all of the sets of the original 
family. To obtain the result for uncountable families, we simply observe that in 
R* the intersection of an infinite number of closed sets can be expressed as the 
intersection of some countable subcollection. 

Helly’s theorem is false for infinite families of bounded, but not closed convex 
sets, and for infinite families of closed but unbounded convex sets. For a 
discussion of the intersection properties of infinite families of noncompact convex 
sets in R*, see the first section of Klee (1963). 


6.1.6. Rado’s theorem 

A theorem of Ramsey’s (1930) asserts that if all of the r-membered subsets of an 
infinite set X are divided into m classes, then there is an infinite subset Y of X all 
of whose r-membered subsets belong to the same class. Using this theorem and 
Helly’s theorem, Rado (1952) proved the following: every infinite family of 
convex sets in R* contains an infinite subfamily with one of two properties: (a) 
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any finite number of sets of the subfamily have a nonempty intersection; (b) for 
some k<d every k members of the subfamily have nonempty intersection 
whereas every kK + 1 members have empty intersection. 


6.1.7. Multiplied Helly theorems 
The following multiplied version of Helly’s theorem is attributed to Lovasz in 
Barany (1982) (for simplicity we state it in R’). 

Let r, b, g 23, and suppose that we are given a red family of convex sets 
R,,...,R,, a blue family B,,...,B,, and a green family G,,...,G,, such that 
R,, B,, and G, have a nonempty intersection for any #, j, and k. Then for one of 
the colors it must be the case that all of the sets (of that color) intersect. 

In the d-dimensional version there are families of d + 1 different colors with at 
least d + 1 in each family. Helly’s theorem can be deduced from this by making 
the families all the same. Barany (1982) proves a dual version of this and remarks 
that both can be deduced from the following well-known fact: if the closed convex 
sets S,,...,5, do not have a common point, then there are closed halfspaces 
H,,...,H,, with H, containing S,, for i=1,...,#, which do not have a common 
point. Many generalizations of Helly’s theorem also have multiplied versions. 


6.2. Some generalizations 


De Santis (1957) extended Helly’s theorem to show that if we are given n =d + 
1—k convex sets in R* such that any d+1—k of them contain a common 
k-dimensional flat, then they all do. For k =0 this is Helly’s theorem, and a 
0-dimensional flat is simply a point. 

If we refer to any translate of a subspace of codimension x as a flat of deficiency 
k, so that a hyperplane is a flat of deficiency 1, then De Santis’s theorem may be 
more elegantly stated as follows: if we are given m =k + 1 convex sets in R* such 
that any k + 1 of them contain a common flat of deficiency k, then they all do. 

Griinbaum (1961) showed that if we are given n = g(d, k) convex sets such that 
any g(d, k) of them have a k-dimensional ball in common, then they all do. The 
function g(d, k) is defined as follows: 


g(d,0)=d+1, g(d,1)=2d, 
g(d,k)=2d—k forl<k<d, 
g(d,d)=d+1. 


For k =0 this is Helly’s theorem, and a 0-dimensional ball is a point. 

Horn (1949) and Klee (1951) showed that if 1<k <d+1, and you have n =k 
compact convex sets in R‘, then the following three assertions are equivalent: 

(a) every & of the sets have a common intersection; 

(b) every flat of deficiency k — 1 in R* admits a translate which intersects all the 
sets, 

(c) every flat of deficiency k in R% lies in a flat of deficiency k—1 that 
intersects all the sets. 
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Horn and Valentine (1949) added a fourth equivalent condition: 

(d) for each set of k of the convex sets and for each flat F of deficiency k —1 
there is a translate of F which intersects these k sets. 

When k=d +1, the above reduces to Helly’s theorem and assertion (c) is 
vacuous. 


6.2.1. Parallelotopes and Helly dimension 

If we are given n parallelotopes in R* with edges parallel to the coordinate axes, 
with the property that any ‘wo of them intersect, then they all intersect. Thus the 
d + 1 of Helly’s theorem has been replaced by 2. In a sense these sets behave as if 
they were in one-dimensional space. To pursue this further we need some 
definitions. 


Definition. A convex set K in R? is a convex body if it is compact and contains a 
d-dimensional ball. 


Definition. The set A is homothetic to B if A=kB+c for some vector ¢ and 
scalar k. The set kB := {kx |x € B}. (For example, if k>1, then kB is the result 
of expanding B about the origin of the coordinates.) Whether or not the sets A 
and B are homothetic is not affected by changing the origin. 


Definition. The Helly dimension of a family of convex bodies F is the smallest 
number d such that the following statement is true: if A is a subfamily of F such 
that any d + 1 members of A have a nonempty intersection, then all of the sets in 
A intersect. If F is the collection of all translates of a convex body K, then d is the 
Helly (translate) dimension of K. If F is the collection of homothetic images of a 
convex body K, then d is the Helly (homothetic) dimension of K. When the Helly 
(translate) dimension of K is the same as its Helly (homothetic) dimension, then 
we shall refer to their common value as the Helly dimension of K. 


Clearly the collection of all parallelotopes in R“ with edges parallel to the 
coordinate axes has Helly dimension 1, and any single parallelotope has Helly 
dimension 1. 


Definition. A zonotope Z is a convex set which can be expressed in the form 
Z=S,+S,+--:+5,, where the S, are (closed) line segments, and A + B is the 
usual (Minkowski) sum given by A+ B= {a+b|aEA, b€B}. For an exposi- 
tion of zonotopes see McMullen (1971). 


Zonotopes are in fact the projections of parallelotopes, and they have Helly 
dimension 1. It has been known for a long time that no other convex bodies have 
Helly (translate) dimension 1, see, e.g., Gritzmann (1988). 

It would seem reasonable to conjecture that the convex bodies of Helly 
(translate) dimension <d consist of the direct sums of compact convex sets that fit 
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in R*. In the special case of centrally symmetric bodies, this has been proved by 
Boltyanskii (1976) when d is 2, and by Kincses (1987) when d is 3 or 4. 

Barany and Kincses (1987) have characterized the Helly (homothetic) dimen- 
sion of a convex body K as the dimension of any dense set of extreme points of 
the dual of K. 


6.2.2. Quantitative Helly theorems 

Barany et al. (1984) prove the following quantitative Helly-type theorem: given n 
convex sets in R* such that the intersection of any 2d of them has volume at least 
unity, the intersection of all n of them must have volume at least c(d), where c(d) 
is a positive number depending only on d. Here 2d cannot be replaced by 2d — 1. 
To see this, take the family of the 2d halfspaces supporting a d-dimensional cube 
of side e, where e€ is small. The intersection of any 2d — 1 of them is unbounded, 
but the intersection of all 2d of them is only e*. Pach notes that similar 
quantitative Helly-type theorems can be proved with volume replaced by 
diameter, surface area, etc. 


6.2.3. Eckhoff’s conditions and fractional Heily theorems 
What if most subcollections of size d + 1 of a collection of convex sets in R® have . 
a nonempty intersection? Does it follow that most of the sets also intersect? 
Katchalski and Liu (1979) proved the following. Let 7 > d. For each p there is an 
a such that for any collection of n convex sets, if at least a proportion a of the 
r-tuples of sets have an intersection, then at least pn of the sets have a common 
intersection. 

Kalai rest made the above result quantitative by determining the smallest 
a =a(p, d, r) that will work. For example, a(p, d, d+1)=1-(1-p)** 

Kalai deduced the above from Eckhoff’s conditions, which we now explain. Let 
n convex sets be given in R*. We let f, denote the number of subcollections of 
i+1 sets that have a nonempty intersection. Thus Helly’s theorem says: 


it f= (4/1 ,)>then fy. =1 


Eckhoff conjectured that f=(f,..-, f,-,) would be an f-vector that actually 
occurred if and only if 


fers(p)er-o(e ee Vena) 


for all k, 1<k <n. Kalai proved the necessity of the inequalities in Kalai (1984) 
and their sufficiency in Kalai (1986). Eckhoff also had a proof of sufficiency, but 
did not publish it. 


6.2.4. The Griinbaum and Motzkin conjecture 

If one tries to generalize Helly’s theorem to sets that are themselves unions of <k 
disjoint convex sets, examples show that there is no A(d,k) such that if any 
h(d, k) members of a collection have a nonempty intersection then they ail do. 
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Griinbaum and Motzkin (Grinbaum 1961) considered a modified situation in 
which not only are the sets expressible as the union of <k disjoint convex sets, 
but the intersection of k or fewer sets can also be expressed as the union of <k 
disjoint convex sets. They conjectured that for such sets there is a number A(d, k), 
and that the smallest one that works is A*(d,k) =k(d +1). They proved their 
conjecture for k =2, and seven years later, Larman (1968) proved it for k =3. 
Larman aiso gives an example showing that h” (d, k) = k(d + 1). 


6.3. Selection theorems 


Debrunner and Hadwiger (1957) introduced the following. 


Definition. Let 7 convex sets be given. A k-partition of the sets is a partition of 
the sets into k classes so that the sets in each class have a common intersection. 


Klee once asked (see Hadwiger et al. 1964), whether there exists a number # 
with the following property: given a collection of n =A convex sets in R?, such 
that each A of them permit a 2-partition, the entire collection of 7 sets always 
permits a 2-partition. Unfortunately such an A does not exist, and the unitary 
rosette U,, of Hadwiger and Debrunner discussed in section 6.5 shows that. Each 
selection of 2n — 1 sets from U, permits a 2-partition, but the complete collection 
of 4n sets does not. 

Let us ask whether there is any property at all of each selection of 4 sets which 
would imply that all of the sets permit a 2-partition. A necessary condition would 
be that given any three sets, two of them must lie in the same partition; therefore, 
two of them must have a nonempty intersection. Is this condition also sufficient? 


Definition. A family of convex sets has property (p, q) if out of each selection of 
p sets some q have a nonempty intersection. 


We can now restate our question: does a family of m convex sets with the (3, 2) 
property necessarily permit a 2-partition? The answer is yes on the line [since 
K(3, 2, 1)=2, see below], but no in the plane. In fact, in R’ there is no k for 
which a k-partition is guaranteed by the (3,2) property. To see this, take n lines 
in general position. Then any two intersect, but no three intersect. Hence they 
will only permit a k-partition if 7 =< 2k. 


Definition. For p >q, let K(p, q, d) denote the smallest integer k for which a 
k-partition is admitted by each finite collection of convex sets in R? having 


property (p, q)- 


It is not hard to see that K(p, q, d)=p —q +1. Let us construct a family with 
a lopsided k-partition. One class has q sets in it, the other p — q classes contain 
only one set. Let us suppose also that sets in different classes are indeed disjoint. 
Such a collection can be constructed already on the line. Clearly if any p sets are 
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chosen, then at least q of them will have an intersection. Hence K(p, q, 
d)=p—q +1. It is shown in Hadwiger et al. (1964) that K(p, q, 1)=p—q+1. 
Hence K(3, 2, 1)=2, whereas K(3, 2, 2) is infinite. 

It is not known whether K(4, 3, 2) is finite. The above inequality gives K(4, 3, 
2) 22, but examples due to Danzer (in Hadwiger et al. 1964) and Grinbaum 
(1959) show that K(4, 3, 2) =3. Grinbaum’s example consists of nine translates 
of any centrally symmetric strictly convex body. It therefore also shows that 7(4, 
3, 2) 23, where 7(p, q, d) is the analogue of K(p, q, d) for families consisting of 
translates of a convex set. 

The example of n hyperplanes in general position in R* shows that K(p, q, d) 
is infinite if g<d. Each d hyperplanes will have a nonempty intersection, but no 
d+1 will, so that at least n/d partitions would be needed to cover all of the 
hyperplanes. 

Helly’s theorem asserts that K(d@ +1, d+1, d)=1. 

Debrunner and Hadwiger (1957) showed that K(p, q, d) =p —q + 1 whenever 
p-(p-d-1)/ds<q<p. 


6.3.1. Parallelotopes 
We can define a function N(p, q, d) to be the analogue of K(p, q, d) for families ~ 
of mutually parallel parallelotopes in R°. Then 
p-qtd 
p-qt1<Mp,q.d)<( d Ys 
Moreover, if 1+} p<q<p, then N(p,q,2)=p-—qt1. 
For a discussion of these and related matters see Hadwiger et al. (1964). 


6.4. Common transversals 


An m-transversal of family of sets in R¢ is an m-dimensional flat that intersects 
each set. We shall refer to a 1-transversal as simply a transversal. 

One might ask whether there is a number s such that if any s convex sets in the 
plane have a transversal then they all do. This is false even if we restrict the sets 
to be compact and pairwise disjoint. For every 1 there is a family of 4% compact 
pairwise disjoint convex sets in R? so that any 2n — 1 have a common transversal, 
but there is no transversal common to all of them. This is the expanding rosette 
E,, of section 6.5. 

Also in the plane, Vincensini defined a collection of convex sets to be totaily 
separable if there exists a direction so that all the lines parallel to that direction 
intersect at most one of the sets. For example, the rosette E, is not totally 
separable, even though the sets are pairwise disjoint. 

Vincensini showed that there was indeed an s so that if each s of a totally 
separable family of compact convex sets in R? admitted a common transversal, 
then they all did. Vincensini (1953) used s = 4, and Klee (1954) improved this to 
s =3. We shall now sketch Klee’s proof from Hadwiger et al. (1964). 

Suppose that there is a line LZ so that all lines parallel to L intersect at most one 
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set of the family. Thus any transversal of a pair of sets {A, 8} must intersect L at 
a nonzero angle @. Indeed there is an interval of angles /(A, B) corresponding to 
such transversal directions. With a little effort it follows from the hypothesis that 
any two intervals {(A, B) and {(C, D) intersect, so that Helly’s theorem in R! 
guarantees the existence of a direction common to all of the pairs. Projecting in 
that direction we see that any two projections A’ of A and B' of B must intersect. 
One more application of Helly’s theorem in R’ gives the result. 


6.4.1. Hadwiger’s theorem 

Let A be an ordered family of convex sets in R?. A transversal L is consistent with 
the ordering of A if the directed line L encounters the sets of A in order (or 
reverse order). 

Hadwiger (1957) proved that an ordered family of disjoint convex sets in R? 
will have a transversal if any three sets have a transversal consistent with the 
ordering. 

If a family of n compact convex sets is totally separable, then they are disjoint, 
and n — 1 parallel lines can clearly be drawn separating the sets from each other. 
Any transversal of three sets must be consistent with the obvious order induced 
by the parallel line. Hadwiger’s theorem provides the Klee—Vincensini theorem 
for finite families, and therefore for all families by a standard argument (see, e.g., 
section 6.1.5). 

In a very pleasing development, Wenger (1990) proved Hadwiger’s theorem 
without assuming that the sets of the family are disjoint. Wenger’s proof is similar 
to that of the Klee—Vincensini theorem given above. 

Wenger also makes the subtle observation that neither his theorem nor 
Hadwiger’s asserts the existence of a transversal that is consistent with the 
ordering! He then goes on to prove that an ordered family of compact convex sets 
such that every six sets have a transversal consistent with the ordering will have a 
transversal consistent with the ordering. It follows immediately from this result 
that a finite ordered family of arbitrary convex sets in R? will have a transversal 
consistent with the ordering if each six of the sets have a transversal consistent 
with the ordering (just use the argument in section 6.1.4). This result has the 
interesting corollary that an arrangement of labelled lines L,,..., ZL, will cross 
some line L in order if every six lines have this property. 


6.4.2. Higher dimensions 
Valentine (1963) proved the following theorem. Suppose that we are given a 
family of compact convex sets in R° such that there exist three distinct planes J7,, 
IL,, and If, containing a common line such that for each triple of convex sets M,, 
M,, M, in the family each pair of the triple is strictly separated from the 
remaining member of the triple by a translate of either 7, or H, or IT,, and this 
correspondence is cyclic. If every four members of the family have a common 
2-transversal, then all the members of the family have a common 2-transversal. 
Valentine remarks that a corresponding result is true in R*. 

In a recent development, Goodman and Pollack (1988) have proved a 
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generalization of Hadwiger’s theorem to R‘, and their result implies Valentine’s 
theorem. For the sake of simplicity we shall state it in R° for finite families. 

Given a finite family of compact convex sets in R* such that no three of them 
have a common 1-transversal, then they all have a common 2-transversal if and 
only if there is a configuration of points {P,,..., P,} in general position in R? 
such that for every four indices i<j<k</ there is an oriented plane [7 which 
meets B,, B,, B, and B, in points consistently with the order type of 
{Pi aig Po}. 

We need to define order type: two configurations of points {P,,...,P,} and 
{Q,,---,@Q,} in the plane have the same order type if for any three indices i, j, 
and k the question ‘Does P, lie on the left of the line from P; to P,?” always has 
the same answer as the question ‘‘Does Q, lie on the left of the line from Q, to 

pe 

In R4 the statement of Goodman and Pollack’s theorem is analogous. Their 
proof makes use of minimal Radon partitions and also a formula from Zaslavsky 
(1975) on the number of cells in a partition of R* by hyperplanes (see section 
4.3). 

In R% the above assumption that no three sets have a common 1-transversal 
generalizes to the assumption that no d sets have a (d — 2)-transversal. When - 
d= 2, this is just Hadwiger’s assumption that the sets are disjoint. Goodman and 
Pollack published their theorem before Wenger pointed out that this is superfiu- 
ous in R’, so it is natural to ask whether it is possible to prove the d-dimensional 
result without that hypothesis. Pollack and Wenger (1990) have recently done just 
that. In fact, they proved the following even more general theorem: a finite family 
{S,,...,S,} of connected compact sets in R’ has a hyperplane transversal if and 
only if for some k there exists a set of points {P,,...,P,} that spans R* and 
every k + 2 sets of the family {S,,...,5,} are met by a ke flat consistent with the 
order type of {P,,..., P,} 


ase 


6.4.3. Parallelotopes 

Santalo (1940) proved that if we are given n parallelotopes in R“ with edges 
parallel to the coordinate axes, and if any 2° '(d +1) of them admit a (d—1)- 
transversal, then they all do. 


6.4.4. Translates 

Recently, Tverberg (1989) proved the following result: given any collection of 
pairwise disjoint translates of a convex set H such that any five permit a 
transversal, there is a transversal common to all the translates. 

Danzer (1957) had proved this in the special case when H is a disc, and 
Griinbaum (1958) conjectured the general result. Danzer also gave the following 
example showing that five cannot be replaced by four: at the vertices of a regular 
pentagon of side 2 place the centers of discs of unit radius; any four of the discs 
have a common transversal, but the five discs do not have one. 

The example C,, of section 6.5 shows that the translates must be assumed to be 
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disjoint. Any 1—1 of the discs in C, admit a transversal, but there is no 
transversal for the entire family of discs. 


6.4.5. Flats 

Lovasz proved the following: given a family of r-dimensional subspaces of R’, 
such that any B,, of them are intersected by the same k-dimensional subspace, 
then there is a k-dimensional subspace intersecting all of them, where 


r+k 
B,=( k iE 


The result restated in terms of flats is the following: given a family of 
r-dimensional flats in R* space, such that any F,, of them have a k-transversal, 
there is always a k-transversal for the whole family, where 


r+k+2 
Fie ( kt+1 
The families do not need to be finite or countable. The proof uses Grassmann 
products. 


6.5, The rosette 


An important example in the plane is the rosette, due to Hadwiger and 
Debrunner (Hadwiger et al. 1964). 

For i=1,...,2n let S, be the circular arc with polar coordinates (r, @), where 
r=r,, and 


(i — n+ 1){0/(2n)} <0 <(i+n—1){a/(2n)} . 


Similarly, for i=1,...,2n let T, be the circular arc with polar coordinates (r, @), 
where y =r,, and 


(i +n —1){1/(2n)} <6 <(i + 3n — 1){n/(2n)} . 


Calculation shows that S; and 7; are arcs of length w—a/n radians, with S;, 
centered on @ = wi/(2n), and T, centered on wi/(2n) + 7. 

Compact convex sets are obtained from the circular arcs by forming their 
convex closures, which are D-shaped regions, the petals of the rosette. 

There are really two kinds of rosette: 

(1) If 7, increases rapidly enough with i, then the petals of the rosette are 
pairwise disjoint, and no line intersects all 4n of them. Nonetheless, any 2m — 1 of 
them have a line in common. Let us call this example E,,, the expanding rosette. 

(2) Ifr, =7,=+-+=r,, = 1, then we have 4n compact convex sets such that for 
each selection of 2m — 1 sets there is a pair of (antipodal) points P and P’ so that 
each of the 2n — 1 selected sets contains at least one of P, P’. Nonetheless, there 
are not two points Q and Q’ such that every one of the 47 sets contains at least 
one of Q, Q’. Let us call this rosette U,, the unitary rosette. 

Let the family C, of n congruent discs be defined as follows for n > 3. Let the 
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discs have radius R, and center in polar coordinates 


(7,6) =1,2iw/n) fori=1,...,n 


R,,=cos'(m/n)_ ifnis even, 
R,, = cos?(/n) + cos’*(m/2n)—1 ifn isodd. 


Any n~—1 of the discs have a transversal. In fact, it is possible to rotate and 
translate a line segment of length 2 so that it intersects any m — 1 of the discs. On 
the other hand, the discs do not permit a transversal. 


7. Some other problems 


7.1. Euclidean Ramsey problems 


A finite subset C of R? is called r-Ramsey for R’ if for any partition of R° into r 
sets S,,...,5,, some S, always contains a subset C’ which is congruent to C. If C 
is r-Ramsey for every r for some R’%, then it is said to be Ramsey. 

The study of these problems was started by the six mathematicians R. L. 
Graham, P. Montgomery, B. Rothschild, J. Spencer, E. G. Straus, and P. Erdés. 
The six originators found two very general conditions for a set to be Ramsey — one 
is necessary and one is sufficient: if C is Ramsey, then C must lie on the surface of 
a sphere of some radius; a subset of the vertices of a d-dimensional parallelepiped 
(a “brick’’) is always Ramsey. It is conjectured by at least the first of the authors 
that the first condition actually characterizes Ramsey configurations. It has 
recently been shown by Kfiz (1991) that the set of vertices of any regular n-gon is 
Ramsey. 

Since every acute triangle is contained in a brick, it must be Ramsey. This 
raised the question whether obtuse triangles were Ramsey. Frankl and Rédl 
(1986) proved that all triangles are Ramsey. They mention that they are unable to 
prove the Ramseyness of any pentagon, and only of some quadrilaterals (although 
Kriz has now done this). 

Many other challenging problems remain open. We shall present a few of them 
here. 

What triangles are 2-Ramsey in R?? L. Shader proved that all right triangles 
are. On the other hand, equilateral triangles are not. Graham conjectures that 
equilateral triangles are the only triangles that are not 2-Ramsey in R’. 

Color R‘ with r different colors. Is it then true that at least one color contains 
the vertices of a k-dimensional simplex of unit volume? Graham (1980) proved 
and generalized this. His paper contains many of the relevant references on 
geometric Ramsey theory up to 1980. See also Graham’s (1985, 1990) articles. 

In a recent development, Frankl and Rédl prove that all simplices in arbitrary 
dimensions are exponentially Ramse ey, which is a stronger property than just 
being Ramsey. A finite set A in R° is exponentially Ramsey if there exists a 
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positive real e(A) so that for every partition of R" into r classes with a =d and 
r<(1+e«(A))”, one of the classes contains a congruent copy of A. We refer the 
reader to chapter 24. 

Erdos posed the following question: if S is a subset of the plane such that no 
two points of S are at unit distance, is it then true that the complement of S 
contains the vertices of a unit square? Juhasz (1979) settled the question in the 
affirmative for the vertices of any quadrilateral, and she showed that the result 
was false for the vertices of a 12-gon. 


7.2. Heilbronn’s problem 


An old problem of Heilbronn states the following. Let X,,..., X, be n points in 
the unit circle in the plane. Denote by A,(X,,...,X,,) the smallest area of all the 
k-gons formed from the n points, and let g,(m) denote the maximum of 
A,(X,,-.-.,X,,) taken over all configurations of n points. 

Heilbronn asked for the determination or estimation of g,(). He of course 
observed that trivially g,(2)<c/n and suspected that, the order of magnitude of 
g;(n) is 1/n’. Erdés observed that indeed g,(n) > c/n’. The first nontrivial result 
was due to K. F. Roth who proved that g,(n)<1/n(loglogn)'’?. This was 
improved by W. Schmidt to 1/n(logn)'’? and later by Roth to 1/n'*‘, where 


a 
The competition between Schmidt and Roth was interrupted by Komlds et al. 


(1982) who improved c to 4 and showed that 


83(n) > (clog n)in’ . 


Their proof of the lower bound uses techniques for combinatorics and 
probability theory, and Erd6s thinks that their method of proof will have further 
applications. Since Heilbronn had expressed the belief that g,(m) had order 1/n’, 
you could say that he had been proved wrong, but that would be a little harsh, 
and Szemerédi believes that it is quite possible that the new lower bound (with 
the log factor) is best possible. 

For k >3, the first nontrivial results are due to Schmidt, who proved that 


8,(n) > cyn 1" 
and asked whether g,(n) = 0(1/n). This problem is still open. 

Erdés and Szemerédi posed the following problem. Denote by D= 
D(X,,...,X,) the smallest distance occurring between two of the X’s, and by 
A=A(X,,...,X,) the smallest angle determined by any three of the points. If 
there are three collinear points, then A will be zero. Is it true that 


DA = o(n*”)? 


The answer to this question might throw some light on Heilbronn’s problem. It 
is not too hard to show that DA <cn *’?. The regular polygon gives the largest 
known value for DA. 
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V. T. Sos, E. Straus, and Erdos have slightly modified Heilbronn’s problem 
(see Erdés 1985a for details). 


7.3. Diverse problems 


Let X,,...,X, be m points in the plane not all on a line. Denote by A= 
A(X,,-..,X,) the largest area and by a=a(X,,...,X,) the smallest positive 
area of the triangles X,X,X,. Define f(n) to be the minimum of A/a taken over all 
configurations of n points. Erdos et al. (1982) showed that f(m) = [(n — 1)/2] if 
n > 37. The corresponding function for d-dimensional simplices satisfies f,() = 
[(z — 1)/d], and perhaps equality holds for n >n,. 


7.3.1. Convex polygons 

Let g(n) be the smallest integer such that any planar configuration of g(n) points, 
no three on a line, must contain the vertices of a convex n-gon. Erd6s and 
Szekeres (1935) proved that 


2n—4 
24 1<giny<("_S) 
and conjectured that g(n) is equal to the lower bound. Their paper also contains 
an elegant proof of a much worse upper bound using Ramsey’s theorem. Their 
conjecture has been verified for some small values. E. Klein proved that g(4) = 5, 
and E, Makai and P. Turan proved that g(5) = 9. 


7.3.2. Empty polygons 

Denote by f(x) the smallest number (if it exists) for which every set of f(m) points 
with no three on a line contains k which form a convex polygon that contains 
none of the other points in its interior. It is easy to see that f(4) = 5. Ehrenfeucht 
and Harborth proved that f(5) was finite. Harborth (1979) showed that in fact 
f(5) = 10. It is not yet known if f(6) exists. Horton (1983) showed that f(7) does 
not exist. 


7.3.3. The minimum number of empty triangles 

Erdés asked the following question: given n points in R’, no three on a line, what 

is the minimum number 7(n) of empty triangles that they can form? A simple 

argument shows that 7(n) = {n(n — 1), for any pair of points {P,Q} determines a 

line L, and a closest point R to L. The triangle POR must be empty, and triangles 

are counted at most three times in this way. That T() <cn* can be deduced from 

the example in Horton (1983) and was also proved independently by Purdy 

(1982b) and by Katchalski and Meir (1988), who also counted empty convex 
k-gons. Barany and Fiiredi (1987) looked at the corresponding problem in R*. 


7.3.4. Mid-points of diagonal and convex n-gons 
Erdos et al. (1990c) define f(n) to be the minimum over all convex n-gons of the 
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number of different mid-points of the diagonal. They show that 


0.8 () <f(n) <0.9 () 


7.4. Non-Archimedean valuations 


Sometimes, p-adic valuations are useful for constructing examples. For any prime 
p, the p-adic norm N,(x) is defined for rational numbers x. A highly nonconstruc- 
tive theory of homomorphisms requiring the axiom of choice then says that N,(x) 
extends to a non-Archimedean valuation r(x) on the set of reals R. We shall 
answer two geometric questions using r(x). 

(a) Is it possible to color the real projective plane P” with three colors so that 
there are three noncollinear points of each color and no line contains all three 
colors? 

If we do not insist that each color has three noncollinear points, then the 
problem has a trivial solution. Let P be any point and L any line through P. The 
points of the plane are colored as follows: P is red, the rest of the line is green, 
and the rest of the plane is blue. 

D. S. Carter and A. Vogt, and independently A. W. Hales and E. G. Straus, 
have shown that this is possible (with analogous results in higher dimensions). We 
give a sketch of the proof in Hales and Straus (1982), which is comparatively 
short, and refer the reader to that paper for further details and references. 

Let r(x) be any non-Archimedean valuation defined on the reals. Let (x, y, z) 
be the homogeneous coordinates of a point P. We color P as follows: 

If r(x) > max{r(y), r(z)}, then color the point C,. 

If r(y) = r(x), and also r(y) > r(z), then color C,. 

All remaining possibilities use color C;. 

Consider the points P(x;, y;, z;) for i= 1, 2, 3. Now look at the determinant 


x, Yi 21 
X, Yr 22 
Xx Y3 23 


This will be zero if and only if the three points are collinear. Suppose that point 
PAx;, Y;, 2;) is colored C, for i=1, 2, 3. Observe that 


r(x,) > max{r(y,), r(z1)} , 
r(y,)=r(x,) and r(y,)>r(z2), 
r(z3) = max{r(x3), r(y3)} . 


This implies that r(x, y22,) > r(x,y,z,) for the 5 = 3! — 1 other choices of distinct i, 
j, k. Hence r(D) = r(x, y.z,) 21. Hence the determinant D is nonzero, and the 
points are not collinear. Hence three points of different colors cannot be 
collinear. 

The trivial valuation ¢(x) is defined by saying that (0) =0, and «(x)= 1 if x is 
nonzero. It is not hard to show that if r(x) is not the trivial valuation, then each 


D= 
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color has three noncollinear points. If r(x) = #(x), then we get the trivial coloring 
described above: C, will be points for which x is nonzero and y = z = 0. This will 
be the point on the line L at infinity where the x-axis intersects L. C, will be the 
points for which y is nonzero, z = 0, and x is anything. This will be the remainder 
of the line at infinity. C, will be the remainder of the projective plane. 

Straus and Hale, and Carter and Vogt also showed that the implication goes the 
other way. That is, any 3-coloring of a projective plane over a field F having the 
above properties gives rise to a non-Archimedean valuation on F. Since the finite 
fields and their extensions do not possess such valuations, their projective planes 
do not have such colorings. 

Thus, for example, if P(2, p) is the projective plane formed from the field Z,, 
where p is a prime number, and if the points of P(2, p) are colored with three 
colors in such a way that there are three noncollinear points of each color, then 
there will always be a line with all three colors on it. 

(b) Problem number 5479 in the American Mathematical Monthly posed by 
Fred Richman and John Thomas asked whether a square could be divided into an 
odd number of nonoverlapping triangles of equal area. The answer is no, as 
proved by Monsky (1970), using a non-Archimedean valuation r(x) on the reals. 
John Thomas had already proved the result for the special case when the vertices 
of the triangles are rational, and he used a 2-adic valuation, but he did not extend 
it to the reals. 

Mead (1979) extended Monsky’s result to R“, showing that the unit hypercube 
can be divided into m simplices of equal volume if and only if m is a multiple of 
d! Kasimatis (1989) proved that if m =5, a regular m-gon can be dissected into m 
triangles of equal arca if and only if m is a multiple of n. The result is false for 
n=3. For n=4 it follows from Monsky’s result. 

We now briefly sketch Monsky’s proof, slightly modified. 

Let r(x) be a non-Archimedean valuation on the reals obtained by extending 
the 2-adic norm, so that r(2) = 4. Color the points of the plane with three colors 
in the same manner as in (a) above. Let T be a triangle whose vertices are of 
three different colors. Then 


r(area T) = (3D) = r(5x1 9222) > (3) =2. 


Suppose now that the unit square O<x, y <1 (all points have z = 1) is divided 
into m triangles of area 1/m. Monsky proves that there must be a triangle with 
vertices of three different colors, which implies that r(1/m) = 2, and m is an even 
integer, as required. Incidentally, Monsky used the fact that no line had all three 
colors. He colored R’ rather than the projective plane, but it is the same coloring 
you obtain if you restrict that of Straus and Hales to R? and only consider p = 2. 

Stein (1989) conjectured that centrally symmetric 2n-gons can never be | 
dissected into an odd number of triangles of equal area, and proved this for 
2n = 6, 8. For 2n = 4 this follows from Monsky’s result, and for regular 2n-gons it 
follows from Kasimatis’s result. Monsky recently proved this conjecture by 
showing that in fact no symmetric subset of the plane can be dissected into an odd 
number of triangles of equa! area. 
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7.5. Halving lines 


Suppose we are given n points in the plane, no three on a line. It is easy to see 
that there is always at least one line L that is spanned by the points that also cuts 
the points in half. That is, so that as near as possible half of the remaining n — 2 
points are on each side of L. To find L, just rotate a line L' about a vertex of the 
convex closure of the n points, which will cause L’ to cross the other m — 1 points 
one at a time. This argument shows that there are at least three such halving lines. 

Erdos et al. (1973) define h,(n) to be the maximum number of halving lines of 
a set of n points in the plane, no three collinear, and they give a construction 
showing that h(n) > cn log n. They show that h,(n) = O(n”). Recently, Pach et 
al. (1992) improved this to h,(n) = O(n*’/log*n), where log*n denotes the 
iterated logarithm function. 

In Edelsbrunner’s (1987) book Algorithms in Combinatorial Geometry, there is 
a discussion of h(n) and the corresponding function h,(n) in R*, and a proof due 
to H. Edelsbrunner and R. Seidel that 


h(n) =c,n* 'logn. 
Barany et al. (1990) show that 


h,(n) <¢'n 18 : 


This was recently improved to 
h(n) = cn®” log?’?n 


by Aronov et al. (1991). 


7.5.1. Edelsbrunner’s problem 

Edelsbrunner (1982) asked for bounds on the maximum number g(k, 7) of subsets 
of size at most k cut off by a line, for 7 points in the plane, no three on a line and 
k <1n. Goodman and Pollack (1984a) showed that g(k, n) < 2nk — 2k’ — k. Alon 
and Gy6ri (1986) determined that g(k, 2) = kn. The problem may be stated more 
generally in terms of pseudolines, or allowable sequences, and Alon and Gy6ri 
obtain the same value for the corresponding function analogous to g(k, 7). 
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1. Introduction and basic definitions 


This survey is concerned with the combinatorial or facial structure of convex poly- 
topes. The earliest result in the area is Euler’s theorem asserting that if v, e, and 
f are the numbers of vertices, edges, and 2-faces of a 3-dimensional convex poly- 
tope, then v — e+ f = 2. Attempts to understand and extend Euler’s theorem were 
among the first stimuli for the developments on which we report. Since the early 
1950s, the relationship to linear programming has been an important stimulus. It 
has been joined more recently by connections on the “pure” side with matroid 
theory, commutative algebra, and algebraic geometry, and on the “applied” side 
by problems from statistics, nonlinear optimization, computational geometry, and 
computational complexity. Thanks to all of these stimuli, as well as to the strong 
intuitive appeal of the subject, the combinatorial study of convex polytopes has 
advanced greatly in the past thirty years. 

In studying the facial structure of polytopes, the natural setting is a finite- 
dimensional vector space E over an ordered field ®. Of course, the real field 
R and its subfields are of special interest. Except where the contrary is stated, at- 
tention is confined here to the case in which @ = R, but that is mainly for brevity 
as a large fraction of the facial or combinatorial theory can be developed for an 
arbitrary ®. 

As the terms are used here, a polyhedron is the intersection of a finite collection 
of closed halfspaces in E and a polytope is a bounded polyhedron. Equivalently, a 
polytope is the convex hull of a finite set. Since, for us, polyhedra and polytopes 
are all convex, the word convex is omitted in much of what follows. 

A face of a polyhedron P is the empty set 9, P itself, or the intersection of 
P with a supporting hyperplane. Prefixes indicate (affine) dimension, and the 0-, 
1-, (d — 2)- and (d — 1)-faces of a d-polyhedron are respectively its vertices, edges, 
ridges and facets. A polyhedron is pointed if it has at least one vertex and hence 
has faces of all dimensions less than.or equal to the dimension of the polyhedron 
itself. 

With respect to the ordering given by set inclusion, the collection of all faces of a 
polyhedron P forms a lattice, the face-lattice of P. Two polyhedra P and Q are said 
to be isomorphic, or combinatorially equivalent, or of the same combinatorial type, 
if' their face-lattices are isomorphic. When both are polytopes, this is equivalent 
to saying that there is a bijection between their vertex sets V and W such that a 
subset of V is the vertex set of a face of P if and only if its image in W is the 
vertex set of a face of Q. 

A complex, as the term is used here, is a finite collection @ of polytopes such 
that each face of a member of © is itself a member of @, and the intersection of 
any two members of € is a face of each. The dimension of a complex @ is the 
maximum of the dimensions of its members. The union of @’s members, sometimes 
called set @, is denoted by |) €. A d-complex © is pure if (J is the union of @’s 
d-dimensional members. Note that when P is a d-polytope, the faces of P other 
than P itself form a pure (d — 1)-complex @(P), the boundary complex of P. 

A complex © in our sense is sometimes called a cell complex or a polyhedral 
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complex, and its members are called cells. However, to emphasize our primary 
interest in the boundary complexes of polytopes, the members of © are here 
called faces, and when © is a pure d-dimensional complex its 0-, 1-, (d — 1)-, and 
d-dimensional members are called vertices, edges, ridges, and facets. 

Two complexes @ and & are isomorphic if there is a bijection between their 
vertex sets V and W such that a subset S of V is the vertex set of a face of € if and 
only if the image of S in W is the vertex set of a face of 2. Thus the combinatorial 
equivalence of polytopes can be defined, according to taste, as isomorphism of 
their boundary complexes or isomorphism of their face-lattices. A complex © is 
polytopal if it is isomorphic to the boundary complex of a polytope. Polytopal 
complexes are the primary focus of this report. In order to convey the spirit of 
some of the methods that have been used to study such complexes, we include a few 
indications of proof. However, our primary concern is with definitions, statements 
of results, and references. 

The literature contains detailed studies of the boundary complexes of a vari- 
ety of specific polytopes, especially those that are regular or close to regular in 
various senses and those that arise from important types of optimization prob- 
lems. That latter direction of research is often called Polyhedral Combinatorics, 
and is surveyed in chapter 30 in this Handbook. It has much in common with the 
subject matter of the present chapter, but its emphases are different. Here our 
primary concern is not with specific polytopes but with properties that hold for all 
polytopes or at least for very wide classes of them. Specific polytopes arise here 
mainly as the solutions of various extremum problems concerning combinatorial 
structure. Even though our treatment is restricted to the “general” (as opposed 
to the “specific”) combinatorics of polytopes, we have been forced by limitations 
of space to ignore many important areas. However, by consulting the references 
listed here, the reader will gain some acquaintance with these areas as well as the 
ones discussed here. 

Polytopes may be presented in a number of ways. In many cases, the initial 
description is not a matter of taste but is dictated by the nature of a problem that 
gives rise to a polytope. It is often desirable to be able to derive the full face-lattice 
or boundary complex from the initial presentation. This topic is discussed briefly 
in section 3. 

Because of the many situations in which polytopes arise, and the frequency with 
which facial structure is of interest, polytopal complexes are important combi- 
natorial objects. Many difficulties in studying them arise from the fact that they 
are defined in an extrinsic, geometric fashion, and it seems that a purely com- 
binatorial characterization could be very useful. The problem of finding such a 
characterization is commonly called the Steinitz problem, because of a theorem 
in the book of Steinitz and Rademacher (1934) that solves the problem for 3- 
polytopes. The Steinitz problem is open for each dimension d > 4, but it has led 
to various 3-dimensional ramifications and higher-dimensional partial analogues. 
The most important of these are described in section 4. 

Section 5 deals with the enumeration of combinatorial types. This has been a 
much-studied topic in the case of 3-polytopes, and for that case there are fairly 
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complete results. In higher dimensions, on the other hand, the lack of a purely 
combinatorial characterization makes precise enumeration difficult even for rel- 
atively small numbers of vertices or facets. Nevertheless, there are a few pre- 
cise enumerations as well as some rough asymptotic results, and they are de- 
scribed. 

Among the combinatorial invariants associated with a d-polytope, the face- 
vector and the edge-graph are the ones that have been most thoroughly studied. 
They are the subjects of sections 6 and 7, respectively. The face-vector f(P) is 
the sequence (fo, fi, ..-,fg—-1) that lists for each i the number f; of i-faces of P. 
In the edge-graph @(P), the nodes are P’s vertices and the edges are pairs of 
vertices that share an edge of P. There are other natural ways of associating a 
graph with P, but when one speaks simply of the graph of P, the edge-graph 
&(P) is intended. A graph is d-polytopal if it is isomorphic to the graph of a 
d-polytope. 

For results on polytopal complexes obtained before 1967, the most important 
reference is the book of Griinbaum (1967). Since we refer to this so frequently, it 
will be designated simply as Gr67. It, and a subsequent survey article (Griinbaum 
1970) stimulated many later developments. There were later books by McMullen 
and Shephard (1971), Bartels (1973), Yemelichev et al. (1981), Br@ndsted (1983), 
and Ziegler (1994), and survey articles by Ewald et al. (1979) and Bayer and Lee 
(1993). 

We are indebted to M. Bayer, G. Kalai, L. Lovasz, B. Sturmfels, and G. Ziegler 
for helpful comments. 


2. Additional terminology 


A d-polyhedron is simple if it is pointed and each of its vertices is incident to pre- 
cisely d edges or, equivalently, to precisely d facets. A polytope is simplicial if each 
of its facets is a simplex (the convex hull of an affinely independent set of points). 
(The simplices are the only polytopes that are both simple and simplicial.) The sim- 
ple polytopes and the simplicial polytopes are of special importance because many 
problems from other areas lead naturally to one or the other sort of polytope, and 
berause, for each d-polytope P in R? there is an arbitrarily small perturbation of 
P’s facets (vertices) that yields a nearby simple (simplicial) d-polytope. 

When P is a d-polytope in R? and the origin is interior to P, the polar P° of 
P is obtained by intersecting certain halfspaces corresponding to the points of P; 
specifically, 


P° = {x € R@: (x,y) <1 for all y € P}, 


where (-,-) is the usual inner product. The same set P° is obtained when y ranges 
only over P’s vertex-set, and hence P° is also a d-polytope with the origin in its 
interior. The boundary complex (or face-lattice) of P is antiisomorphic to that of 
P° — i.e., there is an inclusion-reversing bijection — and it is common also to say 
that the polytopes are dual to each other. 
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For an arbitrary d-polytope P, there is a translate Q such that the origin belongs 
to Q’s relative interior, defined as the interior of Q relative to its affine hull A. It 
is then possible to form the polar Q° of Q in A, and this is a polytope whose 
face-lattice is antiisomorphic to that of P. Any polytope Q° formed in this way 
will be denoted by P*. There are many different choices for P*, but that will cause 
no confusion as they are all combinatorially equivalent. 

Under polarity, simple polytopes are dual to simplicial polytopes, so combinato- 
rial results on the facial structure of simplicial polytopes can be directly translated 
into results about their simple counterparts, and vice-versa. Thus much of the 
combinatorial study of polytopes can be approached from the “simple” viewpoint 
or, equivalently, from the “simplicial” viewpoint. Though formally equivalent, the 
two viewpoints lead to different emphases and different generalizations. From the 
viewpoint of linear programming and some other applications, the simple viewpoint 
is more natural. However, the simplicial viewpoint is emphasized here except in 
section 7, because it is more purely combinatorial in flavor and it enables us to use 
the well-developed language of simplicial complexes. Polytopes that are neither 
simple nor simplicial are also considered, but the initial focus for most topics is on 
the simplicial case. 

A complex is simplicial if its members are all simplices. In particular, if P is a 
simplicial d-polytope, then the boundary complex &(P) is a pure simplicial (d — 1)- 
complex. Simplicial complexes can be defined as abstract complexes in a purely 
combinatorial way, so that a pure simplicial (d — 1)-complex, e.g., would consist of 
a finite collection of d-sets (sets of cardinality d) together with all subsets of those 
sets. However, here we stick to embedded complexes (i.e., to complexes as defined 
earlier), for they are more natural when considering the boundary complexes of 
nonsimplicial polytopes. When dealing with simplicial complexes, the reader may 
employ the geometric or the abstract definition, according to taste, for each abstract 
simplicial complex can be realized geometrically. In this Handbook, chapter 34 
contains a good discussion of abstract simplicial complexes and the natural way of 
topologizing them. 

Several other sorts of complexes arise naturally in the study of polytopal com- 
plexes. A complex © is called a sphere (ball) if |) @ is homeomorphic to a Eu- 
clidean sphere (ball). Note that if P is a d-polytope, then the boundary complex 
B(P) is a (d —1)-sphere and the complex @(P) U {P} is a d-ball. A complex € 
is called a d-manifold if the topological space (J € is locally homeomorphic to R?. 
And © is a d-pseudomanifold (closed d-pseudomanifoid) if € is a pure d-complex 
and each ridge of @ is contained in at most (exactly) two facets of ©. 

A Shelling of a pure d-complex © is an ordering F\,...,F, of @’s facets such 
that for 2 < j <n the complex FN (UW F;) is a (d — 1)-ball or (d — 1)-sphere. A 
pure complex is shellable if it admits a shelling. This notion is discussed in detail 
in chapter 34. 
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For a complex € and a face F of ©, the /ink of F in @ is the following subcomplex 
of @: 


link(F, @) = { C € €: CO F =6 and there exists D € € for which 
Cc DandF cD} 


3. Presentations of polytopes 


Polytopes and polyhedra can be presented in a variety of ways. In many applica- 
tions, a polyhedron P is presented as an intersection of closed halfspaces, hence as 
the solution set of a system of linear inequalities Ax < 5, where A is a real m x d 
matrix, b € R™, and x € R?. When P is d-dimensional and no row of the matrix 
[A, 5] is redundant in the representation, each row corresponds to a facet of P. 
See chapter 30 for more information on presentations of polyhedra. 

A polytope P in R¢ can also be represented as the set of all convex combinations 
of a finite set of points {x,,..., xn} in R¢, hence as 


P= {oP Axi: A; 2 0, Dadi = 1} . 


(By a theorem of Carathéodory, the same set is obtained when all but d+1 of 
the A; are required to be 0.) When the set {x,,...,x,} is minimal for this rep- 
resentation of P, it is P’s vertex-set. Many authors have studied the problem of 
finding such a minimal representation, or, in other words, of finding the vertex set 
of a polytope that is presented as the convex hull of a given finite set of points. It 
seems that a clearly optimal algorithm is known only for the 2-dimensional case 
(Kirkpatrick and Seidel 1982). For higher-dimensional cases, usable algorithms and 
other references can be found in Swart (1985). 

A common computational task is to pass from one sort of presentation of a 
polytope P to another. As is discussed in section 6, a d-polytope with n vertices 
(facets) may have as many as 


(" - Us D1) , (" - Was 2!) 


facets (vertices), and hence one sort of presentation may be much longer than the 
other. Thus in measuring the efficiency of an algorithm for passing from one sort of 
presentation to another, it is reasonable to take into account the size of the output 
as well as the size of the input. The reader is referred to Dyer (1983), Swart (1985) 
and Seidel (1987) for studies of the computational complexity of some problems 
of this sort, for further references and for specific algorithms, and to Mattheiss and 
Rubin (1980) for computational experience with several algorithms. 

Three other ways of presenting a d-polytope P are useful because they lead to 
lower-dimensional pictures of P’s facial structure. Suppose that P is a d-polytope 
in R4, F is a facet of P, and x is a point of R4\P such that among the hyperplanes 
that are affine hulls of the facets of P, only that of F separates x from P (in 
the terminology of Gr67, x is beyond F but beneath all the other facets of P). 
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Let € = B(P)\{F}. The projection of P onto F by rays issuing from x yields, 
when restricted to |) ¢, a homeomorphism of ) © onto F, and this projection 
carries the members of €\{F} onto the members of an isomorphic complex € 
with |) € = F. The complex € is called a Schlegel diagram of P based on F. 

Another useful dimension-reducing representation of a polytope is its Gale di- 
agram, a notion developed by M. Perles (Gr67). We state the central result below 
and refer to McMullen (1979) for a comprehensive survey of Gale diagrams and 
their applications. 

Suppose that P is a d-polytope in R* and X = {x;,...,xn} is its vertex set. An 
affine dependency of X is a vector (a1,..., a,) € R” with 2?_,a; =O and 2? _,ajx; = 
0. Suppose that the n-vectors aj,...,@,—g-; form a (linear) basis for the vector 
space of all affine dependencies of X. Let A be the x (n — d — 1) matrix whose 
columns are the a;’s, and let the x;’s, 1 <i<n, be the row-vectors of A. The 
set X = {%1,---) Xa} CR ~4-1 is called a Gale transform of P, and its normalized 
version X, obtained by replacing each nonzero x; € X by the point ¥, = x;/{{x;{|, 
is a Gale diagram of P. The following theorem from Gr67 shows that the entire 
boundary complex %(P) can be “read” from a Gale transform or Gale diagram. 
This is very useful when P’s number 7 of vertices is not much larger than P’s 
dimension d, for then the transform and diagram are low-dimensional objects. 


Theorem 3.1. A subset {x;i,,..-,xi,} of the set X of vertices of a d-polytope P is 
the vertex set of a face of P if and only if the origin belongs to the relative interior 
of the convex huil of the set X\{X;,,...,%;,} or, equivalently, to the relative interior 
of the convex hull of the corresponding subset of x. 


A more sophisticated relative of the Gale diagram is the affine Gale diagram 
of Sturmfels (1987a), which permits a further reduction in dimension. When P 
is a d-polytope P with n vertices, the Gale diagram is, in a sense, an (n — d — 
1)-dimensional object and the affine Gale diagram is an (n — d — 2)-dimensional 
object. By means of appropriate rules of translation, P’s combinatorial structure 
can be “read” from either diagram. The difference betweenn — d—1andn—d-—2 
is unimportant when n is much larger than d, but it is important when, e.g., the 
larger number is 3 or 4. 


4. Polytopal realizability of complexes 


This section is divided into subsections as follows: Realization of 3-polytopes; The 
higher-dimensional Steinitz problem; Realization over incomplete ordered fields. 


4.1. Realization of 3-polytopes 


Steinitz’s combinatorial characterization of 3-polytopes (Steinitz and Rademacher 
1934) can be formulated in a number of ways. The following combines formulations 
of Griinbaum and Motzkin (1963) and Klee (1966). The important point is that 
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condition (i), which arises in a geometric context, is equivalent to the purely graph- 
theoretic conditions (ii), (iii) and (iv). The latter three conditions and related 
properties of planar graphs are discussed in chapter 5 in this Handbook. (The 
reader should note that in the present chapter, graphs are always “simple” in the 
sense of not having any loops or multiple edges. However, we avoid the designation 
simple graph because of possible confusion with simple polytopes.) 


Theorem 4,1. For a graph G with v nodes and e edges, the following four conditions 
are equivalent: 
(i) G is 3-polytopal; 
(ii) G is planar and 3-connected; 
(iti) if H is the set of all nonseparating circuits in G and f is the cardinality of 
H, then 
(a) v—e+f =2, 
(b) each edge of G belongs to exactly two members of X, and 
(c) whenever two members of H have more than a single node in common, 
their intersection consists of a single edge and its endpoints; 
(iv) G contains a set X of circuits that satisfies (a), (b) and (c). 
Further, if G is 3-polytopal and & is as in (iv), then X is the set of all nonseparating 
circuits in G, and in each realization of G by means of a 3-polytope P, the members 
of X correspond to the boundaries of the facets of P. 


For the interesting history and the proof of Theorem 4.1, see Gr67, The theorem, 
particularly the equivalence of (i) and (ii), has had a great impact on the study of 
3-polytopes. Some of its uses are discussed in sections 5 and 7. It is very useful from 
an algorithmic as well as a mathematical viewpoint, because both the planarity and 
the 3-connectedness of a graph with n nodes can be tested in time O(n) (Hopcroft 
and Tarjan 1973, 1974). 

The following formulation is shown by Danaraj and Klee (1978b) to yield an- 
other linear-time algorithm for recognizing 3-polytopes. 


Theorem 4.2. A 2-complex is polytopal if and only if it is a shellable closed pseu- 
domanifold. 


Another formulation, essentially equivalent to Theorem 4.2, is the following 
from Gr67. 


Theorem 4.3. A 2-complex is polytopal if and only if it is a sphere. 


For yet another equivalent formulation, we define a d-diagram as a complex 
@ in R@ such that the union D = [J@ is a polytope, each face of D other than 
D itself is a member of 9, and CNOD is a member of @ for each C € 9. Of 
course, each Schlegel diagram of a d-polytope is a (d — 1)-diagram. The strict 
converse of this is false, because there are 2-diagrams that cannot be realized as 
Schlegel diagrams of 3-polytopes. (Chapter 36 characterizes plane graphs that are 
projections of polytopes.) However, as shown by Gr67, the converse is correct for 
d = 3 in the following combinatorial sense. 
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Theorem 4.4. Every 2-diagram is isomorphic to a Schlegel diagram of a 3-polytope. 


The following theorem of Gr67 extends Steinitz’s theorem to 3-polytopes with 
symmetries. 


Theorem 4.5. if G is a planar 3-connected graph, then: 

(i) G is isomorphic to the graph of a centrally symmetric 3-polytope if and only 
if there exists an involution f of G such that each node v of G is separated from 
f(v) by some circuit; 

(ii) G is isomorphic to the graph of a 3-polytope with a plane of symmetry if 
and only if G admits an involution that reverses the orientations of its 2-faces. 


When a d-polytope P and a combinatorial automorphism f of the boundary 
complex &(P) are given, f is said to be realizable by an isometry if there exist 
in R? a polytope P’ combinatorially equivalent to P and an isometry of R4 such 
that g(P’) = P’ and g induces the combinatorial automorphism f in @(P’). The 
following is due to Mani (1971). 


Theorem 4.6. For each 3-polytope there is a combinatorially equivalent 3-polytope 
P CR® such that every automorphism of P’s graph is induced by an isometry of - 
R?, 


The proofs of Theorems 4.1, 4.5 and 4.6 use several types of “reductions” of 
3-connected planar graphs described by Gr67. These are inductive methods of 
constructing such graphs from ones that are in some sense simpler. The same is 
true of the proofs of the next four results. 

For a facet F of a polytope P, we say that the shape of F can be preassigned 
if each polytope F’ that is combinatorially equivalent to F is the image of F 
under some combinatorial isomorphism of P. The following is due to Barnette 
and Griinbaum (1970). 


Theorem 4.7. The shape of any facet of a 3-polytope can be preassigned. 


Another shape-assignment problem arises as follows. For a d-polytope Q in R? 
and a point x of R¢\Q, let C(x, Q) denote the union of all rays that issue from x 
and intersect the boundary but not the interior of @. When each such ray intersects 
the boundary in a single point, the profile of Q from x is defined as the subcomplex 
of &(Q) consisting of all faces F of Q that are contained in C(x, Q). This profile 
is always a (d — 2)-sphere. A d-polytope P is said to have the universal shadow- 
boundary property if for each (d — 2)-sphere ¥ in B(P) there exists a d-polytope 
Q in R‘, a point x of R7\Q, and an isomorphism of @(P) onto B(Q) that carries 
f onto the profile of Q from x. The following is due to Barnette (1970). 


Theorem 4.8. Each 3-polytope has the universal shadow-boundary property. 


The next result appears in Gr67. 
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Theorem 4.9. For each 3-polytope P, the boundary complex B(P) is isomorphic 
to the boundary complex of a polytope in the rational 3-space Q°. 


We end this subsection with a result that combines the discrete and the contin- 
uous aspects of polytopes. When V is the vertex-set of a d-polytope P in R?4, a 
realization of P in R¢ is an injection f : V — R4 such that f(V) is the vertex-set 
of a polytope Q and f generates an isomorphism of B(P) onto 2(Q). Two real- 
izations f and g of P in R¢ are combinatorially isotopic if there is a continuous 
one-parameter family of realizations that starts with f and ends with g. It is eas- 
ily seen that each realization of P can be extended to a homeomorphism of R4 
onto itself, and combinatorially isotopic realizations can be extended to isotopic 
homeomorphisms of R*. Hence each realization may be classified as orientation- 
preserving or orientation-reversing. 

The isotopy theorem for 3-polytopes, due to Steinitz and Rademacher (1934), 
may be stated as follows. 


Theorem 4.10. if P is a 3-polytope in R°, then all orientation-preserving realizations 
of P in R? are combinatorially isotopic. 


4.2. The higher-dimensional Steinitz problem 


The above theorems give a fairly complete combinatorial picture of the face-lattices 
of 3-polytopes, and of several sorts of geometric realizability of these lattices. It is 
striking that ail of them fail for higher-dimensional polytopes. The following result 
of Griinbaum and Sreedharan (1967) was the starting point for research on the 
higher-dimensional Steinitz problem. 


Theorem 4.11. There exists a simplicial 3-sphere with 8 vertices that is not isomor- 
phic to the boundary complex of any 4-polytope. 


This result holds also for each d > 4, with 3, 8 and 4 replaced respectively by d — 
1, d+ 4, and d. In fact, it turns out that when d and n — d are both large, the number 
of (combinatorial types of) d-polytopes with n vertices is greatly exceeded by the 
number of such types of nonpolytopal simplicial (d — 1)-spheres. (See section 5 for 
details.) 

Theorem 4.11 was proved by showing that a certain simplicial 3-diagram, first 
presented by Briickner (1909) and assumed by him to arise as the Schlegel diagram 
of a 4-polytope, does not in fact arise in this way. When a d-diagram @ is the 
Schlegel diagram of a polytope P, 2 has properties known as invertibility (arising 
from the fact that each facet of P may serve as the basis of a Schlegel diagram) 
and dualizability (arising from the existence of P’s polar). Briickner’s 3-diagram 
has neither of these properties, but it turns out also that neither is sufficient to 
guarantee that a simplicial 3-diagram is isomorphic to the Schlegel! diagram of a 
4-polytope (Schulz 1979, 1985, Barnette 1980). 

Although Theorems 4.1-4.9 do not extend to general d-polytopes for d > 4, 
they do extend to those whose number of vertices is only slightly larger than d. 
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That is the import of Theorems 4.12-4.16 below, which should be compared with 
Theorems 4.2, 4.5(i), and 4.6-4.8. 


Theorem 4.12. Each piecewise linear (d —1)-sphere with at most d +3 vertices is 
polytopal. 


Theorem 4.13. If a (d — 1)-sphere with at most 2d vertices admits a combinatorial 
involution that has no fixed point then the sphere is isomorphic to the boundary 
complex of a centrally symmetric d-polytope. 


Theorem 4.14, If a d-polytope has at most d +3 vertices, then the shape of each of 
its facets can be preassigned. 


Theorem 4.15. If a d-polytope has at most d +2 vertices, then it has the universal 
shadow-boundary property. 


In each of Theorems 4.12—4.15, it is known that the bound on the number of ver- 
tices cannot be relaxed. It appears that Theorem 4.12 holds without the assumption 
of piecewise linearity, but that assumption is used in the proofs of Mani (1972) for 
the simplicia) case and Kleinschmidt (1976b) for the general case. Theorems 4.13, 
4.14, and 4.15 are due respectively to Kleinschmidt (1977), Kleinschmidt (1976a), 
and Shephard (1972). 

The following is stated in Gr67. 


Theorem 4.16. /f a d-polytope has at most d +3 vertices there is a combinatorially 
equivalent d-polytope P C R¢4 such that every automorphism of B(P) is induced by 
an isometry of R4. 


The strong isotopy theorem 4.10 does not extend beyond dimension 3. In fact, 
the following is only a small sample of the striking results of Mnév (1988) and 
Vershik (1988) on the topological structure of spaces of realizations of polytopes. 


Theorem 4.17. For each d > 4 there exist a d-polytope P C R¢ and an orientation- 
preserving realization f of P in R¢ that is not combinatorially isotopic to the identity 
mapping on P's vertex-set. 


As far as the number of vertices is concerned, a smallest example for Theorem 
4.17 is provided by a certain simplicial 4-polytope P with 10 vertices first con- 
structed and studied by Bokowski et al. (1984). In this case, the f for Theorem 
4.17 is provided by a bijection of P’s vertex-set. This same P is such that although 
@(P) admits a fixed-point free involution and also admits several other combina- 
torial automorphisms, for each polytope Q combinatorially equivalent to P it is 
only the identity automorphism of %(Q) that comes from an affine mapping of Q 
onto itself. (Compare this with 4.6 and 4.16.) 

Although 4.10 does not extend to higher dimensions, it has a consequence that 
may extend. Suppose that P and Q are combinatorially equivalent d-polytopes in 
R?, and Q’ is Q’s reflection in a hyperplane. Must there exist an isotopy A of R? 
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(a continuous one-parameter family h,, 0 < ¢ < 1, of self-homeomorphisms of R?) 
such that Ho is the identity mapping, h,(P) is Q or Q’, and for each t, h,(P) is a 
polytope combinatorially equivalent to P? If so, can it be required in addition that 
when t = 1, h, carries P’s vertex-set onto that of h,(P)? (When d = 3, the latter 
condition can be required for all t. However, that is not true for d = 4, because 
there is a 4-polytope P C R* such that the automorphism group of &(P) is trivial 
and yet the isotopy theorem does not extend to P.) 

Recalling 4.2, we turn now to the uses and limitations of shellability as a tool 
for recognizing d-polytopes when d > 4. The following is due to Bruggesser and 
Mani (1971). 


Theorem 4.18. Each polytope is shellable. That is, the facets of any polytope can 
be arranged in an order that constitutes a shelling. 


In fact, not only is the boundary complex of a polytope always shellable, but 
there exist shellings satisfying various restrictions on the order in which the facets 
appear. That was shown by Bruggesser and Mani (1971) and Danaraj and Klee 
(1974), and partly extended to oriented matroids by Mandel (1981) and Lawrence 
(1984). (It seems that although oriented matroid spheres are shellable, this is not 
known for their duals.) See chapter 9 for a detailed discussion of oriented matroids, 
and see also chapter 34 for the axiomatization of oriented matroids in terms of 
signed bases. 

When is a pure simplicial d-complex, shellings of €@ are easy to recognize, for 
a permutation F,,..., F, of @’s facets is a shelling if and only if it is true when- 
ever 1 <i <j <n that there exists h < j for which F, 9 F; is a (d — 1)-simplex and 
F, 0 F; C F, 1F;. This leads to a routine (though slow) algorithm for testing shella- 
bility and various strengthened versions of it. However, that does not provide a 
procedure for recognizing d-polytopes when d > 4, because all simplicial 3-spheres 
(polytopal or not) with at most 9 vertices turn out to be shellable (Danaraj and 
Klee 1978a). 

Other necessary conditions for the polytopality of a complex € can be expressed 
in terms of €’s f-vector and @’s edge-graph (see sections 6 and 7), but these 
conditions are not sufficient. Indeed, it seems probable that there is no “nice” 
easily tested set of necessary and sufficient conditions for polytopality. A strong 
indication of this is the following theorem of Sturmfels (1988), which says that 
certain nonpolytopal spheres cannot be distinguished from polytopal spheres by 
any test of a “local” nature. 


Theorem 4.19. There exists a sequence {kn}nen of integers and a sequence {Sn}nen 
of triangulated spheres such that the following two conditions are satisfied for each 
n: 

(i) ¥, is a nonpolytopal (k, — 1)-sphere with k, +4 vertices; 

(ii) each proper subcomplex J of F extends to a polytopal sphere with no 
additional vertices. 


Since there seems to be no hope of finding an easily understandable characteri- 
zation of d-polytopality, it is natural to turn to the algorithmic approach. As noted 
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in Gr67, the following is a consequence of the Tarski (1951) decision method for 
real-closed fields. 


Theorem 4.20. There exists an algorithm for deciding whether a given complex is 
polytopal. 


This is in contrast with the observation of S.P. Novikov (in Volodin et al. 1974) 
that there is no algorithm for deciding whether a given 5-manifold is a sphere. 
However, Tarski’s method, though improved by Collins (1975) and Renegar (1992), 
seems to be still far from being applicable to test polytopality in significant cases. 
It seems likely that the problem of polytope recognition is NP-hard, and it does 
not seem to be known even whether the problem belongs to the class NP. (This 
class is discusssed in chapter 29.) 

For the practical recognition of polytopal complexes, by far the most success- 
ful method has been the one initiated by Bokowski and developed further by 
Bokowski and Sturmfels (1987). It has been used to enumerate combinatorial 
types of 4-polytopes (see section 5), to prove part of Theorem 4.19, and in a num- 
ber of other special problems. For example, it was used to establish polytopality 
of the Bokowski-Ewald—Kleinschmidt sphere mentioned after 4.17. 

The Bokowski-Sturmfels approach is based on the theory of oriented matroids. 
They describe oriented matroids in terms of signed bases and call the resulting 
structure a chirotope. (By results of Lawrence 1982 and Dress 1986, this definition 
is equivalent to the definitions of oriented matroids formulated by Bland and Las 
Vergnas 1978 and Folkman and Lawrence 1978. However, the approach in terms 
of signed bases is better suited to computations that arise from the combinatorial 
study of polytopes.) 

Before giving the formal definition of a chirotope, we note that if the vertices 
Y1,-++5¥n Of a k-polytope P in R* are embedded in R* x R as the points yi = (y;, 1), 
then many properties of P are reflected in algebraic relations among the y; (cf. 3.1). 
In particular, various combinatorial restrictions on @(P) lead to conditions that 
must be satisfied by the (A + 1) x (k + 1) subdeterminants of the matrix whose rows 
are the y;. Chirotopes provide an abstract framework for studying such conditions 
in terms of signs of determinants. 

For d <n, let 


A(n,d) = {(A1,...Ag) ENF 21 SAY < ++ < Ag <n}, 


the set of all increasing d-tuples of integers between 1 and n. A mapping y: 
A(n, a) > {—1,0,1} is a d-chirotope with n vertices if for each x € A(n, d+ 2) there 
are vectors X,,,.-.,X,,,. in R? such that 


X(A) = sign(det(x,,,...,xa,)) (4.1) 
for all A € A(n,d) with A C pu. (Here y is extended to {1,...,”}¢ in the canonical 
alternating way.) The chirotope x is realizable if there exist x1,...,%n € RR? such 


that 4.1 holds for all A € A(n, d), and in this case x is called the chirotope of linear 
dependencies on (x1,...,Xn). 
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A vector y € {—1,0,+1}" is a cocircuit of the chirotope xy : A(n,d) — {—1,0, +1} 
(or of the associated oriented matroid) if there exists » € A(,d — 1) such that for 
each i, the ith component of y is equal to y(14,-.., #a-1,4). A facet of y is a set 
F Cc {1,...,} that corresponds to a nonnegative cocircuit in the sense that for 
some » € A(n,d —1) it is true that 


i) = 0 forie F, 
XUEIs +s Bad-1t) = 9 9 for? € {1,...,n}\F. 


A set GC {1,...,n} is called a face of x if G is the intersection of some facets of 
x. The set of all such faces, ordered by inclusion, is denoted by FL(y). 

For any convex k-polytope P in R*, with vertex set y,,-.-,yn, P’s face-lattice 
is isomorphic to FL(y) for the chirotope x of linear dependencies on the set 
{(1,1),---, a, 1)} C R* x R. That is almost obvious, for (interpreting y as the 
determinant of k +1 points in R**') the definition of a facet of a chirotope gen- 
eralizes the fact that the vertex set of a polytope’s facet is the intersection of a 
supporting hyperplane with the polytope’s vertex set. 

The Bokowski-Sturmfels procedure accepts as input a (d —2)-sphere ¥ with n 
vertices, and its aim is to decide whether ¥ is polytopal. Success is not guaranteed, 
for the procedure may fail to terminate, but if it does terminate then it does so 
with a decision as to whether the sphere is polytopal. It is not feasible to include 
more than a very rough description of the method, but we do include that because 
of the special successes of the method. 

The first step consists of computing the finite set €,,,(P) of all d-chirotopes x 
with n vertices such that FL(y) is isomorphic with ¥. That is accomplished with 
the aid of the Pliicker-Grassmann relations, algebraic relations among the subde- 
terminants of a matrix (Hodge and Pedoe 1968). The close connection between 
these and chirotopes is discussed by Bokowski and Sturmfels (1985). The second 
step is to test realizability: if there is a realizable y € €,,,(¥), then # is poly- 
topal, and otherwise ¥ is nonpolytopal. This step is generally more difficult and 
time-consuming than the first one. However, as shown by Bokowski and Sturm- 
fels (1986), the second step can also be based in part on the Pliicker-Grassmann 
relations. For example, for each individual chirotope of the list produced in the 
first step, it may be possible to find a Pliicker-Grassmann relation that is violated 
by the sign-patterns of the determinants. When this occurs, the sphere has been 
shown to be nonpolytopal. The computational difficulty consists of producing such 
a contradiction systematically. 


4.3. Realization over ordered fields other than R 


Among ordered fields, only the real field R is complete and only the subfields of 
R are Archimedean. We turn now to the realization of polytopes in vector spaces 
over ordered fields other than R. Of course, the subfields of R are of special 
interest. It is almost obvious (and also true!) that each simplicial d-polytope P in 
R? is combinatorially equivalent to one in the rational d-space Q¢, for the vertices 
of P can be moved, one by one, to nearby points of Q? without disturbing P’s 
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combinatorial structure. Similarly, each simple polytope in R? is combinatorially 
equivalent to one in Q*. By Theorem 4.9, it is true when d < 3 that every polytope 
in R¢ is combinatorially equivalent to one in @?. It is unknown whether this result 
is valid when d = 4. However, the following theorem shows that it fails for d > S. 
Part (i) of Theorem 4.21 is shown by Richter-Gebert (1994) and part (ii) is shown 
by Perles in Gr67 (using the Gale diagrams described in section 3). 


Theorem 4.21. (i) For each d,5 < d <7, R@ contains a d-polytope with d +91 ver- 
tices that is not realizable in Q*. 

(ii) For each d > 8, R* contains a d-polytope with d +4 vertices that is not real- 
izable in Q4. 


With A denoting the field of all real algebraic numbers, the following two results 
are due to Lindstrém (1971) and Mnév (1983) respectively. 


Theorem 4.22. For an arbitrary ordered (even non-Archimedean) field ®, each 
polytope in ©4 can be combinatorially realized in A4. 


Theorem 4.23. A is the smallest subfield of R such that every polytope in a real 
vector space is realizable in some vector space over A. : 


To prepare for the final result of this section, recall the fact (the negative solution 
of Hilbert’s tenth problem by Matiyasevié in 1971) that there is no algorithm for 
deciding the integer solvability of Diophantine equations. As discussed by Mazur 
(1986) and Klee and Wagon (1991), it is still unknown whether this is true of the 
rational solvability of such equations. Sturmfels (1987c) proves the following, along 
with some related results that involve oriented matroids. 


Theorem 4.24. The following two statements are equivalent: 
(i) there exists an algorithm that determines, for an arbitrary finite lattice L, 
whether L is isomorphic to the face-lattice of a polytope in a rational vector space; 
(ii) there exists an algorithm that determines, for an arbitrary polynomial f with 
integer coefficients, whether f has zeros in the field of rational numbers. 


§. Enumeration of combinatorial classes 


We use the term combinatorial class to mean an equivalence class of polytopes 
with respect to combinatorial equivalence or a more elaborate relationship that 
involves additional information. The problem is to count or estimate the number 
of such classes corresponding to certain specified parameters (e.g., dimension and 
number of vertices or edges). For the history of the subject, see Gr67. 

This section is divided into three parts, as follows: Enumeration of 3-polytopes; 
Classes with few vertices; Higher-dimensional asymptotic results. Enumeration in 
the 3-dimensional case was first considered by Euler. With the aid of Steinitz’s 
characterization of 3-polytopes, sharp asymptotic results have been obtained and 
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several exact enumerations have been made. The second part deals with exact 
enumeration for d > 4, an area in which only a few results are known. The third 
part describes some higher-dimensional asymptotic estimates that focus on what 
happens as the dimension or number of vertices tends to infinity. 


5.1. Enumeration of 3-polytopes 


For 3-polytopes, Duivestijn and Federico (1981) provide a comprehensive survey 
of exact enumerations. We extract some of the most important results stated in 
their paper, but omit most further references as they can be found there. 

All enumerations of classes of 3-polytopes count 3-connected planar graphs, the 
general method being to construct all members of the class (often by a graph 
operation that introduces a new edge splitting a 2-face), and then, using suitable 
numerical characteristics of the graphs obtained, avoid the multiple counting of 
isomorphic copies. Exact enumeration is most complete for C\(n), the number 
of isomorphism-types of 3-connected planar graphs with n edges, and for R;(n), 
the number of such rooted graphs. To root a graph that is embedded in the plane 
means to specify an edge as the root, to give the edge a direction, and to distinguish 
between the two sides of the edge. 

If a planar graph G is 3-connected and has n edges, then (since G admits an 
essentially unique embedding in the plane) the number of rooted versions of G is 
4n. Thus the number of distinct rooted graphs corresponding to G is 4n/h, where 
h is the order of G’s automorphism group. If none of the graphs with n edges 
were symmetric, C;() would be equal to the quantity A,(n) := R,(n)/4n. As it is, 
A;(n) is a lower bound for C;(#) and is regarded as a good estimate of C,(m) for 
large n since the percentage of symmetric graphs with m edges is observed to fall 
drastically as increases. 

Table 1 gives some exact values for C\(n), Ri(m), and A,(7). It is taken from 
Duivestijn and Federico (1981), and the actual graphs are available on tape from 
the first author of that article. The same article contains the values given in table 
2 for Co(n), the number of combinatorial types of 3-polytopes with n vertices. In 
table 2, So() is the number of types of simplicial 3-polytopes with vertices; these 
values are taken from Griinbaum (1970). The rooted analogue of So(n) is given 
precisely for all n by the following theorem of Tutte (1962), from which the values 
for R,(n) can be deduced with the aid of Euler’s theorem. 


Theorem 5.1. The number of rooted simplicial 3-polytopes with v vertices is pre- 
cisely 


4av-11)! 3 a: 
Gu — iw — 2)! ~ 166m) ( aT}? 


where = indicates that the ratio converges to 1 as v — 00. 


For unrooted 3-polytopes, sharp asymptotic formulas for the number of combi- 
natorial types are established by Bender (1987). 
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Table 1 

n Cin) A,(n) Ri(n) 

6 1 1 

8 1 4 

9 2 6 
10 2 24 
11 4 66 
12 12 12 214 
13 22 “13 676 
14 58 40 2209 
15 158 122 7296 
16 448 333 24460 
17 1342 1220 82926 
18 4199 3946 284 068 
19 13384 12920 981 882 


20 43708 42767 3421318 
21 144810 142948 12007554 
22 485704 482 002 42416488 


23 1638248 150718770 

24 5.608 558 538 421 590 

25 19328566 1932856590 

26 67017765 6969847 484 

Table 2 

n 45 6 7 8 9 10 11 12 
Colm) 1 £2 #7 34 257 2606 32300 2437557 36363115 
Som) 1°91 2 = =«5 14 50 233 1249 7595 


Theorem 5.2. Let C(i, j) denote the number of combinatorial types of 3-polytopes 
that have i + 1 vertices and j +1 facets, and for 0 < i < 2 let C;(n) denote the number 
of types with n i-faces. Let 


BU) = saaeep (x) ( | 


Ci, f) * BGs), 


Then 


an(4+ /7) 


1/2 
A B(n-1,4(—-1)3+ V4), 


C2(n) = Co(n) = ( 


vm nn 
Ci(n) = “q Bla, 5): 


where fractional factorials in binomial coefficients are to be approximated by Stir- 
ling’s formula. 
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5.2. Classes with few vertices 


The d-simplices are the only d-polytopes with d+1 vertices. Beyond that, the 
combinatorial types of d-polytopes have been enumerated for arbitrary d only 
when the number of vertices is d + 2 or d+ 3. The results are based on Theorem 3.1 
in conjunction with the low dimensionality of the relevant Gale diagrams. 


Theorem 5.3. Among d-polytopes, the number of combinatorial types with d +2 
vertices is |d?/4]. 


There is also a formula for the number with d +3 vertices, but it is too compli- 
cated to state here, so we refer to Lloyd (1970). 


Theorem 5.4. Among simplicial d-polytopes, the number of combinatorial types 
with d +2 vertices is equal to |d/2| and the number with d +3 vertices is equal to 


d+4 1 
[a/2]_ | 2" * ee (d+3)/h 
Ss 2 | +e deez, 
odd al(d+3) 


where @ is Euler’s function. 


The Gale diagram of a d-polytope with d +2 vertices is 1-dimensional. In the 
simplicial case, each vertex is mapped to —1 or 1 in the Gale diagram, and each of 
—1 and 1 receives at least two vertices. If mm (m_1) denotes the number of vertices 
received by 1(—1), then two polytopes are isomorphic if and only if they yield the 
same unordered pair {m_;,,}. That takes care of the first part of 5.4, and 5.3 is 
similar except that some vertices may be mapped to 0. 

The second part of 5.4 is due to Perles (in Gr67). The results for d +3 vertices 
arise from complicated counting procedures for 2-dimensional Gale diagrams, us- 
ing Polya’s method. 

It has been possible to go a bit farther in the 4-dimensional case. The following 
statement combines results of Griinbaum and Sreedharan (1967), Barnette (1973a), 
Altshuler et al. (1980) and Altshuler and Steinberg (1985). 


Theorem 5.5. Among 3-spheres, there are 1336 combinatorial types with 8 vertices, 
1294 of which are polytopal. Among simplicial 3-spheres, there are 39 types with 8 
vertices, 37 of which are polytopal, and 1296 types with 9 vertices, 1142 of which 
are polytopal. 


McMullen and Shephard (1970) and Ewald et al. (1976) count types of polytopes 
with few vertices and symmetry group G of given order g. A typical result states 
that for k > 2 and d = (g — 1)k +1 there are exactly |k?/4| types of d-polytopes 
for which the number of vertices is gk and the set of points fixed by G is 1- 
dimensional. 

A cyclic d-polytope with n vertices is defined as the convex hull of 7 points of the 
moment curve M, in R¢, where Mg = {(t,t7,...,4): ¢ € R}. These d-polytopes are 
simplicial, and have the remarkable property of being neighborly in the sense that 
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each set of |d/2] vertices is the vertex set of a face. (For even d, neighborliness 
implies simpliciality.) For a given dimension and number of vertices, all cyclic 
polytopes are combinatorially equivalent (Gale 1963), but other types of neighborly 
polytopes exist and there has been interest in counting them because of the central 
role played by neighborly polytopes in connection with the Upper-bound theorem 
(see 6.10-6.11). The formula in the following result was established by McMullen 
(1974), using Gale diagrams. 


Theorem 5.6. For even d, all neighborly d-polytopes with d +2 vertices are equiv- 
alent, as are all with d+3 vertices. For odd d = 2k +1, there are 2 combinatorial 
types with d +2 vertices, and the number of types with d +3 vertices is equal to 


1{G + (2k tee! 4 6) est ys sworn), 


odd nlx+2 


where @ is Euler’s function. 


Altshuler (1977) enumerated the neighborly 3-spheres with 10 vertices, and by 
checking these for polytopality, Bokowski and Sturmfels (1985) found that there 
are 431 types of neighborly 4-polytopes with 10 vertices. Bokowski and Shemer 
(1987) found that there are 37 types of neighborly 6-polytopes with 10 vertices. 


5.3. Asymptotic results for arbitrary dimension 


Theorem 5.7. For each dimension d there are positive constants a, b and c (each 
depending on d) such that for ali n > a: 
(i) the number of combinatorial types of simplicial neighborly d-polytopes with 

n vertices ts at least n*”; 

(ii) the number c(n,d) of combinatorial types of d-polytopes with n vertices is 
at most nt4+1)n. 

(iii) the number s(n,d) of combinatorial types of triangulated (d — 1)-spheres 
with n vertices is between ce?" and no, 


The lower bound (i) is due to Shemer (1982), who shows that for each d the 
constant a may be taken arbitrarily close to } for sufficiently large n. The upper 
bound in (ii) actually applies to vertex-labeled polytopes. It is due to Goodman 
and Pollack (1986), whose original proof for the simplical case used a bound on the 
sum of the Betti numbers of a real algebraic variety. They (and Alon 1986) were 
later able to handle arbitrary polytopes, and to improve the bound. In particular, 
as is shown in chapter 32, it can be reduced to 


ny Fn(i+0( aha 
(") (1+0( adam) 


for ” os) 
oriimae. oF 
d 


The upper bound in (iii) follows from the Upper-bound theorem for triangulated 
spheres (see 6.10), and the lower bound is proved by Kalai (1988a) by extending 
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the construction of Billera and Lee (1981) of polytopes satisfying the McMullen 
conditions on f-vectors. (These tools are discussed in section 6.) The following 
consequences of (ii) and (iii) are noted by Kalai. 


Theorem 5.8. For each fixed d > 4 it is true that 
c(n, d)/s(n,d) +0 asn— oo. 
For each fixed b > 4, it is true that 
c(d+b,d)/s(d+b,d) -~0 asd—oo. 


6. Face-vectors 


For a polytope or complex X, the f-vector (or face-vector) f(X) is the sequence 
(fo(X), fi(X), ...), where f;(X) is the number of i-dimensional faces of X. When 
X is a d-polytope (d-complex), the sequence may be terminated at fy_)(X) (fa(X)) 
without loss of information. Then the f-vector of a polytope is the f-vector of its 
boundary complex. 

Since the time of Euler, the study of f-vectors has been a central topic in the 
combinatorial theory of polytopes. The notion itself is quite elementary, but the 
sharpest results concerning f-vectors have required deep applications of modern 
tools from commutative algebra and algebraic geometry. Griinbaum (1967, 1970) 
discusses the history of the subject up to 1970; Stanley (1985) and Bjémer (1986) 
provide more recent surveys. 

This section is divided into the following subsections: General complexes and 
multicomplexes; Euler’s relation and the Dehn-Sommerville equations; Face- 
vectors of simplicial polytopes; Face-vectors of general polytopes; Upper-bound 
and lower-bound theorems for polytopes; Upper-bound and lower-bound theo- 
rems for polytope-pairs. 


6.1. General complexes and multicomplexes 


The f-vectors of general complexes have been characterized, and the notions used 
there are used also in characterizing f-vectors of polytopes. For each pair of pos- 
itive integers k and n, there is a unique representation of 7 in the form 


rae) e+ (3), 


with a, > ay,_, > +++ >a; 2i> 0. This is called the k-canonical representation of 
n. In terms of it, we define 


(k)_ f  % a1 », G; 

‘ Car esey +(,,); 
w) _ (%+4 ay) +1 a;+1 
ale Cy one) 
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and set 
0 — 0 =0. 


The following is proved by Kruskal (1963) and Katona (1968) for simplicial 
complexes, and extended by Wegner (1984) to general complexes. See chapter 24 
for details. 


Theorem 6.1. For an ultimately vanishing sequence f = (fo, f,.-.-) of nonnegative 
integers, the following three conditions are equivalent: 
(i) f is the f-vector of a simplicial complex; 
(ii) f is the f-vector of a complex; 
(iii) fO") < fyi for each k > 1. 


A multicomplex is defined as a nonempty set of monomials in a finite number 
of variables such that each divisor of a member of is itself a member of . 
The f-vector of M is the sequence (f(M), fi(M),...), where f,() is the number 
of monomials of degree i in M. (Of course, fo(M) = 1.) When © is a simplicial 
complex with vertex set V, the corresponding multicomplex A¢ is formed by asso- 
ciating a variable x, with each uv € V, and defining dt as the set of all monomials 
of the form x,,%v,--- Xv, Where {vo, 0}, ...,0,} is the vertex set of a face of €. Then 
the f-vector of Mé¢ is just an index-shifted version of the f-vector of @. 

The following theorem is essentially due to Macaulay (1927), but the explicit 
formulation is that of Stanley (1978). 


Theorem 6.2. For an ultimately vanishing sequence f = (fo, fi,...) of nonnegative 
integers, the following three conditions are equivalent: 

(i) f is the f-vector of a muiticomplex; 

(i) there exists a field K and a finitely generated K-algebra R = Qj.) Ri such 
that Ro is isomorphic to K, R, generates R, and f lists the values of R’s Hilbert 
function (i.e. f; = dim R; for each i); 

(iii) fo = 1, and fxr < ft” for each k > 1. 


Sequences f = (fo, fi,.-.) satisfying the conditions of 6.2 are called O-sequences. 
They play a crucial role in characterizing the f-vectors of simplicial polytopes. 
6.2. Euler's relation and the Dehn—Sommerville equations 


Theorem 6.3. For the f-vector of a d-polytope, 
d-1 
Sof = 1+ (71. 
j=0 


The Euler relation (Theorem 6.3) was established by L. Schlafli in 1852, assum- 
ing the shellability of polytopes (see 4.18). The first complete proof of 6.3 was that 
of Poincaré (1893), using topological methods and applying to more general situ- 
ations. Theorem 6.3 is the only affine relation among the numbers fo, fi, ..-, fa-i 
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that holds for ali d-polytopes, and hence the affine hull of the f-vectors of all 
d-polytopes is of dimension d — 1 (Gr67). However, there are additional affine 
relations for simplicial polytopes. These are the Dehn-Sommerville equations for- 
mulated as follows in Gr67. (See Griinbaum 1967, 1970 for the history of these 
and the Euler relation.) (By convention, f_; = 1.) 


Theorem 6.4, For the f-vector of a simplicial d-polytope, it is true that 


d-1 a 
fi= Deo(? . ifs -l1<i<d-2. 
j=i 


i+] 


Here the Euler relation is obtained by setting i = —1. It turns out that if p is 
any affine relation for the sequence (fo,..., f4_1) that holds for the f-vectors of all 
simplicial d-polytopes, then p is an affine combination of the relations in Theorem 
6.4. Further, an appropriately chosen set of |(d+1)/2] of these relations forms 
a basis for all of them (Gr67), whence the affine space consisting of all p is of 
dimension |d/2]. 

Klee (1964a) establishes the Dehn-Sommerville equations for simplicial com- 
plexes much more general than the boundary complexes of simplicial polytopes. 
It is required only that the link of each face (including the empty face) has the 
same Euler characteristic as a sphere of the appropriate dimension. Complexes 
satisfying this condition are called Eulerian manifolds. See Stanley (1986) for a 
simple treatment in terms of Eulerian posets. 

We digress to mention regular polytopes. If P is a 3-polytope, and j and k are 
positive integers such that each vertex of P is incident to j edges and each 2-face 
of P is incident to k edges, then jfp = 2f; = kf). It follows from Euler’s theorem 
that 

2 s: 2 ae 2 

ik fi 
There are only five solutions of this equation in integers greater than or equal to 3. 
These solutions correspond to the five regular 3-polytopes (Platonic solids), whose 
numbers of facets are 4, 6, 8, 12 and 20, respectively. For each d > 3, the simplices 
(d +1 facets), cross-polytopes (24 facets) and cubes (2d facets) satisfy the most 
stringent regularity conditions, and for d 2 5 these are the only possibilities. How- 
ever, for d = 4 there are also regular polytopes whose numbers of facets are 24, 120 
and 600, respectively (see Coxeter 1963 for these). The various (equivalent) defi- 
nitions of regularity of polytopes all involve metric notions, but McMullen (1967) 
defined a notion of combinatorial regularity of polytopes and proved that a poly- 
tope is combinatorially regular if and only if it is combinatorially equivalent to a 
regular polytope. 

Gr67 discussed various homogeneity properties of polytopes that are related to 
but weaker then regularity. And several authors have defined purely combinatorial 
notions that are patterned after the theory of regular polytopes and provide suit- 
able frameworks for the study of combinatorially regular structures. References 
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to these appear in the paper of McMullen and Schulte (1990), which describes 
methods for constructing the regular incidence-polytopes introduced by Danzer 
and Schulte (1982). 


6.3. Face-vectors of simplicial polytopes 


If #2 denotes the collection of all simplicial d-polytopes, and f(F¢) the set of all f- 
vectors of such polytopes, then 6.4 determines the affine hull of f(#2). It is a much 
deeper task to provide an intrinsic description (without reference to polytopes) of 
the set f(%2) itself. Such a description was conjectured by McMullen (1971) and 
proved by Stanley (1980) and Billera and Lee (1980). The solution of this problem 
is a milestone in the combinatorial study of polytopes. 

McMullen’s conjecture is most conveniently stated in terms of h-vectors, where 
the A-vector (fo, ...,4g) of a simplicial (d — 1)-complex © is defined as follows: 


i ag 
hy = Y(-1) (Gj) few O<i<d. 
j=0 


Note that 4g = 1 and hk, = fy — d. The A; have various interesting interpretations, 
to which we return later. 
The f-vector can be recovered from the h-vector, for 


j+l é 
d-i F 
San O<j<d. 


i=0 j 


Alternatively, the relationship between f-vectors and h-vectors may be summarized 
by means of the following equality in terms of generating polynomials: 


Do f-ix! = Doha + VY. 
Here is McMullen’s characterization. 


Theorem 6.5. An integer vector (ho, ..., hg) is the h-vector of a simplicial d-polytope 
if and only if the following three conditions are satisfied: 
(i) h; = hq_i,9 <i |a/2]; 
(il) hin > hj,0 <i < |d/2); 
(iii) ho =1 and hig, — hj < (h; = hy)”, 0<i< |d/2|. 


Condition (i) is equivalent to the Dehn-Sommerville equations for the f;, and 


the equality 4g = hg is the Euler relation. Note also that condition (iii) is equivalent 
to the assertion that 


(ho, hi — ho, .--Ayay2) — Aya/2)-1, 9,9, .. ) 


is an O-sequence as defined above. This fact is the crucial part of Stanley’s proof of 
the “only if” part of Theorem 6.5. He shows that the quantities h; — hj_1(i < |d/2]) 
are the successive values of the Hilbert function of the rational cohomology ring 
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(modulo a suitable ideal) of a certain algebraic (toric) variety associated with a 
simplicial polytope. 

Using the polytope algebra, McMullen (1993) proved the “only if” part of 6.5 
in a way that remains in the framework of convexity. His proof reveals interesting 
connections between the combinatorics of polytopes and the Brunn—-Minkowski 
theory of convex bodies. 

Starting from a sequence that satisfies condition (iii) of 6.5, Billera and Lee show 
that certain facets of a suitable cyclic (d + 1)-polytope C form a shellable ball & 
that has the sequence as its h-vector. Then with an appropriate realization of C in 
R**!, there is found a point z of R%*! such that z is beyond the facets of C that 
form @ and beneath C’s other facets. The desired d-polytope is then obtained as 
the intersection of the convex hull of {z} UC with a hyperplane that separates z 
from C. 

There remain three important problems associated with 6.5: 

(1) Because McMullen’s and Stanley’s constructions rely so heavily on the poly- 
tope’s convexity, their proofs of “only if” do not extend directly to the case of 
the h-vectors of spheres. Nevertheless, it is believed that the characterization does 
extend. 

(2) Which d-polytopes realize equality in (ii) for a given i? McMullen and 
Walkup (1971) conjecture that they are precisely the ones that are i-stacked, mean- 
ing that they can be triangulated without introducing any new j-faces for j < d—i. 
However, the conjecture has been proved only for i = 0 andi = 1. (A stacked poly- 
tope is one that is 1-stacked. Equivalently, a d-polytope P is stacked if and only if 
it appears in a sequence Pp, P;,... of d-polytopes such that Po is a d-simplex and 
Pi, is formed from P; by adding a pyramidal cap over some facet of P;.) 

(3) Using Stanley’s proof of the “only if” part of 6.5, Billera and Lee (1981) prove 
a similar result for the h-vector of what they call a simplicial polyhedral (d — 1)- 
ball. They conjecture that their conditions are also sufficient for the existence of 
such a ball with the given h-vector, but this remains open. 


6.4. Face-vectors of general polytopes 


Theorem 6.6. A triple of integers (fo, fi, fr) is the f-vector of a 3-polytope if and 
only if it is true that 


fo-fit+th =2, 4< fo <2f.—4 and 4<hi <2fo—4. 


Theorem 6.6 is due to Steinitz (1906). For d > 4 there does not exist even a 
reasonable conjecture for a characterization of f-vectors of general d-polytopes. 
However, there have been two directions of progress, and we shall describe them. 

Let ¥(d) denote the collection of all subsets of the set {0,1,...,d—1}. The 
extended f-vector of a d-polytope P is the function f defined on (d) by setting 
f(0) := 1 and, when S = {i,,i,...,%} € F(a) with 4 <i, <---> <i, letting fs(P) 
denote the number of chains of faces of P of the form F, C F2 C --- C Fy with 
dim F; = i; for 1 < j < k. Thus P’s ordinary f-vector is just the restriction of the 
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extended f-vector to the singletons in #(d). And when P is simplicial, the ordinary 
f-vector completely determines the extended one. 

The following analogue of the Dehn-Sommerville equations is proved by Bayer 
and Billera (1985a) for Eulerian manifolds and hence for arbitrary polytopes. 


Theorem 6.7. Suppose that P is a d-polytope, -1 <i <k <d, {i,k} CSU{-1,d} 
and the set S © ¥(d) does not include any j between i and k. Then 


k-1 
> (19 fsuyy (P) = fs(P)[1 = (-1)*-*"1]. 


f=t+l 


When i = —1, k =d, and S = @, the equality of 6.7 is just the Euler relation. 
Bayer and Billera find a basis for the affine span of the extended f-vectors of 
all d-polytopes, and show that the dimension of this span is cy —1, where cq is 
the dth Fibonacci number (cp = c, = 1, cg = Cg_ + Cg_2). (Another basis is found 
by Kalai 1988b.) Carrying the investigation further in the case of 4-polytopes, 
Bayer (1987) finds inequalities for extended f-vectors that lead to new conclusions 
concerning ordinary f-vectors. (Projections of f-vectors of 4-polytopes onto two 
coordinates are completely determined in Gr67 and papers of D. Barnette and 
J. Reay mentioned by Bayer.) 

For an arbitrary d-polytope P, Stanley (1987a) introduces a new combinatorial 
invariant consisting of the polynomial gp(x) defined by the following recursion: 

(i) go(x) = 1; 
(ii) deg gp(x) < (dim P)/2; ee 

(iti) x4" gp(1/x) — gp (x) = Dorcocp) BF (x): (x — 1m? aim? 

It turns out that the coefficients of the polynomial gp(x} are linear functions of 
the numbers f5(P) defined earlier. And from the fact that gg = 1 when Q is a 
simplex, it is deduced that in the case of simplicial polytopes, (iii) corresponds to 
an equation for a generating function defining the quantities 4;,, — 4;. Thus the 
coefficients of the polynomial gp may be viewed as a generalization of the A-vector 
to nonsimplicial polytopes. 


Theorem 6.8. For each rational polytope P, all coefficients g; of the polynomial gp 
are nonnegative. 


This theorem, which may be regarded as a counterpart of the unimodality of the 
h; for simplicial polytopes, is conjectured to apply to all polytopes. However, that 
remains open because the theory of toric varieties does not apply to nonrational 
polytopes (which do exist — see section 4) and Stanley’s proof of 6.8 uses a compu- 
tation (by Bernstein, Khovanskii and MacPherson) of the intersection homology 
groups of toric varieties. 

It is known that g; = fo — d — 1 > 0, and g> is shown by Kalai (1987) to be the 
dimension of the space of stresses of a polytopal framework based on P. Hence 
the conjecture is correct at least for g; and go. Perhaps further results about the g; 
will yield deeper insight into the behavior of the f-vectors of general polytopes. 
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6.5. Upper-bound and lower-bound theorems for polytopes 


The vertices of a polytope P are of special interest with respect to the maximization 
of a convex function f on P,, because f’s maximum is attained at a vertex. There are 
good tests for recognizing local maxima, and when f is linear each local maximum 
is in fact a global maximum. (That is the basis of the simplex algorithm for linear 
programming.) However, if f is merely convex it may happen that each vertex 
provides a strict local maximum even though only one of them provides a global 
maximum. This leads to concern about how many vertices P may have when its 
dimension and number of facets are known, or, equivalently (by polarity), how 
many facets P may have when its dimension and number of vertices are known. 
The proof of the following is due to Gale (1963). 


Theorem 6.9. For a cyclic d-polytope with n vertices, the total number of facets is 


ia (" - Las ”) 7 (" - Utes a), 


The following was conjectured by T.S. Motzkin in 1957, 


Theorem 6.10 (Upper-bound theorem). Whenever 0 <i < d, the cyclic d-polytopes 
with n vertices have the maximum number of i-faces among all d-polytopes with n 
vertices. 


It is easy to see, by “pulling” the vertices slightly, that the maxima in 6.10 are 
attained by simplicial d-polytopes. For these, the conclusion of 6.10 follows from 
6.5’s description of the f-vectors of simplicial polytopes. (In fact, as noted by 
Bayer and Billera 1985b, the cyclic polytopes maximize all of the numbers fs(P) 
mentioned above.) However, 6.10 is known in much greater generality than is 
6.5. Working directly from the Dehn-Sommerville equations of 6.4, Klee (1964b) 
shows that for each d, 6.10 applies to all simplicial Eulerian manifolds in which the 
number of vertices is sufficiently large relative to the dimension (at least d?/4 for 
the case k = d — 1). Using the shellability of polytopes, Theorem 4.18, established 
by Bruggesser and Mani (1971), McMullen (1970) proves 6.10 for all polytopes. 
And with the aid of appropriate machinery from commutative algebra, Stanley 
(1975) extends 6.10 to all simplicial spheres. 

In addition to shellability, McMullen’s proof uses a nice interpretation of the 
quantities h; defined just before 6.5. Let F,,..., 7, be a shelling order of the facets 
of a simplicial d-polytope P, so that for 2 << k <r the intersection F, (Ui Fi) 
is the union of certain facets of F, . For 1 <k <r, let G, denote the intersection 
of all faces of F, that are not faces of any F; with i < k. Then A; is just the number 
of G, that have precisely i vertices. Using this interpretation of the h; , it is seen 
that the Upper-bound theorem is equivalent to the system of inequalities 


Mae), O<i<|d/2}, 


which are direct consequences of 6.5 (iii). 
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Stanley’s argument proceeds as follows for a simplicial complex @ with vertices 
U},---,Un. Let K be an infinite field and let A = K[x,,...,2,], the polynomial ring 
in independent variables x;,...,x, over K. Let I be the ideal of A generated by 
the monomials x;, .. .x;, for which {Uj,,--+5 u;,} 1S not the vertex set of any face of 
. Stanley shows that if the ring Ag := A/I (the Stanley—Reisner ring of @) has the 
Cohen-Macaulay property then there is an ideal J in Ag such that @’s h-vector 
is the Hilbert function of the graded K-algebra A¢/J. It then follows from 6.2 
that the 4; form an O-sequence, which implies the above inequalities on the ; 
and hence the upper-bound theorem. The proof of the upper-bound theorem for 
simplicial spheres is completed by citing the theorem of Reisner (1976) that if ¢ 
is a simplicial sphere then Ag does have the Cohen-Macaulay property. 

Another important consequence of 6.5 is the following. 


Theorem 6.11 (Lower-bound theorem). Whenever 0 < i < d the stacked d-polytopes 
with n vertices have the minimum number of i-faces among all simplicial d-polytopes 
with n vertices. 


An equivalent formulation of 6.11 is that if (fo, f\,...) is the f-vector of a sim- 
plicial d-polytope, then 
(i) fa-1 2 (4 - 1) fo — (d+ 1)(d — 2), 
Gi) fi>Ofo- Gye (<i <d —2). 
These inequalities can be derived directly from the following consequence of 6.5: 
hy =fo-—d and hj 2h, for 1 gig |d/2]. 


Like 6.10, Theorem 6.11 is a consequence of 6.5 but is known in greater gener- 
ality than 6.5. Barnette’s original proof (1971, 1973b) of 6.11 uses a reduction to 
polytopal graphs, and his (1982) paper extends the argument to homology man- 
ifolds. Klee (1975) proves (i) for closed simplicial pseudomanifolds. Using argu- 
ments from the rigidity theory of frameworks, Kalai (1987) establishes 6.11 for 
triangulated manifolds, and shows that equality holds in (i) or (ii) if and only if 
the manifold is equivalent to the boundary complex of a stacked polytope (a char- 
acterization proved for polytopes by Barnette (1973b) and Billera and Lee 1981). 
Kalai (1987) also proves (ii) for a certain class of closed pseudomanifolds, and 
shows that for general closed pseudomanifolds, (ii) is a consequence of an old 
unsettled conjecture on the rigidity of 2-manifolds. 

The case of centrally symmetric polytopes is of special interest, because they 
arise in so many contexts — for example, as unit balls of finite-dimensional polyhe- 
dral Banach spaces. For such polytopes there is so far no reasonable “upper-bound 
conjecture”, It is known that the degree of neighborliness of such polytopes is less 
than one might think. For example, when the number of vertices is large relative 
to the dimension there is always a pair of nonantipodal vertices that are not joined 
by an edge (Burton 1991). 

A theorem of Stanley (1987b) about A-vectors of centrally symmetric polytopes 
yields the following lower-bound theorem. The stated bounds can be achieved by 
starting with the d-dimensional cross-polytope and applying n — d successive pairs 
of stellar subdivisions of antipodal facets. 
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Theorem 6.12. /f (fo, fi,---) is the f-vector of a centrally symmetric simplicial d- 
polytope with n vertices, then 

(i) fa-y 2 24 + 2(n —d)(d — 1), 

(ii) fi > 21.4) +2(n- dy) O<i<d—2). 


Not much is known in the nonsimplicial case, except for the following result of 
Figiel et al. (1977). 


Theorem 6.13. There exists a positive constant c (independent of d) such that 


log fo(P) - log fa-1(P) 2 ed 


for each centrally symmetric d-polytope P. 


Inequality 6.12 (i) implies a result proved by Barany and Lovasz (1982) with the 
aid of Borsuk’s antipodal mapping theorem: If a d-polytope is simple and centrally 
symmetric then it has at least 24 vertices. 

Kalai (1989) conjectured that each centrally symmetric d-polytope has at least 
3¢ nonempty faces, and noted that this number is attained by cubes and a variety 
of other polytopes. Y. Kupitz conjectured that each d-polytope with no triangular 
face has at least 2% vertices, and this was proved by Blind and Blind (1990). 


6.6. Upper-bound and lower-bound theorems for polytope-pairs 


For a simple d-polytope Q with a given number of facets, the dual equivalents 
of Theorems 6.10 and 6.11 determine the maximum and the minimum number of 
i-faces that Q can have (0 <i < d — 2). These extrema are also of interest when Q 
is an unbounded simple d-polyhedron, and then it is appropriate to take separate 
account of the number of bounded facets, the number of unbounded facets, and 
the dimension of Q’s recession cone (the union of all rays in Q that issue from a 
fixed but arbitrary point of Q). In terms of the simplicial approach, this leads to 
the definitions and theorems below. 

A pair (P, F) is called a polytope-pair of type (d,v,k,r) if P is a simplicial d- 
polytope with vu vertices, F is a (kK —1)-face of P, and fo(link(P, @(P)) =r —k. 
Under polarity, (P,F) corresponds to a pair (P*,F*) where P* is a simple d- 
polytope with v facets and F* is a (d — k)-face of P* that has 7 — k facets of its 
own. Let H be a supporting hyperplane of P* with HM P* = F*, and let Q be 
the unbounded simple d-polyhedron obtained from P* by applying a projective 
transformation that sends H into the hyperplane at infinity. If k = 1 then Q has 
v — 1 facets, r — 1 of which are unbounded. If k > 1 then Q has v facets, r of which 
are unbounded. In either case Q has a recession cone of dimension d — k + 1. Every 
simple, pointed, convex d-polyhedron with (d — k + 1)-dimensional recession cone 
can be obtained in this way. 

As Q is unbounded, its face-lattice does not form a complex as we have used 
the term so far. However, our definition of a complex can be extended without any 
changes to allow cells to be unbounded pointed polyhedra. With this extension in 
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mind, we can define the boundary complex @(Q), and the dual of this complex, 
just as in the bounded case. The dual of &(Q) is the simplicial ball 


F=(P)\F := {G € B(P): F is not contained inG}. 


The triangulation of I’s boundary is dual to the complex generated by the un- 
bounded faces of Q. 

We will present sharp upper and lower bounds for the number of faces of P and 
I, thus providing sharp upper and lower bounds for the number of faces of P* 
and Q. The bounds are stated in terms of the h-vectors of P and I, but translating 
them into the context of P* and Q is easy, using the facts that 


d-} _3 
HPy=Y (77 \ne), o<j<a, 


i=0 
f,(Q) = 3 Vit) O<j<a. 


In particular, the second formula delivers the minimum and maximum possible 
numbers of faces of a simple d-polyhedron having a recession cone of dimension 
d—k+1 and v facets (v — 1 if k = 1), r of which are unbounded (r — 1 if k = 1). 


Theorem 6.14 (Upper-bound theorem for polytope pairs). Let 3<d<r<vu and 
l<k<gd—2. Put n= |d/2\. Let (P,F) range over ail polytope pairs of type 
(d,v,k,r). Set F = B(P)\F. 

If1<k < ((d—1)/2| then 


a 


O<ic¢k, 
} a 1) (races) + Catal, k+l <i n, 
max i(P) = sn eh (tae n+1l<i<d—-—k-1 
d~i-k d-i-k /» SS : 
oie iy, d—k<i<d 
Gee 1, O<i<k-—-1, 
C2 drinty (Cae 1 k < i <n 
maxh,(I) = (rie i n+1<icd—k-1l, 
Cl) red, d—k<i<d-1, 
0, i=d. 


If \(d —1)/2) <k <d—2 then 


v—dt+i-1 : 
maxh;(P) = re ), O<i<n, 
Cri) n+tlcic<d, 
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aN), O<i<n, 

Caer n+1<i<k-l, 
mah = 1 )-1, tk, 

(Zo) rtd, k+1<i<d-1, 

0, i-d. 


Moreover, for each of the above equalities, the indicated maxima can be achieved 
simultaneously. 


Portions of Theorem 6.14 appear in Klee (1974) and Lee (1984). It is proved 
completely (including the tightness of the bounds) in Barnette et al. (1986). Note 
that the cases k = d — 1 or d just correspond to the classical Upper-bound theorem. 
The following result is proved by Lee (1984), along with additional information 
about h-vectors of polytope-pairs. 


Theorem 6.15 (Lower-bound-theorem for polytope pairs). Let 4<d<rs<v and 
2<k<d—2. Putn = |d/2| and m= [(d —k)/2]. Let (P,F) range over all poly- 
tope 

pairs of type (d,v,k,r). Set F = B(P)\F. Then 


‘ ‘ _ 1, i=0, 

(i) minh(P)={) 1<ic<n; 
1, i=0, 

uv —d, l<igk-], 
Gi) mink(P)= 4 v-d-1, i=k, 
0, ti-—d. 

For k =1and3<d<r<v, (i) is the same but (ii) becomes 

di i=0, 
% = v—d-l, i=1, 

minh;(I) = v-r—-1l, 2<i<d-1, 
0, i=d. 


All bounds in (i) and (ii) can be achieved. 


7. Polytopal graphs 


In the edge-graph €(@) of a complex €, the nodes are ©’s vertices and the edges 
are pairs of vertices that share an edge of @. Dually, in the ridge-graph R(©) 
(defined when © is pure), the nodes are facets of @ and the edges are pairs of 
facets that share a ridge of €. When P is a polytope, the graphs 2(P) = R(B(P)) 
and @(P*) = &(@(P")) are isomorphic. Hence the d-polytopal graphs may be 
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defined as those isomorphic to the edge-graph of a d-polytope or, equivalently, as 
those isomorphic to the ridge-graph of a d-polytope. 

There is interest not only in polytopal graphs but also in polytopal digraphs, 
for these digraphs are closely associated with linear optimization, they provide a 
transparent approach to the shellability of simple polytopes (Danaraj and Klee 
1974), and they are involved in a simple formulation of McMullen’s proof of the 
Upper-bound theorem (Bréndsted 1983). 

A linear functional ¢ is admissible for a d-polytope P if y’s domain contains 
P and ¢ does not attain the same value at any two vertices of P. The ¢-digraph 
@Q(P,¢) of P is then formed by orienting each edge of &(P) in the direction of 
increasing p. A digraph is d-polytopal if it is isomorphic to one formed in this way. 
Note that each polytopal digraph has a unique source (node with zero indegree) 
and a unique sink (node with zero outdegree). 

The earlier sections have emphasized the “simplicial” approach to polytopes. 
However, in dealing with polytopal graphs we stress the “simple” approach because 
it leads naturally to a focus on edge-graphs, which seem more intuitive and more 
directly relevant to applications than ridge-graphs. 

This section is divided into three subsections as follows: Results for arbitrary 
dimension; The d-step conjecture and its relatives; Special low-dimensional prob- 
lems. 


7.1. Results for arbitrary dimension 


The following theorem of Balinski (1961) is the first significant one about arbitrary 
polytopal graphs. 


Theorem 7.1. The graph of a d-polytope is d-connected. 
There is an analogous result for polytopal digraphs. 


Theorem 7.2. In each d-polytopal digraph there are d independent dipaths from 
source to sink. 


To see that 7.2 implies 7.1, note that for any two nodes u and vu of a d-polytope 
P there exist a d-polytope Q, an admissible g for Q, and a projective bijection 
that carries P onto Q and u and v onto vertices of Q corresponding to the source 
and sink of 2(Q, ¢). 

For each n and d, let s(n, d) denote the maximum number of components into 
which a d-polytopal graph can be separated by removing n nodes. Klee (1964d) 
proves the following extension of 7.1, where y(n,d) is the maximum number of 
vertices of d-polytopes with n facets (see 6.9-6.10). 


Theorem 7.3. For all d and n, 


1, ifn <d, 
s(n, d) = ¢ 2, ifn=d, 
u(n,d), ifn>d. 
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Here is another property of d-polytopal graphs, due to Griinbaum and Motzkin 
(1963): 


Theorem 7.4. Each d-polytopal graph contains a subdivision of the complete graph 
on d+1 nodes. 


This theorem is extended as follows by Griinbaum (1965): if € is a d-polytopal 
complex and 0 < k < d then the k-skeleton of € contains a subdivision of the k- 
skeleton of a d-simplex. 

Various improvements and relatives of 7.4 provide other necessary conditions 
for d-polytopality of a graph (see Griinbaum 1970, 1975). However, it seems very 
unlikely that there exists an easily testable characterization of polytopal graphs. 
This pessimism seems justified by remarks in section 4 (4.19 ff.) in conjunction 
with the following fact, conjectured in 1970 by M. A. Perles and proved by Blind 
and Mani (1987) and by Kalai (1988c). 


Theorem 7.5. Each simple polytope is determined by its edge-graph. That is, two 
simple polytopes P and Q are isomorphic if and only if their edge-graphs are iso- 
morphic. 


By the dual equivalent of 7.5, each simplicial polytope is determined by its 
ridge-graph. Of course, simplicial polytopes are not in general determined by their 
edge-graphs. (Consider the neighborly polytopes!) However, it is known that each 
simplicial d-polytope is determined by its |d/2]-skeleton (Dancis 1984), while for 
an arbitrary d-polytope the best that can be said is that it is determined by its 
(d — 2)-skeleton (Gr67). 

Kalai (1988c) has a short proof of 7.5. Suppose that P is a simple d-polytope, 
and for each nonempty face F of P let p(P) denote the corresponding subgraph 
of €(P). Call an orientation < of &(P) good if it is acyclic and with respect to 
it each €;(P) has a unique sink. For each acyclic orientation < of €(P), let hs 
denote the number of nodes of indegree k, and define the quantity 


FS = hE + 2h8 + 4n¥ +---+ 27h. 


Note that if x is a node of €(P) of indegree k, then x is a sink in 2* faces, because 
(by simplicity of P) each choice of i edges incident to x determines an /-face that 
includes them. It follows that if f is the number of nonempty faces of P and < is 
any acyclic orientation, then f< > f, with f< = f if and only if < is good. Hence 
the good acyclic orientations of (P) are those yielding the minimum value for f“. 
To complete the proof, observe that the faces of P can be described in terms of the 
good acyciic orientations. Indeed, it is not hard to see that a subgraph H of €(P) 
corresponds to a k-face of P if and only if H is induced, k-regular, and connected, 
and its nodes precede all others with respect to some good acyclic orientation of 
8(P). 

When P is a simple d-polytope, each facet of P corresponds to a subgraph of 
€(P) that is induced, connected, and (d — 1)-regular. M. Perles has conjectured 
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that, conversely, each such subgraph of €(P) arises from a facet. If this could be 
proved, even with “connected” replaced by “(d — 1)-connected”, it would provide 
a different proof of 7.5. 


7.2. The d-step conjecture and its relatives 


For a polytope P, let 6(P) denote the diameter of the graph €(P). When 
gy is admissible for P, let é(P,){respectively 7(P,¢)) denote the length of 
the shortest (respectively longest) dipath from source to sink in 2(P,¢). For 
n>d 22, let A(d,n)(respectively 2(d,n), H(d,n)) denote the maximum of 5(P) 
(respectively £(P, @), n(P,@)) over all P or (P,¢@) for which P is a simple a- 
polytope with n facets. Obviously, A(d,n) < S(d,n) < H(d,n). These functions 
are all of interest in connection with the behavior of edge-following algorithms 
for linear programming, and the case n = 2d is of special interest. However, even 
though the diameter is one of the most intensively studied parameters of graph 
theory, little is known about these functions. They are difficult to study because 
the relevant graphs or digraphs are not defined in a purely combinatorial way but 
involve geometric notions as well. 

The d-step conjecture asserts that A(d, 2d) = d for all d, and the Hirsch conjecture 
asserts that A(d,n) <n — d for all d and n. Since a comprehensive survey of these 
conjectures is provided by Klee and Kleinschmidt (1987), we state only a few of 
the main results. The first part of 7.6 is due to Klee (1964b), its second part and 
7.7 to Klee and Walkup (1967). 


Theorem 7.6. A(3,n) = |2n/3] -— 1; 
A(d,d+k)=k when 1<k < min{d, 5}. 


Theorem 7.7. The following three statements are equivalent: 
(i) for all d, A(d, 2d) = d; 
(ii) for all d and n, A(d,n) < n—d; 
(iii) any two vertices of a simple polytope can be joined by an edge-path that 
does not revisit any facet. 


Note that by 7.6, (ii) is correct for d < 3, (i) for d < 5. The dual equivalent of 
the latter fact is extended to simplicial spheres (and more general complexes) by 
Adler and Dantzig (1974). The dual equivalent of (iii) is that any two facets of 
a simplicial polytope are joined by a ridge-path that does not revisit any vertex. 
Mani and Walkup (1980) show that this fails for a certain nonpolytopal simplicial 
3-sphere, and that leads to a counterexample to the sphere-analogue of the dual 
equivalent of (i). Specifically, there exists a nonpolytopal simplicial 11-sphere # 
with 24 vertices such that the diameter of the ridge-graph R(¥) exceeds 12. 

For large values of d and n — d, the following two results, due respectively to 
Larman (1970) and to Kalai and Kleitman (1992), provide upper bounds on A(d,n) 
that are the best known in certain directions. 


Theorem 7.8. A(d,n) < 24-3n; A(d,n) < d?*!8", 
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Neither of these results rules out the possibility that as d -» oo, A(d,2d) — d= 
co, but at least the second result provides a subexponential upper bound on 
A(d,2d). In any case, the following theorem of Lee (1984) shows that no dras- 
tic failure of the Hirsch conjecture (if there is such a failure) can be accounted for 
by strange behavior of f-vectors. 


Theorem 7.9. For each simple d-polytope P with n facets, there is a simple polytope 
Q such that P and Q have the same f-vector and the diameter of Q’s edge-graph is 
at mostn—d+1. 


The polytopes that are dual to neighborly (or almost neighborly) polytopes are 
natural candidates for violation of the Hirsch conjecture. However, they can in 
any case not be “exponential violators”, for Kalai (1991) has shown that their 
diameters are bounded by a polynomial in d and n. 

Because of their relevance to many problems of combinatorial optimization, 
there is special interest in polytopes whose vertices have all coordinates equal to 0 
or 1, A simple argument of Naddef (1989) shows that the Hirsch conjecture holds 
for such polytopes. 

Even less is known about the functions H and 4. The first part of 7.10 is due 
to Klee (1965), its second part to Todd (1980). In 7.11, which is due to Klee and 
Minty (1972), the upper bound is a consequence of the Upper-bound theorem. 


Theorem 7.10. 5(3,n) =n —3; 5(4,9) 26. 


Theorem 7.11. For each d there are positive constants aq and B, such that for all 
n, agnl4/?! < H(d,n) < Byn'4/2!. Also, H(d,2d) > 2% — 1. 


For additional results related to diameters of polytopes, see Kalai (1991, 
1992a,b). 


7.3. Special low-dimensional problems 


Our focus on high-dimensional results may have left the impression that most 
combinatorial questions about 3-polytopal graphs have been settled. That is far 
from being the case, and many interesting illustrations could be given. However, 
we limit ourselves to two old problems, both of which involve the p-vector of a 
3-polytope. This is the ultimately zero sequence (ps3, p4,...) in which p; denotes 
the number of 2-faces with precisely i vertices. There are many open questions 
concerning p-vectors, and it seems unlikely that easily testable characterizations 
will ever be known. However, in the case of simple 3-polytopes the following result 
comes tantalizingly close to providing a complete characterization. 


Theorem 7.12. if (p3, pa,...) is the p-vector of a simple 3-polytope, then 
26 ~ px) = 12. 


k23 
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For each sequence (p3,pa,...) of nonnegative integers satisfying this equation, the 
value of pg can be adjusted so that the altered sequence is the p-vector of a simple 
3-polytope. 


Thus the complete determination of the p-vectors of simple 3-polytopes reduces 
to the following: For a sequence (p3,p4,Ps,P7,--.) such that ¥°,53(6 — px) = 12, 
which values can be filled in for pe so that the sequence (p3, p4, Ps, P6,P7;---) is the 
p-vector of a simple 3-polytope? 

The first part of 7.12 follows readily from Euler’s theorem. The second part is 
an 1891 discovery of V. Eberhard, proved much more simply in Gr67 by use of the 
reductions of 3-connected planar graphs mentioned in section 4. Griinbaum (1968) 
shows that if p; = pa = 0, then ps may be required to have any preassigned value 
28. For other results on this, and on related problems concerning 3-polytopes that 
are not required to be simple, see Griinbaum (1967, 1968, 1970, 1975), Roudneff 
(1987), and their references. 

In an early attack on the 4-color problem, P. G. Tait conjectured in 1880 that 
every simple 3-polytope is Hamiltonian (admits a circuit using all vertices). That is 
disproved by a 1946 example of Tutte, with 46 vertices, and there are later examples 
with only 38 vertices. (See Gr67 and Barnette 1983 for the history.) Garey et al. 
(1976) show that the problem of detecting Hamiltonicity in 3-polytopes is NP- 
complete, even when restricted to simple 3-polytopes for which p3 = pa = 0. On 
the other hand, Hamiltonicity has been established for several broad classes of 3- 
polytopes. The successive classes in 7.13 are treated in the papers of Tutte (1956), 
Holton and McKay (1988), Holton et al. (1985), and Goodey (1975, 1977). 


Theorem 7.13. A 3-polytope P is Hamiltonian if P’s graph is 4-connected, and also 
if P is simple and satisfies any of the following supplementary conditions: 
(i) the number of vertices is at most 36; 
(ii) the number of vertices is at most 64 and each 2-face has an even number of 
vertices; 
(ili) the total number of 2-faces is equal to p3 + p4 or to p3 + pe OF to Pa + Do. 


Some of the above results are relevant to the conjecture of D. Barnette (in 
Griinbaum 1970) that a simple 3-polytope P is Hamiltonian if it has no odd 2- 
faces, or the conjecture of Goodey (1977) that P is Hamiltonian if no face has 
more than 6 vertices. Barnette conjectures also that each simple 4-polytope is 
Hamiltonian, which is discussed by Rosenfeld (1983). 
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1. Introduction 


A point lattice L (called simply a /attice in the following) is a set of points in R” 
which form a discrete additive subgroup of R”. That is, L satisfies the two 
conditions: 

(1) Additivity. If v,, v,E L, then —v, EL and v,+v, EL; 

(2) Discreteness. There exists cy >0 such that for any v,, v, EL with v, #v,, 
|v, — v,| cy, where |-| is the Euclidean norm. 

Elements in L are called lattice points or lattice vectors. The most familiar 
lattice is the integer lattice Z". See fig. 1.1. 
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Figure 1.1. Part of the lattice Z’. 


Equivalently, a lattice may be viewed as a discrete group G, of translations 
acting on R”, in which case the orbit of 0 under G,, i.e., L = {g(0): gE G,}, isa 
point lattice in the sense above, and L determines G,. The group G, is a discrete 
subgroup of the group 7(R’) of all translations. 7(R”) is naturally identified with 
R", and is a subgroup of the group E, of all Euclidean motions or “isometries” of 
R", which in turn is a subgroup A, of all affine motions of R’. 

There are several classes of combinatorial problems involving lattices. These 
problems are related in one way or another to convex bodies. The first class of 
problems concerns the packing or covering of space with translates of a given 
convex body. One may consider the special subclass of such problems in which 
the set of translates forms a lattice, which are called lattice packings and lattice 
coverings, respectively. The geometry of numbers is a branch of number theory 
concerned with finding the best lattice packings and coverings. The second class of 
problems is algebraic, and studies the possible symmetries of lattices, and more 
generally the problem of classifying discrete subgroups G of the group of 
isometries E,, for which the quotient space E,/G is compact. This is the subject 
matter of mathematical crystallography. It is also closely related to the problem of 
classifying possible tilings of spaces with congruent polytopes. Polytopes of special 
shape admitting lattice tiling of space are called parallelotopes, and one problem is 
to classify the possible combinatorial types of parallelotopes. A third class of 
problems concerns the study of properties of convex polytopes all of whose 
vertices are in the lattice Z”, which are called lattice polytopes. There are many 
interesting relations between geometric invariants of such polytopes such as mixed 
volumes and combinatorial invariants such as the counting functions of the 
number of points in lattices (1/d)Z” in such polytopes. These include com- 
binatorial reciprocity laws. A fourth class of problems concerns the determination 
of when a polytope contains a lattice point of Z", which is the integer program- 
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ming problem. Many combinatorial problems are naturally formulated as integer 
programs and such a formulation is often useful in their solution. For example, set 
covering problems, matching problems, existence of finite projective planes, 
existence of specific t-designs, and the problem of finding maximal independent 
sets in graphs are all naturally encodable as 0—1 integer programming problems. 
General methods from the geometry of numbers lead to integer programming 
algorithms, see section 7. These problem classes are the ones surveyed in this 
article. 

Point lattices are one of the most pervasive structures in mathematics. In 
number theory they arise in studying the integral equivalence of positive definite 
integral quadratic forms (see section 2), in the geometry of numbers and 
Diophantine approximation, in algebraic number theory and class field theory. In 
algebraic geometry elliptic curves and more generally Abelian varieties have the 
structure C”/L, where L is a lattice. Modular functions, modular forms and their 
generalizations are invariants of isomorphism types of lattices. The theory of toric 
varieties and the problem of finding the number of solutions of systems of 
algebraic equations both use invariants attached to lattice polyhedra (section 6). 
The creation of group theory was partially motivated by the study of crys- 
tallography. Many of the sporadic simple groups have been constructed as (part 
of) the automorphism groups of particular lattices, e.g., the Monster group is the 
group of automorphisms of a certain 196884-dimensional lattice. In Riemannian 
geometry all compact flat manifolds arise as quotients E,/G for certain crystallo- 
graphic groups G. More generally, lattices viewed as groups of translations 
generalize to the study of discrete subgroups of arbitrary Lie groups and their 
quotient spaces. The lattice structure is evident in Fourier series, which exist on 
R"/Z", and harmonic analysis is an important tool in the study of lattices. Lattices 
and their cosets play a fundamental role in algebraic coding theory. Finally, 
lattices arise in numerical analysis in the construction of multivariate splines and 
in numerical integration. These areas are for the most part not treated here; 
references to some of them are provided in section 8. 

The lattice concept originally arose in two different areas, number theory and 
crystallography. In number theory it appears in the reduction theory of binary 
quadratic forms, which was created by Lagrange in 1773. This has a lattice 
interpretation due to Gauss in 1831. The reduction theory of m-dimensional 
positive definite quadratic forms, which is essentially reduction theory for lattices, 
was developed by Hermite in 1850 and Korkin and Zolotarev (1873, 1877). The 
full power of the lattice concept in number theory was indicated in Minkowski’s 
development of the geometry of numbers in 1896. In crystallography Bravais 
(1850) began the mathematical development. In 1869, Jordan classified certain 
groups of movements and L. Sohnke pointed out its application to crystallography 
in 1879. The classification of all 3-dimensional space groups, the ones of direct 
relevance to crystallography, was accomplished by Fedorov (1889) and later, 
independently, by Schoenflies and Barlow. In 1900 Hilbert raised the problem of 
N-dimensional analogues of crystallography in the 18th problem on his famous 
problem list. Bieberbach (1910a,b, 1912) gave a solution. 
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General books and surveys for the individual sections are as follows. For 
geometry of numbers see Cassels (1971), Gruber (1979), Gruber and Lekker- 
kerker 1987). For packing and covering see Baranovskii (1971), Conway and 
Sloane (1988a), Fejes-Toth (1972), Rogers (1964), Ryshkov and Baranovskii 
(1979), Saaty and Alexander (1975). For mathematical crystallography see 
Charlap (1986), Delone et al. (1973), Hilton (1903), Schwartzenberger (1980). 
For tilings in low dimensions see Grinbaum and Shepard (1987). For integer 
polytopes see Ewald (1985) and Gritzmann and Wills (1992). For algorithmic 
gcometry of numbers see Kannan (1987b) and Lovasz (1986). Lists of problems 
concerning lattice points appear in Erdés et al. (1989) and Hammer (1977). See 
section 8 for references to other areas of mathematics. 

It is impossible to give more than an overview of this area in a short space. I 
have especially benefitted from and borrowed from other surveys, e.g., Ewald 
(1985), Milnor (1976) and Rogers (1964). 


2. Lattices, convex bodies and reduction theory 


A (point) lattice L is a discrete additive subgroup of R”. Let lin(L) denote the 
vector space spanned by the elements of L. Let rank(L) denote the dimension of 
lin(L), i-e., the number of linearly independent vectors in L. A lattice is of full 
rank if rank (L) =n, ice., if lin(L) =R’. 

An (ordered) basis B=[b,,...,6,] of L is a set of vectors in ZL with 


L={n,b, + +++ +n,b,:(n,,...,2,)EZ}, 
where r=rank(L), We represent vectors in L as row vectors, and a basis 


b 


as an rXn matrix. If B is a basis of a lattice L, so is B'=UB for any 
U €GL(r, Z), and conversely all possible bases of L are obtained in this fashion. 
We associate with a basis B the fundamental parallelepiped 


F(B) = (w=a,b,+ +++ +0,b,:0<a,<1}. 
The translates of F(B) by elements of L give a perfect tiling of lin(L). 
Definition 2.1. The determinant det(L) of a lattice L is 

det(L) = det(BB")'” , 
where 8 is any basis of L. 


The determinant of L is independent of the choice of basis, and gives the 
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r-dimensional volume of the fundamental parallelepiped F(B) regarded as 
embedded in R”. 

A lattice L, is a sublattice of a lattice Ly if L, C Ly and rank(L,) = rank(Z,). 
The index of L, in Ly is 


det(L, ) 
det(L,) ° 


[Lo:L,] = 


A sublattice L, of a lattice L, is completely specified by an integer matrix M of 
determinant +[L,:L,] which takes a fixed basis B of L, to a basis B of L,. There 
are Only a finite number of sublattices L, in a lattice L, having a fixed index 
[L,: L,] =, which are classified by the following result. 


Theorem 2.2 (Hermite normal form). For each integer matrix M of nonzero 
determinant, there exists a unique matrix M = UM with matrix U € GL(n, Z) such 
that M is lower triangular with positive diagonal elements, i.e., 


ay; 0 
M=UM=\°2 922 
G1 a 


with all a,,>0 and 0<a,<a, for 1<j<is<n. The matrix M is said to be in 
Hermite normal form. 


In particular for such a lattice L, the complete set of sublattices of index n are 
given by the set of bases B = MB where M is in Hermite normal form with 
det(M) =n. There are only a finite number of these, see Newman (1972). 

The simplest example of a lattice is the integer lattice Z" in R”, which has full 
rank and determinant det(Z") = 1. A sublattice of the integer lattice is called an 
integral lattice. 

The fundamental invariants of a lattice describe how its points sit in space with 
respect to a norm, and more generally with respect to homothetic copies AK of a 
general body K. 

A body K is a connected subset of R” which is the closure of its interior int (K). 
It is convex if x, y€K implies that Ax + (1— A)y EK for0 <A <1. It is symmetric 
(often called O-symmetric in the literature) if 0G int (K) and x © K implies that 
—x€K. It is a star body if GEInt (K) and xEK implies that AxEK for 
O0<A<1. For AER the scaled body AK is defined by 


AK = {Axr:xEK}. 
Definition 2.3. The Minkowski sum K, + K, of two bodies is 


K, + K, = {x, + x,:x, EK, andx,EK,}. 
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In particular the difference body DK = K — K is the Minkowski sum of K and 
—K. A difference body DK is always symmetric, and is convex if K is convex. 

A norm (or distance function) ||, is a map R" > R satisfying 

(1) (Triangle inequality) |x + y|, <|x|,+|y|,. 

(2) [Axl =IAl |alx . 

(3) Ifx #Othen |x|, >0. 

A norm is completely determined by its unit ball 


K=K(|-l) = (x: lee =}, 
since one has 
|x| = inf{A: x EAK} . (2.1) 


The unit ball of a norm is always a symmetric convex body K, and conversely any 
such body gives rise to a norm by (2.1). In fact (2.1) can be used to define |-|, for 
any star body K, but conditions (1) and (3) above do not necessarily hold. 

The symbol |- | will denote the Euclidean norm |x|’ = . x), which is induced 
from the Euclidean inner product (x, y) =xy= 0 1x 

The volume of a convex body K in R” is denoted Vouk ) or vol, (K). The 
Euclidean unit ball B, = {x: |x|? <1} has volume 


an"? 
vol(B,) = nT Gan) ; 
where I(-) denotes the Gamma function. 


Definition 2.4. Given a symmetric convex body K in R” the ith successive 
minimum 4,(K,L) of a lattice L for 1<i<rank(L) with respect to K is the 
minimum value of A such that AK contains i linearly independent lattice points of 
L (inside or on its boundary). The ith successive minimum 4,(L) refers to the case 
K =8,, i.e., the Euclidean norm case. 


The successive minima measure to what extent the lattice vectors “‘stretch’’ in 
various directions with respect to the convex body K. Here }A,(K, L) gives the 
maximum size AK of a homothetic copy of K that can be packed in space without 
overlap with centers located at the points of L. The integer lattice Z” has 
A (Z") =A,(Z") = --- =A,(Z") = 1. 

Lattices have a duality theory related to the duality theory in Fourier analysis 
on R”/Z". The Euclidean inner product plays an important role in this duality. 


Definition 2.5. The dual lattice L* (also called reciprocal lattice or polar lattice) to 
L is 


= (y: y& lin(L) and (x, y) €ZforallxEL}. 


One has rank(L*) = rank(L) and det(L*) =det(L)' 
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An analogous duality notion can be defined for symmetric convex bodies. 


Definition 2.6. The dual body K* or (polar body) to a convex body K containing 
0 in its interior is 
K* = (y: (x,y) <1 forallxEK}. 
If K is symmetric then K* is symmetric. The norm |-|,. is said to be dual to 


|-|,. Note that the L?-norm is dual to the L’-norm in this sense, when 1/p + 
1/q=1, for l<p, gq=™. 


Theorem 2.7. There is a constant 0<c<1 such that for all n, any symmetric 
convex body in R" satisfies 


c” vol (B,)’ < vol(K) vol(K*) < vol(B,)’ . 


The upper bound is due to Santalé (1949) and the lower bound to Bourgain 
and Milman (1987). Still unsolved is a conjecture of Mahler (1939) that the best 
possible lower bound is 4"/n!, which is attained by 


k={xeR’: > le <i}. 
i=] 
There are general relations between the successive minima of a lattice and its 
dual lattice, with respect to a given pair of dual norms. 


Theorem 2.8. The successive minima \,(L) of a rank n lattice L and of its 
reciprocal lattice 4,(L*) are related by 

1<A,(L)A,_(4,(L*) Sa 
for 1<i<n,. More generally, for any symmetric convex body K, 

1SA(K, L)A,_;,,(K*, L*) < ne? 
for 1Si<n. 

Here the well-known lower bounds go back to Mahler (1939), while the upper 

bounds are proved in Banaszczyk (1993). The lower bounds are sharp, and the 
upper bounds are best possible up to a constant, because a result of Conway and 


Thompson (given in Milnor and Husemoller 1973, chapter II, Theorem 9.5) gives 
lattices L, = L* in R” with 


A(L,)A(L2) = (4a +0(1)) asn—o, 


Reduction theory deals with the choice of a “good” basis of a lattice L among 
the set of all possible bases of L. We describe reduction theory for the case of a 
full rank lattice L. The set of possible ordered bases of all full rank lattices L in 
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R” make up the group GL(, R), and the set of ordered bases of a single lattice L 
is a GL(n, Z)-orbit in GL(@,R). A reduction theory consists of a choice of 
reduction domain ®, for the quotient space GL(n, Z)\GL(n, R). A reduction 
domain &,, is an open set R in GL(n, R) such that each lattice has at least one 
and at most c(n) bases in #,, where c(n) is a finite bound depending on @,. An 
ordered basis lying in a reduction domain is called reduced (in the sense of a 
particular reduction theory). 

Two classical reduction theories are due to Korkin and Zolotarev (1873, 1877) 
and to Minkowski (1905). They were originally developed as reduction theories 
for positive definite quadratic forms, and had the object of choosing reduced 
quadratic forms with small coefficients. A positive definite quadratic form g(x) = 
x'Mx, where M lies in the cone 9; of positive definite symmetric matrices. A 
reduction theory for positive definite quadratic forms consists of a choice of a 
reduction domain ¥, for GL(n, Z) acting on Y,, treating g(x) and q(Ux) for 
U EGL(n, Z) as the same quadratic form. A quadratic form reduction theory 
always arises from a lattice basis reduction theory using the map 
¢:GL(n, R)—> FY; which has ¢(B) = BB". The region ¢~'(#) in GL(n, R) is a 
reduction domain for lattice bases which is invariant under the orthogonal group 
O(n, R), ie., if MEG '(F) then MOE @ '(F) for all O € O(n, R), see Cassels 
(1978). 

An ordered basis [b,,...,5,] is Minkowski-reduced if b, is a shortest 
(nonzero) vector in L and is inductively defined by the requirement that b, be a 
shortest vector in L subject to the condition that [6,,..., b,] can be extended to a 
basis of L. The domain ¥,, of Minkowski-reduced bases has a simple shape. 


Theorem 2.9. The domain ¥,, of Minkowski-reduced lattice bases has the property 
that 


Qy = {BB": BE F,} 


ts a closed polyhedral cone having a finite number of facets contained in the cone 
P * of positive definite symmetric n X n matrices. 


In particular, one can write down a finite set of inequalities for a basis 
B=[b,,...,8,] of a lattice L of the form 
|m,b, ee ae m,b,|" < |m,b, ane oF m,b,|° 
where m=(m,,...,m,) and m'=(m'},...,m’) are in Z", which are necessary 
and sufficient conditions fora BE ¥,,. For n <4 they are 


[b,|? = [2,|” 1<i<j<n, 


2 


2Sisn, 


é-l 
|b,|? < lo +d mob, 
imo *? 
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where m, take all possible values from 1, 0, —1, see Cassels (1978, p. 257). 

To define Korkin—Zolotarev reduction we first define projected lattices. A 
lattice subspace W of L is a subspace spanned by lattice points in L. Let P,, denote 
orthogonal projection onto the subspace V of R" and let W* = {y: (x, y) =0} be 
the orthogonal complement of W in R” (with respect to the Euclidean inner 
product (:,-+)). If W is a lattice subspace of L then the projected lattice my .(L) is 
a lattice in R” contained in W*, where 7,,.(-) denotes orthogonal projection onto 
W°. (If W is not a lattice subspace then m,.(L) need not be a lattice.) An 


ordered basis [b,,...,5,] of a lattice L is Korkin—Zolotarev reduced if 
(1) 8, is a shortest nonzero vector in L. 
(2) For W,=R[b,,...,8,], the vector my.(b,,,) is a shortest vector in the 


projected lattice 7y.(L), for 1<i<n—1, 
(3) Size-reduction condition. For 1Si<jsn, 


{ary 2 ACRE Tw, (b;)) <1|my. (8) > 


where W; =R”. 

One can show that almost all lattices have a unique choice of Minkowski- 
reduced basis (resp. Korkin—Zolotarev reduced basis), up to the choice of sign * 
+b, for each basis vector. Furthermore, no, rank n lattice has more than c(7) such 
reduced bases, where c(n) grows like 200"), ) 

The length of basis vectors in such reduced bases can be bounded in terms of 
the successive minima A,(L). There are very good bounds for Korkin—Zolotarev 
reduced bases, cf. Lagarias et al. (1990). 


Theorem 2.10. Jf [b,,...,6,] is a Korkin—Zolotarev reduced basis of a lattice L 
then 


oy AML) < |b) > Ly. 


Weaker bounds of a similar type are known for Minkowski-reduced bases, with 
constants depending exponentially on i. 
We end this section with an analytic invariant of a lattice. 


Definition 2.11. The zeta function {,(s) of a lattice L is the Dirichlet series 
(8) = 2 |x|. 
xEL 
x*0 


Here ¢, (s) coincides with the Epstein zeta function of the quadratic form 


Qs wd= |Z 


where [8,,...,6,] is a basis of L. 
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Theorem 2.12. The Dirichlet series for £,(s) converges in the half-plane Re(s) > r, 
where r=rank(L). ¢,(s) analytically continues to C except for a simple pole at 
s =r which has residue det(L)~'. It satisfies the functional equation 


(6s) =€.-(r—5), 


where L* is the dual lattice and 


£,6)=de(Ly 2 'T(5) £6). 


Several authors have studied the problem of finding lattices L whose zeta 
functions ¢, (s) have certain extremal properties, e.g., maximizing or minimizing 


lim (s 7)" det(L )e, (6) 


over all rank r lattices, cf. Ryshkov and Baranovskii (1979) and Gruber (1979). 


3. Geometry of numbers 


The geometry of numbers studies relations between lattices and convex bodies 
(more generally star bodies), which frequently are expressed as inequalities. 


Minkowski’s fundamental theorem 3.1. Let K be a symmetric convex body and L 
be a full rank lattice in R”. If 


vol(K) = 2” det(L) , 


then K contains a nonzero lattice point of L (either inside it or on its boundary). 
That is, one always has 


A, (K, LY” vol(K) <2” det(L) . 


Minkowski provided this by observing that, if K contained no such lattice point, 
then the translates of int(}K) by all members of L are mutually disjoint. Suppose 
K is contained in a ball of radius c,. Now the volume covered by such translates 
inside a large cube of size T +c, is 2'” vol(K)(# lattice points in 7) while 


(# lattice points in T) = +O(T"') asT>~. 


re 
‘det(L) 
Since the cube of side T +c, has volume T” + O(T” ') one obtains 
-aA fae mi mt n-) 
2" vol(K) Garcey + OCF”) ST" + O(T"™’) 
and letting T— < proves the first part of the theorem. Since vol(AK) = A” vol(K) 


the second part of the theorem is equivalent to the first, by rescaling K so that 
A (K, L)=1. 
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Siegel (1935) observed, using Fourier analysis techniques (Parseval’s theorem), 
that one obtains a sharpening of Minkowski’s fundamental theorem. 


Theorem 3.2. Let K be a symmetric convex body and L a full rank lattice in R”, 
and suppose K contains no nonzero lattice point of L. Then 


2 
vol(K) +4" vol(K)' > { exp(—2mi(b, x)) dx) =2" det(L) , 
BEL*-{0) | 


where L* is the dual lattice of L. 


Fourier analysis provides powerful techniques for proving results in the 
geometry of numbers, and conversely geometry of numbers suggests general 
inequalities in harmonic analysis, e.g., Bourgain and Milman (1987) and 
Banaszczyk (1989, 1991, 1993). 

Van der Corput (1935) gave a different extension of Minkowski’s theorem. 


Van der Corput’s theorem 3.3. Let K be a symmetric convex body and L a full 
rank lattice in R". If 


vol(K ) = k2" det(L) , 


then K contains at least k pairs +v of nonzero lattice points in L. 


The central result in the geometry of numbers is the following strengthening of 
Minkowski (1896) to Theorem 3.1. 


Minkowski’s second fundamental theorem 3.4. Let K be a symmetric convex body 
and L a full rank lattice in R". Then 


n 


= det(L) <A,(K, L)A,(K, L)---A,(K, L) vol(K) <2” det(L). 


The upper bound * attained for L = Z” when K is a cube, the lower bound 
when K={xER": ),_, |x,|<1}. Proofs of the more difficult upper bound 
appear in Bambah et al. (1965), Cassels (1971, p. 208) and Gruber and 
Lekkerkerker (1987, p. 59). A generalization of this theorem to adele spaces 
appears in McFeat (1971) and in Bombieri and Vaaler (1983) and another 
generalization appears in Woods (1966). 

An inhomogeneous extension of Theorem 3.1 is due to Blichfeldt (1914). 


Blichfeldt’s theorem 3.5. Let K be any measurable subset of R" and L a full rank - 
lattice of R”. If 


vol(K) = det(L) 


then K contains two different points v,, 0, such that v, — v, € L. 
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Bombieri (1962) proved a Fourier-analytic generalization of Theorem 3.5 
analogous to Siegel’s theorem 3.2. 
Blichfeldt also proved a kind of converse to this result. 


Theorem 3.6. If a convex body K in R” contains j lattice points from a lattice L 

and these span R", then 

} — 
n! 


vol(K) =2— det(L) . 

Weaker versions of Minkowski’s fundamental theorem 3.1 have been proved 
for nonsymmetric convex bodies. Define the coefficient of asymmetry o(K) of a 
convex body K containing 0 by 


max{A: Are K} 


o(K) = sup max{A: —AxE K}° 


jz|=1 


Then o(K) = 1 and equality holds if and only if K is symmetric. 


Theorem 3.7 (Sawyer 1954). A convex body K in R" containing 0 contains a 
nonzero point of a lattice L if 


vol(K) = (1 + o(kyy"|3 - (1 ze) | det(L). 


Next we consider quantities which are associated to lattice packings of bodies. 
A full rank lattice L is called admissible for a set K in R" if Int(K) contains no 
lattice points of L except 0. That is, L is admissible for K if {4K +v:vEL} isa 
(lattice) packing in R” with copies of 4K. 


Definition 3.8. The lattice constant A(K) of a set K is 
A(K) = inf{det(L): L is an admissible lattice for K} . 
A lattice L is called critical for K if it is admissible and det(L) = A(K). 


Mahler (1946) proved a convergence theorem that is often useful in showing 
that critical lattices exist. The set of lattices L“? converges to a lattice L if there 
exist bases {b{,...,8} of L© and {b,,...,b,} of L such that b\—}, as 
i, This defines a topology on the set of all lattices L. 


Mahler’s selection theorem 3.9. Let a sequence L of lattices be given. If the 
sequence of determinants det(L“?) is bounded and if there is a ball centered at 0 
such that all L“ are admissible for it, then the sequence contains a convergent 
subsequence. 


In particular any bounded symmetric convex body K has a critical lattice. 
There are general results giving upper and lower bounds for lattice constants. 
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The following result was conjectured by Minkowski and proved by Hlawka 
(1944). 


Minkowski-Hlawka theorem 3.10. Let K be a Jordan measurable subset of R”. If 
vol(K) <1 then there exists a lattice L with det(L) <1 admissible for K. If K is 
star-shaped and symmetric around 0 and 


vol(K) <2Z(n) 


" 


there these exists such a lattice. Here {(n)=b,_,k". 


Siegel (1945) derived Theorem 3.10 from a stronger result. He defined a 
natural measure m on the set of all lattices of determinant 1 (which is the double 
coset space GL(n, Z)\GL(n, R)/R*). Call a lattice point x € L primitive if there 
are no lattice points on the segment [0, x] = {Ax:0 <A <1} except 0 and x. 


Siegel’s mean value theorem 3.11. Let f:R"—R be Riemann integrable and 
nonnegative. Then 


J (2, fla)) dm.) = J fle) dx , 


where the integral on the left is taken over all lattices of determinant 1. In addition 


a@E€L— {0} 
4 primitive 


Ji > fe) mth = ey J He 


This result implies that the set of admissible lattices of determinant 1 for a 
general Jordan measurable set K has Siegel-measure =1 — vol(K), giving the first 
part of Theorem 3.10. 

The Minkowski-Hlawka theorem gives a criterion for admissible lattices to 
exist, and establishes the upper bound 


A(K) <vol(K) . 
Schmidt (1963) obtained the following result using Siegel-measure estimates. 
Theorem 3.12. For a general Jordan measurable set K in R?, 

A(K) = 42 vol(K) . 


For a general Jordan measurable set in R’, 


A(K) < 


an—B vol(K) 


where a = 0.3465 and B are constants. 
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In fact one can often exhibit admissible lattices in other cases, e.g., the star 
body {(x, y): |xy| <1} has infinite volume yet has admissible lattices, see Theorem 
3.17 below. 


Minkowski’s fundamental theorem 3.1 gives a lower bound for the lattice 
constant 


A(K) 22°" vol(K). 
A more general lower bound problem arises in connection with the following 


special case of Minkowski’s theorem 3.1. 


Minkowski’s linear form theorem 3.13. Let L be a lattice with det(L) = 1 and let c, 
be positive numbers with II; , ¢; = 1. Then there is a nonzero lattice point of L in 
the box 


Ix,|<e; , forlsisn. 


This result includes Dirichiet’s theorem, which asserts that given any reals 
(0,,...,6,) and a bound AN there exists a denominator 1 < Q <N and numerators 
(P,,...,P,) with 


| —I-lé/n 
——t|< . 
le a|<N 


This is obtained by applying Theorem 3.13 with L = {(ko, k 0, — k,,... .ko@, — 
k,): kEZ"*'}, on taking cy = N and c, =N~'" for 1<i <n. Mordell conjectured 
the following result, later proved by Siegel, cf. Gruber and Lekkerkerker (1987, 
section 24). 


Converse to Minkowski’s linear form theorem 3.14. There is a constant «(n) such 
that for any full rank lattice L in R" with det(L) = 1 there exists a box 


Ix,|sc, forl<i<n, 


with 


such that L is admissible for this box. One has x(n) = (273° +++ (n— 1)" nly. 


There has been a great deal of work devoted to finding critical lattices for 
specific bodies. Concerning critical lattices, Swinnerton-Dyer (1953) proved the 
following result. 


Theorem 3.15. Any critical lattice of a symmetric convex body K contains at least 
n(n + 1)/2 pairs +x of boundary points of K, which include n linearly independent 
points. 
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Minkowski conjectured a form for the critical lattices for boxes and this was 
proved by Hajés (1942). 


Theorem 3.16. A lattice L with det(L) = 1 is a critical lattice for the cube if and 
only if after a suitable permutation of the n coordinates it has a basis of the form 


b, =(1,0,...,0), 
b, =(By,1,0,...,0), 


b,, =(B,1,- seo eee 0 


In general it seems a very hard problem to determine the critical lattices of a 
convex body, even for very symmetrical bodies such as spheres. 

Some unbounded star bodies have the property that a finite number of critical 
lattices exist and any admissible lattice with determinant sufficiently close to that 
of a critical lattice must be a scaled version AL, of some critical lattice. In such a 
case the critical lattices are said to be isolated. The simplest example of this 
phenomenon is associated to the star body Ky = {(x, y): |xy| <1}. 


Theorem 3.17, The unbounded star body K, = {(x, y): |xy| <1} has lattice constant 
A(K,) = V5, with exactly two critical lattices: 


io = (1, 1), (- ance +N) : 


L, =2| (1, 1), (- = =) 


If L is an admissible lattice for K, with det(L) < V8 then L=AL, for i= or 2. 


See Gruber and Lekkerkerker (1987, p. 426) for a proof. A closely related 
problem is the determination of the Markoff spectrum in one-dimensional 
Diophantine approximation, see Cassels (1957, 1971, chapter II.4). 


4. Packings and coverings 


Packing and covering problems often lead to combinatorial structures of interest. 
Extremal packings in small dimensions tend to have very regular structures, with 
large groups of symmetries. 

A collection of translates 9f = {K + v,:i 21} of a body K is a packing if 


int(K +u,)Nint(K +u,)=@8 ifi Aj. 


It is a lattice packing H(K, L) by a lattice L if in addition the translates are the set 
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of vectors in L. To a given collection of sets # and a cube C(r) of side r centered 
at 0 we set 


p(X, ee s vol(K + v;) , 


K+uNC(r) 48 


Pe ee s vol(K + v,) 


Pk +u,CC(r) 
and define the upper and lower asymptotic densities of X by 
p.(H%) =lim sup p, (H, 1) , 
p_(#) =lim inf p(K,rn. 
A packing % is said to have density p,(#). A lattice L packs K if A,(K, L) > 2. 
For lattice packings #(K, L) one has 


vol(K) 
det(L) ° 


p+ (H(K, L)) = p_HU(K, L)) = 


Definition 4.1. The packing constant 5(K) for a body K is 
5(K) = sup{p,(H): H# a packing of XH} . 

The lattice packing constant 6,(H) for K is 
5, (K) = sup{p,(A(K, L)): L a lattice packing for K} . 


A symmetric convex body K in R” has a lattice packing with a lattice L if and 
only if L is admissible for the body 2K, whence 
vol(K) _ >" vol(K) 
A(2K) A(K) ° 


5,(K) = 


The difficulty of packing problems grows with the dimension, and much more is 
known in two dimensions than in higher dimensions. 
Theorem 4.2. For any convex body K in R’, 
5(K) =6,(K). 
If H(K) denotes the volume of the smallest hexagon circumscribing K then 


vol(K) 
A(K) ° 


5(K) = 


If K is also symmetric then 


vol(K) 


8(K) = 8,(K) = Frags 


936 J.C. Lagarias 


The history of these results, which are due to Rogers, Fejes-Toth, and 
Reinhart, is detailed in Rogers (1964, chapter 1). 
The least dense packing of any convex body in R? is given by a triangle. 


Theorem 4.3. For any convex body K in R’ 
5,(K) = 3(K) > 3, 


with equality if and only if K is a triangle. 


The problem of finding dense packings of spheres (i.e., of balls B,) first arose 
in bounding the minimum of a positive definite quadratic form in terms of its 
determinant, and more recently in connection with coding theory, see Conway 
and Sloane (1988a). It appears as part of Hilbert’s 18th problem. The densest 
sphere packing in R? is the hexagonal packing, and by Theorem 4.2 its density is 


vw 
6(B,) == = 0.9069... . 
(B.)= 35 
The densest sphere packing is not determined in any dimension 23. It remains a 
notorious unsolved problem (Kepler conjecture) to show that 5(B,) = 6,(B,), 
i.e., that the ‘‘cannonball” packing is the densest packing of 3-dimensional 
spheres. It is known that 


0.7407... = PY 3 = §,(B,) <8(B,) <0.7730.... 


For the latest word on this problem see Hales (1994). The relation of lattice 
packings of spheres to quadratic forms is incorporated in the following definition. 


Definition 4.4. Hermite’s constant y, is the smallest constant such that 
A(LY’ <y,(det(L))*" 


holds for all lattices L of rank n. 


That is, every positive definite quadratic form of determinant one takes on a 
nonzero value <¥, at an integer point. This shows that the lattice constant 
A(B,) =,” and that 


6,(B,) = pa hae vol(B, ) : 


The densest lattice packings of spheres and Hermite’s constants y, are known in 
all dimensions <8, see table 4.1. Table 4.1 also lists critical lattices, using the 
notation of Conway and Sloane (1988a, chapter 5). This notation reflects the fact 
that these lattices are associated to root systems of certain Lie algebras; however, 
the association of critical lattices and Lie algebras breaks down in high dimen- 
sions. 

Both lower and upper bounds are known for sphere-packing densities. 
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Table 4.1 
Densest lattice packings with balls B, 


dim Critical (y,)" 5,(B,) 
lattice 
1 Zz 1 1 
2 A s —_ = 0.90690 
: s wan? 
3 A 2 —_ = 0.74048 
: 3v2 
m 
4 D, 4 76 = 0-61685 
rT 
5 D 8 = 0.46526 
‘ 15V2 
= 
6 D os = 0.37295 
6 3 48Vv3 
r 
7 E, 64 aps = 0.29530 
4 
vw 
8 E, 128 apa 7 0-25367 


Theorem 4.5. There is a positive constant cy) so that 
Con 
6, (8,) Fon ’ 
so that one has 
> Fag (1 + O(1)) =0.05855n(1 + 0(1)) an, 


Schmidt (1958) proved the same lower bound for 6,(K) for any symmetric 
convex body K, for any c, < log 2 and all sufficiently large n. His proof shows that 
a “random” lattice L drawn from the set of all lattices of determinant 1 has a 
suitably large value of A,(L). 

The best upper bound is due to Kabatyanskii and Levenshtein (1978), and 
arose from studying problems in coding theory. 


Theorem 4.6. As n—© one has 


(8B )< 2 ~ (0.5990 +0(1))a 
a 


’ 
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so that 
y, $0.07731n(1 + 0(1)), asnom. 


The methods used to obtain these bounds were successfully applied by 
Levenshtein and Odlyzko and Sloane to solve the kissing number problem in 
dimensions 8 and 24, which is the problem of finding the maximum number of 
unit balls 8, that can touch one unit ball, see Conway and Sloane (1988a). 


Theorem 4.7. The maximum number of unit balls B, that can touch a given unit 
ball in dimensions 1, 2, 3, 8 and 24 is 2, 6, 12, 240, and 196560, respectively. 


These solutions are unique up to rotations in dimensions 1, 2, 8 and 24, see 
Bannai and Sloane (1981). That the kissing number problem can be solved in 
dimensions 8 and 24 is fortuitous, and arises because particularly dense lattice 
sphere packings exist in these dimensions, which have large automorphism groups 
and attain the bound. These are the lattices E, and the Leech lattice A,,. 

Next we consider packings of nonsymmetric bodies K. Minkowski observed that 
this case may in principle be reduced to that of symmetric bodies by using the 
difference body DK = 


Lemma 4.8. Any convex body K in R" has 


(K 
8(K) = 2° apy 810K) 
and 
KK 
5,(K) = 2 py 8(0K). 


In applying this result it is useful to have bounds relating the volumes vol(K) 
and vol(DK), see Rogers (1964, Theorem 2.4). 


Theorem 4.9. Any convex body K in R” has 


Z (DK) _ 
= ~aitK} 9. 


with equality on the right if and only if K is a simplex. 


It seems possible that simplices are the most difficult convex bodies to pack. 
This is true for R? by Theorem 4.3. 


Theorem 4.10. For a simplex T, one has 


2(n!)* »? 
Cnt SOT) $8(T,) <a 
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This result is due to Rogers and Shephard, see Rogers (1964, chapter 6). 

It is widely believed that in sufficiently large dimensions n there exist convex 
bodies K with 6,(K)<6(K), but this has never been proved. Rogers (1964) 
conjectures that 5, (B,) <6(8,) for sufficiently large n. 

A set # = {K +u,:i=1} of translates of a body K is a covering if all points of 
R" are in some translate. If in addition v, are the vectors in a lattice L then it is a 
lattice covering H(K, L). A covering is said to have density p_(H). 


Definition 4.11. The covering radius w(K, L) of a body K by a lattice L is 


M(H, L)= int {e R"= U (K+ »)} : 
vel 
The covering radius w(L) of a lattice L is u(B,, L). 


Alternatively, the covering radius (Kk, L) is the supremum of values ¢ such that 
some translate tK + v contains no lattice points. 

A lattice L gives a lattice covering of a symmetric body K if u(K, L) <1. For 
lattice coverings #(K, L) one has 


u(K, zy 
det(L) J 


p_(K(K, L)) = vol(K) min { 
Definition 4.12. The covering constant &(K) for a body K is 
O(K) =inf{p_(9%): XH a covering with K} . 
The Jattice covering constant 3,(K) is 


8, (K) =inf{p_(9(K, L)): % a lattice covering with K} . 


The covering problem is most well understood in two dimensions. Results of 
Fary and Fejes-Toth give the following theorem, see Rogers (1964). 


Theorem 4.13. For any convex body in R’ 


vol(K ) vol(K) 
TK) 2 8K) = 1K) = FR > 


where h(K) is the area of the largest hexagon inscribable in K, h,(K) is the area of 
the largest symmetric hexagon inscribable in K. Hence for symmetric bodies K, 


vol(K) 


8K) = 8K) = Fay 


Furthermore, one has the following. 
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Theorem 4.14. For any convex body in R’, 
O(K) <3, 


with equality if and only if K is a triangle. 


Fejes-Toth also showed that the circle gives the most dense lattice covering 
among symmetrical convex bodies, i.e., 


2m 
8, (K)<—= 
(kK) <7 


if K is such a body. 

The value of 9,(8,) giving the thinnest lattice covering with balls is known for 
n <5. In each case the extremal lattice is a scaled version of A*, the dual of the 
lattice A,, which is the n-dimensional lattice in R”*' defined by 


A, = {Xp Xq.-- +X, EL" x, $x, + +++ +x, =0} 
see Conway and Sloane (1988a, chapter 2). 


Rogers obtained good upper bounds for covering densities and for lattice 
covering densities in arbitrary dimension. 


Theorem 4.15. (Rogers 1957). For any convex body K in R" with n 2 3, 
O(K) <n logn +nloglogn+ Sn. 
Rather weaker density bounds are known for lattice coverings. 
Theorem 4.16 (Rogers 1959). For any convex body K in R" with n 23, 
8, (K) < nioBzos rte 
for some constant c. For the ball B, one has 
3,(B,) < en(log n)!'082(re”? 


for some constant c. 


For the unit ball B,, Coxeter et al. (1959) obtained a lower bound for the 
covering density. 


Theorem 4.17. One has 
n 
9,(B,) = (B,) = soe (1 + 0(1)) 


as n->o%, 
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Transference theorems relate lattice packings of a symmetric body K by a lattice 
L to lattice coverings of the dual body K* by the dual lattice L*. 


Theorem 4.18. For the ball B, one has 
HB, LAB, LY <3( Oh?) <4 


where y; = max(y,,..., ¥,). For a general (not necessarily symmetric) body K in 
R” one has 


HK, LYAA(K*, LY <c n* ’ 
0 


for an absolute constant cy. 


The first bound appears in Lagarias et al. (1990), and the second follows from 
Kannan and Lovasz (1988, Lemma 2.3 and Theorem 2.7). 


5. N-dimensional crystallography and tilings 


Crystalline structures provided one inspiration for formulating the concept of 
symmetry groups and for studying point lattices. The existence of crystals poses 
two distinct mathematical problems. The first is that of classifying the possible 
underlying arrangements in space of the atoms making up a crystal. This converts 
to the mathematical problem of classifying (up to isomorphism type) the possible 
discrete subgroups of the group of Euclidean motions E, in R° which have 
compact quotient space. The second problem is that of classifying the possible 
polyhedral shapes that crystals may take in nature. A third problem, not studied 
here, is the problem of inferring from physical measurements (e.g., angles of 
cleavage planes in crystals, X-ray diffraction patterns) a basis of the underlying 
lattice of atoms making up the crystal, which is of great interest to chemists. 

The first question was initially studied by Bravais (1850), in the form of 
classifying the possible discrete subgroups of Euclidean motions E, (space 
groups). Here the group E, of Euclidean motions in R” is the set of linear 
transformations 


E(x)=Qxt+n xER’, 


where Q € O(n, R) is an orthogonal matrix and n ER” is a translation. Early 
authors did not explicitly introduce the group structure but instead used the 
combined operations of rotations, reflections and translations. A complete 
classification of all possible 3-dimensional space groups was achieved by Fedorov 
(1885, 1889) in Russia and later independently by Schoenflies (1889, 1891) in 
Germany, and Barlow in England; they found 230 types of space groups. The 
methods used in this classification were special to three dimensions. Hilbert 
(1900) posed as the 18th of his celebrated problems that of showing the finiteness 
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of the classification problem for the N-dimensional version of this problem. This 
was solved by Bieberbach (1910a,b, 1912), who proved three major theorems. 


Bieberbach’s first theorem 5.1. Every discrete subgroup I of L, having compact 
fundamental domain contains n linearly independent translations. 


Bieberbach’s second theorem 5.2. The subgroup T consisting of all translations 
x->xtn in T is a free abelian normal subgroup of T of finite index, and it is a 
maximal abelian subgroup of I. 


Hence the finite group @ = I’/T has a trivial center, and its action on T = 2" by 
inner automorphisms defines a faithful representation of ® in GL(n, Z). Bieber- 
bach then used a result of Minkowski stating that there is an upper bound on the 
order of any finite subgroup of GL(n, Z) to prove the following result. 


Finiteness theorem 5.3. For each fixed n there are only finitely many isomorphism 
classes of groups I which can be obtained as an extension of Z” by a finite 
subgroup of GL(n, Z). 


Such groups are now called Bieberbach groups. The embedding of such groups 
r into E, is not uniquely determined up to conjugation g/g ' by an element 


g &E,. However, Bieberbach (1912) proved the following result for the affine 
group A, = {h: h(x) = Mx +n, MEGL(n, R), nER"}. 


Bieberbach’s third theorem 5.4. Two discrete subgroups I, and I, of E,, having 
compact oe domain are isomorphic if and only uf they are conjugate 
subgroups of A,, i.e., there exists hE A, such that I, =hTjh"'” 


For proofs of Bieberbach’s thoerems, see Buser (1985) or Charlap (1986). 
Zassenhaus (1948) pointed out much later that a converse to the Finiteness 
theorem 5.3 holds. 


Theorem 5.5 (Zassenhaus). Any group I which can be described as an extension 
of Z" by a finite subgroup ® of GL (n, Z) can actually be embedded as a discrete 
subgroup of the Euclidean group E,, so as to act discontinuously on R" with 
compact fundamental domain. 


In the process of proving his third theorem, Bieberbach had to show that any 
finite group ® has only finitely many embeddings into GL(n, Z) up to conjugating _ 
by an element of GL (n, Z), a result now called the Jordan—Zassenhaus theorem, 
after Zassenhaus (1938), although Bieberbach apparently first formulated it. 

The number of Bieberbach groups are shown in table 5.1. This count of 
Bieberbach groups does not distinguish between two groups of motions that are 
mirror images of each other. If mirror images are counted as distinct, one adds 11 
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Table 5.1 
Bieberbach groups 
Dimension x Number of distinct groups 
1 2 
2 17 
3 219 
4 4783 


more groups to obtain the 230 space groups counted by crystallographers. The 
4-dimensional result appears in Brown et al. (1978). 

The two-dimensional Bieberbach groups are given in table 5.2, taken from 
Milnor (1976). The finite quotient @ is either cyclic, consisting only of rotations, 
or dihedral, containing reflections also. (Of course the dihedral group of order 2 
is isomorphic to the cyclic group of order 2, but these two groups are listed 
separately in this table, since they are not conjugate as subgroups of GL(2, R).) 

The 54 Bieberbach groups consisting only of orientation-preserving elements 
(det Q = 1) are given in table 5.3. In addition there are 165 Bieberbach groups 
with orientation reversing elements. The quantities in parentheses give extra 
groups obtained if mirror image groups are counted as distinct. 

An important open problem remaining is to prove appropriate analogues of 
Bieberbach’s theorems that classify discrete subgroups of the affine group A,, for 
n = 4, see Charlap (1986). The case 1 = 3 is treated in Fried and Goldman (1983). 

Now we turn to the second problem, that of classifying the possible polyhedral 
shapes that crystals may take. Crystalline structures fracture in preferential 
directions, called cleavage planes. It is experimentally observed that such cleavage 
planes satisfy a law of rational indices, which is as follows. Given three linearly 
independent cleavage planes L,, L, and L,, let ¢,,, ¢,,; and ¢,, denote lines 
through the origin 0 parallel to the lines: of intersection L,;L,, L,QL, and 
L,L, respectively. Given any cleavage plane L let d,,, d,,, d,, denote the 


Table 5.2 
Bieberbach groups in the plane 
Group ® Order Number of Number of 
embeddings in GL(2, 2) Bieberbach groups 
Cyclic 1 1 1 
Cyctic 2 1 1 
Cyclic 3 1 1 
Cyclic 4 1 1 
Cyclic 6 1 1 
Dihedral 2 2 3 
Dihedral 4 2 4 
Dihedral 6 2 2 
Dihedral 8 1 2 
Dihedral 12 1 1 


Total 13 17 
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Table 5.3 
Orientation-preserving Bieberbach groups in R* 

Group Order Number of Number of 

embeddings in SL(3, Z) Bieberbach groups 
Cyclic 1 1 1 
Cyclic 2 2 3 
Cyclic 3 2 3 (+1) 
Cyclic 4 2 5 (+1) 
Cyclic 6 1 4 (+2) 
Dihedral 4 4 9 
Dihedral 6 3 5 (+2) 
Dihedral 8 2 8 (+2) 
Dihedral 12 1 4 (+2) 
Tetrahedral 12 3 5 
Octahedral 24 3 7 (41) 
Total 24 $4 (+11) 


distance from the origin 0 of the intersection LM €,, (which is ~ if undefined.) 
Then the ratios d,,/d,, and d,,/d,, are rational numbers. The law of rational 
indices has a second formulation. Given a lattice A, we say an r-dimensional flat is 
in a lattice direction of A if it is parallel to an r-dimensional lattice flat. Then a 
polyhedron P satisfies the law of rational indices (in R°) if there is a lattice A such 
that each face and edge of P is in a lattice direction of A. This definition of the 
law of rational indices extends to 2 dimensions in an obvious way. 

It is easy to see that any polytope in R” whose vertices are lattice points of A, 
which we call a Jattice polytope, satisfies the law of rational indices. There are an 
infinite number of combinatorially inequivalent types of such polytopes in all 
dimensions =2. 

Cleavage planes of crystals in nature satisfy additional restrictions, apparently 
related to the requirement that the atoms in the underlying crystal lattice be 
closely spaced in the direction of the plane of cleavage; this leads the rational 
numbers that occur in the law of rational indices to have small denominators. A 
mathematically precise version of characterizing all polytopes using such con- 
ditions has apparently not been formulated. Instead, a related, narrower, 
mathematical problem was formulated by the crystallographer Fedorov, which is 
that of characterizing those convex polytopes that tile R* by translations. 


Definition 5.6. A closed convex polytope P is a parallelotope if it tiles R" by 
translations (except for boundary points). It is primitive if each (n — 1)-dimension- 
al face coincides with the face of one translate P + u for u € A, and if each vertex 
belongs to n + 1 edges of bodies P + u. 


The study of parallelotopes was initiated by Fedorov (1889, 1949). He showed 
that in two and three dimensions any parallelotope has a center about which it is 
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centrally symmetric, hence its facets occur in pairs that are parallel. This 
motivates the choice of the name parallelotope. Two tilings by parallelotopes are 
isomorphic if there is an invertible affine transformation taking one to the other. 
Fedorov enumerated all parallelotopes in R°. One source of parallelotopes is as 
follows. Given a lattice A and an invertible n < n matrix A the Voronoi region or 
honeycomb associated to (A, A) is 


V(A, A) = {x ER": |Ax| <|A(x — u)| for all nonzero u € A} . 


Any Voronoi region is a parallelotope. Voronoi (1908) proved a partial converse. 


Theorem 5.7 (Voronoi). Every primitive parallelotope of R" with center at 0 is the 
Voronoi region V(A, A) for some lattice A and invertible matrix A. Every primitive 
parallelotope has exactly 2(2” — 1) facets. 


This leads to the following famous problem. 
Problem of Voronoi 5.8. Is every parallelotope in R” a Voronoi region? 


This was proved true for dimensions n <4 by Delone (1926). A sharpening of 
this result is due to Zitomirskii (1929). 


Theorem 5.9. Every parallelotope in R" whose (n —2)-dimensional faces each 
have (n — 2)-dimensional intersection with and are a face of exactly 2 other tiles in 
the tiling is a Voronoi region. 


R. Penrose discovered finite sets of polygonal tiles in R’ that tile the plane, for 
which all such tilings are nonperiodic. De Bruijn (1981) developed an algebraic 
theory for such tiles and tilings, and shows that that can be constructed from using 
projections from lattices in higher dimensions, e.g., a lattice in R° is used to 
construct Penrose tiles. The recent discovery of quasicrystals has a possible 
explanation using such nonperiodic tilings of R°, see De Bruijn (1986) and 
Gratias and Michel (1986) and Katz (1992). 


6. Lattice polytopes 


A lattice polytope or integer polytope is a convex polytope in R” all of whose 
vertices are in the integer lattice Z”. In recent years many results have been 
proved relating the number of lattice points in such bodies K to geometric 
invariants such as its volume, vol,(K). 


Definition 6.1. For a lattice polytope P (not necessarily full-dimensional) the 
lattice point enumerator L(P) counts the number of lattice points in P. Let L°(P) 
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count the number of lattice points in the relative interior of P. Then 
L(P) = L(P) — L°(P) = L(eP) 


counts the number of lattice points on the boundary oP of P. 


Definition 6.2. For a point x in an arbitrary polytope P of dimension d in R” the 
normalized internal angle a(x, P) of x is defined by 


__ vol,((eBy +2) NP) 
ae, PY him ol (eB,) 


Then for a lattice polytope P the total /attice internal angle function L(P) is 
L(iP)= > a(x,P). 


The geometric invariants considered are mostly generalizations of the notion of 
volume, which are special cases of mixed volumes. 


Definition 6.3. Given convex bodies K,,...,K, in R” the mixed volumes 
V(K;,,--->K,) are given by 


vol aK + F4KI= DG VK Kates oh 


"iz 
O<i)<--<i,<n , 


for all 4, 20,...,4,20. 


Ss 


A proof that mixed volumes are well-defined is given in Eggleston (1958). Note 
that V,(K, K,...,K)=vol,(sK) is the ordinary n-dimensional volume of sK. 
A special case of a mixed volume is the following. 


Definition 6.4. The quermassintegral of a convex body K in R” is 
W(K) =V,(K, K,...,K,B8,,...,8,), 


where 8B occurs : times and K n — i times, i.e., 
vol,(K + 18,)= > (7) WiAK)e 
i=0 
The intrinsic r-volume V.(K) of K for -O<r<n is given by 


V.(K) = (vol, _,(B,-,))"" (7) W,_(K) . 


It can be shown that when K is an r-dimensional polytope in R” then V.(K) is 
the usual r-dimensional volume vol,(K) of K, and V,_,(K) is half the surface area 
of K. 

Mixed volumes satisfy several important inequalities. 
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Theorem 6.5 (Brunn—Minkowski inequality). Given two convex sets K, and K, in 
R", set K, =(1—6)K, + 0K,. Then vol(K,)'” is @ concave function of 6, i.e., 


vol(K, )'" = (1 — 8) vol(K,)'’” + 6 vol(K,)'” . 


Now 
vol, (Ky) = > (F) 1-6)" 6'V(K,, 2-3, K,,4), 


where V(K,, — i, K,,i)=V,(Ky,...,Ko,%,,..-,K,) with K, taken (” ~ i) times 
and K, i times. One can deduce from Theorem 6.5 that 


V(Ky,n — 1, K,, 1)" = (vol,,(Ky))” | vol,(K;) , 


which is called Minkowski’s inequality. 


Theorem 6.6 (Alexandrov—Fenchel inequality). For n convex sets K,,...,K, in 
R" one has 


V,(K,,..-,K, =V,(K,,K,,Ky,.--,K,)V,(Ky, Ky, Ky,..-,K,)- 


For a proof see Burago and Zalgaller (1988) or Schneider (1993). The 
Alexandrov—Fenchel inequalities have had a number of direct combinatorial 
applications, e.g., to show log-concavity of certain counting sequences (cf. Stanley 
1981), and in the proof of the Van der Waerden conjecture for permanents by 
Falikman (1981) and Egorychev (1981) (see also Van Lint 1982). This asserts that 
the permanent of any doubly stochastic n by m non-negative matrix is always at 
least n!/n”. In particular, this implies that any r-regular bipartite graph on 2n 
vertices has at least n! (r/n)” perfect matchings (cf. chapter 3). 

A first set of results relates lattice point counting functions to the volume of an 
integer polytope. In 1899, G. Pick (according to Coxeter 1969) proved that the 
area of a lattice polygon is related to the number of lattice points it contains. 


Pick’s theorem 6.7. The area of any simple polygon P in R? (not necessarily 
convex) whose vertices are lattice points is 


area(P)= L(P)+1L(P)-1. 


This result is easily proved by showing that the right-hand side of this equation 
is additive, then splitting P into a union of lattice triangles containing no interior 
lattice points, and observing that any such lattice triangle has area 4. Generaliza- 
tions of Pick’s theorem, and the problem of counting lattice points in lattice 
polytopes, are related to toric varieties, see Morelli (1993) and Pommersheim 
(1993). 

J.E. Reeve found an extension to dimension 3; to do so he introduced the 
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quantities 
1 
L(InP) = #{nP NZ"}= #{P nz 


forn€Z,, which count lattice points in a blown-up copy of P. A general result 
for computing the volume in R” using these quantities was then found by 
MacDonald. 


Theorem 6.8 (MacDonald 1963). For a lattice polytope P in R", 


Hn =1yntvolgP)= 5 nity *) M(a- —j-1,P), 


where M(j, P) = LJP) —4L(j(aP)) if j = 1 and M(0, P) = 1 or 0 according as d 
is even or odd. 


This result applies more generally to any pure d-dimensional simplicial complex 
K composed of lattice simplices, except that one must take M(0, K) = x(K)— 
4+x(0K) where y(K) is the Euler characteristic of K, see MacDonald (1971). 

The lattice-point counting function L(kP) has a very nice structure, found by 
Ehrhart (1967b), and extended by McMullen (1975a) and others. 


Theorem 6.9. For a lattice polytope P the generating function 


, LRP ye! _ Efe 6PM! 0 (Pye 
~ a- r"*! ’ 


where the quantities cP) are nonnegative integers. In particular L(kP) is a 
polynomial of degree n in k, for all k > 1, i.e., 


iMs 


L(kP) = > L,(P)k' , 
i=0 
for suitable coefficients L,(P). 


The polynomial L(kP) is called the Ehrhart polynomial. Similarly, it can be 
shown that one has polynomials 


a 


L°(kP) = & LAP), 


L(kP) = > LAP)K' , 
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for k= 1. These coefficients have geometric interpretations, e.g., 
L,(P) = Li-(P) = LAP) = vol, (P) : 


F vol, _,(F) 
=]J° aid =1 a nA 
L,-(P)= Ly _(P)=4£,(P)=42 Gag n ky’ 
where F runs over all (m — 1)-dimensional facets of P, and Z” 1M F is the (n — 1)- 
dimensional sublattice of Z” generated by lattice points on the facet F. Also 


L,(P) =(-1)"Eo(P) = x(P), 


where x(P) is the Euler characteristic of P. Morelli (1993) obtains geometric 
formulae for all L,(P), see also Barvinok (1994). 

These results extend further to Minkowski sums of several lattice polytopes 
(recall Definition 2.3). If P is an integer polytope in R” let L,(P) denote the sum 
of the lattice points in Z” in the union of i-faces of P. 


Theorem 6.10 (Bernshtein 1976, McMullen 1975a). Let K,,... »K, be lattice 
polytopes in R". Then 
(a) L(k,K, +--+ +k,K,) is a polynomial in k,,...,k, fork, 20,...,k, 20. 
(b) For any i, OSi<n—-1, L(k,K,+---+k,K,) is a polynomial in 
.»k,, fork, #1,...,k, 21. 


The condition in (b) that all k;=1 is necessary because, e.g., for q=1, 
choosing k, =0 gives L,(0)=1, while the polynomial in (b) at k, =0 gives the 
Euler characteristic of the i-skeleton of K, which need not be one. Bernshtein 
(1976) gave a combinatorial proof, and showed that the mixed volume of 
K,,...,K, can be expressed in terms of coefficients of these polynomials. There 
is a close connection between these results and counting roots of systems of 
multivariate polynomial equations in (C*)”, see Bernshtein (1975). The most 
natural proof of Theorem 6.9 may be algebro-geometric. See Ewald (1985) for a 
survey, and a discussion of an algebro-geometric proof of the Alexandrov— 
Fenchel inequality 6.6 related to integer polytopes. 

There are reciprocity laws relating the quantities L(P), L°(P), and L(P) for P 
and its boundary dP, due to Ehrhart (1967a). 


Theorem 6.11. For a lattice polytope P in R", for O<isn, 
LAP)=(-1""L9P), LE (P)=(— (-1)" LP). 


Stanley (1974) has given a “monster” combinatorial reciprocity law extending 
this kind of reciprocity law in many directions. 

McMullen (1975b, 1986) related L(P) to £(P) for lattice polytopes. Define the 
internal angle B(F, P) of a face F of an arbitrary polytope P (of any dimension) to 
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be a(x, P) for any x & F in the relative interior of F. Then for lattice polytopes 
£(P) =X (-1)"" PO" BEF, PIL AF). 
F 


He also obtains an inversion law for intrinsic r-volumes, as a special case of a 
much more general inversion law. 


Theorem 6.12. For a polytope P in R", 
(=1)8, ime) VP) = 2, (- 1)" Fa(F, PVF), 
where 5, , is the Kronecker delta function and the sum is over all faces of P. 


Are there general inequalities relating the number of lattice points in a lattice 
polytope to its volume, and, more generally to its quermassintegrals? There are 
good lower bounds. For a convex body K in R” with nonempty interior, 
Bokowski et al. (1972) showed that 


L({K) > V,(K) — V,_4(K) , 


and observed that this is sharp. Hadwiger (1972) then obtained the following 
bound, also sharp. 


Theorem 6.13. For any convex body K in R", with nonempty interior, 
LK) & 1)" Vik). 
Schnell (1992) generalized the result of Bokowski et al. (1972) in another 
direction, to counting points in arbitrary lattices. 


For upper bounds the situation is more complicated. An inequality of Daven- 
port (1951) implies that 


L(P)= 3 L(P)< 2 (F) er). 
Wills (1973) conjectured that 
L(P)< 2 V(P). 


Overhagen (1974) proved Wills’ conjecture for 2 =3. Later McMullen (1975b). 
conjectured the stronger bounds 
LAK) <V(K), OSi<n. 


and observed that the following holds. 
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Theorem 6.14. If K is a convex body in R" then 
L(K) < V(K) 
holds for i=0, n—1 and n. 


These inequalities need only be proved for lattice polytopes P, because if P C K 
then V,(P) < V,(K) by general properties of mixed volumes (Burago and Zalgaller 
1988, p. 138). However, Hadwiger (1979) disproved Wills’ conjecture, hence also 
McMullen’s conjecture, showing that it fails for a regular n-simplex K, of edge 
length Vn for n = 441. Gritzmann and Wills (1986) proved a weaker bound 


L(K)< > a,.¥, V,(K) 


for certain complicated constants a,,,. They conjectured that this inequality still 
holds with (smaller) values a, , = 9;2'(vol,(B,)) ', where a, is the fraction of the 
volume inside a regular simplex of side 2 covered by d +1 unit balls centered at 
its vertices. Wills (1990) conjectures that a similar inequality holds with a, , = 
vol,(B,) vol, _(8,_;)(vol,8,) . Further inequalities on lattice-point enumerators 
appear in Betke and McMullen (1985), Wills (1991), and Gritzmann and Wills 
(1992). 

In R” for n =3 there exist lattice simplices of arbitrarily large volume that 
contain no interior lattice points, e.g., the simplex in R° with vertices {(0, 0, 1), 
(0, 1,0), (1,0, 0) and (1,1, m)}, for m€Z*. However, Hensley (1983) showed 
that if a lattice polytope P in R” contains a fixed nonzero number k of lattice 
points, then its volume is bounded in terms of n and k. This result was sharpened 
by Lagarias and Ziegler (1991), as follows. 


Theorem 6.15. Any lattice polytope P in R" that contains exactly k 21 points in 
dZ” in int (P) has 
vol, (P) <kd(7(kd +1)". 

There are examples of lattice simplices S containing exactly one interior point 
in dZ” having vol,(S) = (d+ 1)’ "/n\. See Reznick (1986) for related results. 

The simplest lattice polytope in R” is a minimal lattice simplex, which is a lattice 
simplex of volume 1/n!. Any lattice polytope in R? can be cut up into minimal 
lattice simplices, but for n 3 not all lattice polytopes can be cut up into minimal 
lattice simplices. However, one has the following result. 


Theorem 6.16 (Kempf et al. 1973). In any dimension n there exists an integer 
constant N, such that for every lattice polytope P in R" the homothetic image 


(N,!)P can be subdivided into minimal lattice simplices. 


Additional properties of lattice polytopes may be found in the book of Ebrhart 
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(1977) and surveys of Betke and Wills (1979) and McMullen and Schneider 
(1983). 


7. Algorithmic geometry of numbers and integer programming 


The fundamental theorems in the geometry of numbers are theorems asserting the 
existence of a lattice point (or set of lattice points) meeting certain conditions. 
The corresponding algorithmic problem is that of explicitly finding such lattice 
points or proving their nonexistence. The construction of such algorithms and the 
analysis of their computational complexity form the subject matter of the 
algorithmic geometry of numbers, cf. Kannan (1987b). These include algorithms 
for finding the Hermite normal form of an integer matrix, lattice basis reduction 
algorithms and integer programming algorithms. 

Given a basis B=[b,,...,8,] of a lattice L,, and a sublattice L, of finite 
index, there is a canonical choice of basis B = [6,,..., 6,] for L, with respect to 
the basis B, for which B = MB is in Hermite normal form, see Theorem 2.2. 
Frumkin (1977) and later Kannan and Bachem (1979) gave polynomial time 
algorithms for converting an integer matrix to Hermite normal form, or equiva- ° 
lently, given bases B, B, of Ly, L, respectively, finding U € GL(n, Z) such that 
the basis B = UB, is in Hermite normal form with respect to B. Chou and Collins 
(1982) improved the algorithm of Kannan and Bachem, obtaining the following 
result. 


Theorem 7.1. There is an algorithm which when given an n X m integer matrix M 
of full row rank n will find the unique UE GL(n, Z) such that M = UM is in 
Hermite normal form. If B denotes the maximal absolute value of an element of M 
then the algorithm takes at most O(m‘(m+nlognB)’) bit operations using 
integers of length at most O(n log(nB)) bits. 


A linear Diophantine equation is a set of linear equations Ax = 6 in which A is 
an m Xn integer matrix, 6 an m X 1 integer vector, and only integer solutions 
x€Z" are considered. If L(A) denotes the lattice of integer solutions to the 
homogeneous linear Diophantine equation Ax = 0 then the general solution of 
Ax = b is {x, + y: y € L(A)} where x, is any solution. Frumkin (1976, 1977), and 
Von zur Gathen and Sieveking (1976), and later Chou and Collins (1982) 
construct polynomial time algorithms to decide if Ax = 6 has an integer solution 
X; and if so to find it and a basis of the lattice L(A). 

The algorithmic version of Minkowski’s fundamental theorem 3.1 is that of 
finding a shortest nonzero vector in a lattice L with respect to a given norm. An 
algorithm to answer this question easily allows one to decide if a given symmetric 
convex body K contains a nonzero point of L, and if so, to find one. However, 
this problem appears to be computationally difficult even for simple norms such as 
the supremum norm (K =n-cube) or Euclidean norm (K = 8,). Consider the 
following problems. , 
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Sup-norm short vector problem 7.2. 
Input: A basis [b,,...,6,] of a sublattice L of Z” and an integer bound B. 
Question: Does L contain a nonzero vector v = (v,,...,u,,.) with all |v,|< B? 


Euclidean norm short vector problem 7.3. 
Input: A basis [b,,...,5,] of a sublattice L of Z” and an integer bound B. 
Question: Does L contain a nonzero vector v with |v|’ < B. 


The sup-norm short vector problem 7.2 is known to be NP-complete when the 
dimension n of the input may vary, see Van Emde Boas (1981). In particular, if 
P*NP then no polynomial-time algorithm exists to find the shortest nonzero 
supremum norm vector in an integer lattice L, cf. Garey and Johnson (1979). 
Deciding the complexity of Euclidean norm short vector problem 7.3 is a 
fundamental open problem. All known algorithms for finding the shortest 
Euclidean norm vector in an integer lattice require exponential time. 

It is natural to consider the weaker problem of finding a relatively short vector 
in a lattice. This problem can be shown to be nearly of the same computational 
difficulty as that of the apparently more difficult problem of finding a basis of 
short vectors of a lattice, see Lovasz (1986, p. 24). 


Theorem 7.4. A polynomial-time algorithm for solving problem (A) below gives a 
polynomial-time algorithm for solving problem (B). Given as input a basis 
[b,,...,5,] of a sublattice L of Z”: 

(A) Find a nonzero vector 6, € L with 


|b,| <a(n)a(L). 


(B) Find a basis [b,,...,5,] of L such that 
I] |6,| <n"a(n)" det(L). 
i=1 


Lovasz found a polynomial-time algorithm, now called Lovdsz’ lattice basis 
reduction algorithm or L’-algorithm, which finds a “good” basis, cf. Lenstra et al. 
(1982). This algorithm finds a basis of a lattice reduced in the following weak 
sense analogous to the Korkin—Zolotarev reduction conditions. 


Definition 7.5. An ordered basis [b,,...,8,] is Lovdsz-reduced if its Gram- 
Schmidt orthogonalization 


b=b7 + > w,b*, 1<i<n 
lej<i 


satisfies: 
(i) Size-reduction condition: |y,,| <4 for j <i. 
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(ii) For 1<i<n—1, the Gram—Schmidt vectors b;* satisfy 
os) <4|b}., + Bis id? | , 
It is relatively easy to prove the following bounds. 


Theorem 7.6. Any Lovész-reduced basis [b,,...,b,] of a lattice L satisfies: 
(i) For 1<isa, 


|b,’ < 2 NIL) ; 
(ii) |b,}---|b,|<2"°"”? det(L) . 


Lovasz’s algorithm proceeds to convert a given basis [b,,.. . , b,] to a Lovasz- 
reduced basis as follows. In a size-reduction step one arranges that the size- 
reduced condition hold. Find the largest j having some i with |y,,| >4. If m is the 
integer nearest y,,, form the new basis [b,,...,6,-;, b,—mb,, b;,,,-.-,d,], 
which has the same associated Gram-Schmidt vectors and new pw, =, for 
j<k <i and w;, = 4, — mp, for 1<k <j. In particular all |u,,| with i=k>j are . 
unaffected, while |;|<4. Continuing in this fashion one obtains a size-reduced 
basis in at most (5) such steps. In an exchange step one looks for some b, such that 
(ii) is violated (say the smallest such ) and exchanges 5; and b,,, in the ordered 
basis. The algorithm alternates size-reduction steps and exchange steps until it 
halts. To bound the number of steps the algorithm takes, one defines the invariant 


D(b,,...,6,)= I] yd ae 
i=t 


One can show that this invariant remains constant during a size-reduction step 
(since the 6; do not change) and decreases by a multiplicative factor of size at 
most 1V2 at each exchange step. Since D(b,,...,6,)=det(L), the number of 
exchange steps is finite, and the algorithm halts. If the input to the algorithm is a 
basis of an integer lattice L, then one obtains the following polynomial running 
time bound, see Lenstra et al. (1982). 


Theorem 7.7. Lovdsz’ lattice basis reduction algorithm, when given an ordered 
basis [b,,...,6,] of a sublattice L of Z” as input, produces a Lovdsz-reduced 
basis [b,,...,6,] of L in at most O(n’ log B) arithmetic operations on integers 
requiring at most O(n log B) bits, where B = max(|b,|": 1 <i <n). 


This algorithm has found many applications. The motivating application was a. 
polynomial-time algorithm to factor polynomials in Q[x], obtained by Lenstra et 
al. (1982). It has also been used to decide in polynomial time the solvability of the 
Galois group of the equation f(x) = 0 with f(x) € Q[x] (Landau and Miller 1983), 
to find good simultaneous Diophantine approximations, (Lagarias 1985, Lovasz 
1986), to solve low-density subset sum problems (Lagarias and Odlyzko 1985), 
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and to show that for many combinatorial optimization problems, polynomial-time 
solvability implies strongly polynomial-time solvability (Frank and Tardos 1987). 
It has also had various cryptographic applications, e.g., in cryptanalyzing 
knapsack public key cryptosystems and in predicting congruential pseudorandom 
number generators. Computer implementations of the algorithm have been used 
to disprove Merten’s conjecture (Odlyzko and Te Riele 1985) and to find new 
t-designs (Kreher and Radziszowski 1987). 

Schnorr (1987) found a hierarchy of successively stronger variants of this 
algorithm that find bases of lattices that are reduced in a stronger sense than 
Lovasz-reduced, and which are guaranteed to find shorter vectors in the lattice 
than Theorem 7.6(i). 

Lattice basis reduction ideas lead to a polynomial-time-computable version of a 
weakened form of Minkowski’s convex body theorem. We consider convex bodies 
K defined by a well-founded membership oracle. Such an oracle gives a sphere of 
radius r contained in K, a sphere of radius R containing K, and for each x € R” it 
correctly answers x © K or x ZK. The input size of such an oracle for K in R” is 
defined to be n + [|log R]| + [log r|]. 


Theorem 7.8. For variable dimension n there is an algorithm which when given a 
‘well-founded membership oracle for a convex body K in R" symmetric around 0 
with 


vol(K) 22”, 


always finds a nonzero vector in K 2". The algorithm makes only a number of 
oracle calls polynomial in the input size of K, and uses at most a polynomial 
number of bit operations in this input size. 


The algorithm uses a rounding procedure of Lovasz (1986, Theorem 2.4.1) to 
find a ellipsoid E centered at 0 such that E CK C(m + 1)vnE. This can be done 
so that the ellipsoid E = {x: x'B* Bx <1} has a rational matrix B with entries of 
polynomial length in the input size. Then one makes a linear change of variable of 
determinant 1 to take E to a sphere AG, taking Z” to a rational lattice L. Now 
one uses Lovasz’ lattice basis reduction algorithm to find a short vector x in L. 
Since 


vol(E) = (n + 1) 73"? vol(K) = (n +:1)77"72" , 


one has A<=2"-°'°8"), and also A,(L) <y, det(L)'’" <n. Now the bound in 
Theorem 7.6 shows that x € AB, for all n = ny. Its inverse under the linear map is 
the desired vector in Z". For n<n, one modifies this algorithm to find the 
shortest vector in L, using the method of Kannan (1987a). 

Next we turn to the inhomogeneous problem, the closest vector problem. 
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Euclidean norm close vector problem 7.9. 

Input: A basis [b,,...,5,] of a sublattice L of Z”, a vector wE Z" and an 
integer bound B. 

Question: Does L contain a vector uv such that |v — w|?< B? 


This problem and its supremum norm variant were shown NP-complete by Van 
Emde Boas (1981). The Lovasz basis reduction algorithm can easily be adapted to 
find a fairly close vector in polynomial time, cf. Babai (1986). If u(v, L) denotes 
the distance of v to the nearest lattice point of L, then for an integer lattice L one 
can find a vector bE L with 


lv - b|*<2"pQ, LY’, 


in polynomial time. 
Integer programming studies the set of integer solutions x = (x,,...,%,) €2Z” 
in the polytope P(A, b) determined by a set of linear inequality constraints 


Ax2=b. 


Of particular interest is the 0-1 integer programming problem which arises when 
the extra condition that all x, € {0,1} is present, i.e., the 2m inequalities x, =0 
and —x, =—1 occur in Ax = b. Many basic combinatorial problems are naturally 
formulated as {0,1} integer programs, e.g., the existence of a finite projective 
plane of order n, the existence of specific t-designs, the existence of an 
independent set of given size of a graph, the existence of a coloring of a graph 
with k colors, and the existence of a perfect tiling of a planar region with 
polyominoes of given shapes. The existence question for integer programs is the 
following. 


Integer program feasibility problem 7.10. 

Input: (A, b) are m X n and m X 1 matrices of rationals given in binary, with m 
arbitrary. 

Question: Does P = {x: Ax = b} contain a point x EZ"? 


This problem is well known to be NP-complete when the dimension n of the 
input is allowed to vary, see Garey and Johnson (1979). Indeed, extremely special 
subclasses of integer programs are NP-complete, including {0,1} integer pro- 
grams and even those integer programs with at most two variables per constraint, 
cf. Lagarias (1985). 

Lenstra (1983) used ideas from the geometry of numbers to obtain a polyno- 
mial time algorithm for solving the feasibility problem in a fixed dimension. 
Kannan (1987a) obtained the following improved bound. 


Theorem 7.11. For variable dimension n the Integer programming feasibility 
problem can be solved in O(n?"’*L) arithmetic operations on integers of O(n*"L) 
bits in size, where L is the number of bits in the input. 
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Lenstra’s basic idea is to either find a vector v € Z” — {0} such that 
max{(v,x):x€P} —min{(v,x):x€P} <c” , (7.1) 


‘or prove that no such v exists. This inequality says that the feasible solution 
polytope P = P(A, B) is “narrow” in the direction perpendicular to v. If no such v 
exists, then Lenstra finds a large cube inside the polytope P containing an integer 
feasible point. If such a v exists, then the quantity vu-x can take at most c” 
values. Now Lenstra can replace the original integer program by c” integer 
programs in (n — 1)-variables by adding an extra constraint 


(0.x) =m, 


where m runs over the c”’ values allowed by the inequality (7.1). Lenstra reduces 
the problem of finding such a wv to a short vector in a lattice problem, which can 
be solved in polynomial time. Kannan’s improvements come from replacing c” in 
(7.1) with n°’, at the cost of taking time O(7") to decide if a solution to (7.1) 
exists, see also Kannan (1987b). 

A related, more difficult problem, is to count the number of integer feasible 
solutions to an integer programming problem. This problem can be polynomial- 
time reduced to the problem of counting the lattice points in a lattice polytope, 
see Cook et al. (1992). Recently, Barvinok (1993b) showed that this latter 
problem is solvable in polynomial time when the dimension n is fixed. 

As a final example, we consider the problem of computing vol(P) for a 
polytope P. Dyer and Frieze (1988) and Khachiyan (1989) showed that it is 
#P-complete to compute vol(P) for a lattice polytope P when given either a list 
of all its vertices or a list of all its facets as input. Some special cases where it can 
be computed quickly are given in Barvinok (1993a). Recently Dyer et al. (1989) 
showed there is a good random polynomial time approximation algorithm for the 
volume of a convex body. They assume that the body K is described by a 
well-founded membership oracle. When given «> 0, the algorithm produces an 
estimate v6l(K) such that 


(1 + €) vol(K) = val(K) = (1 — €) vol(K) 


with probability 2, and the algorithm runs in time polynomial in 1/e and the input 
size of K. This contrasts with a result of Barany and Furedi (1986) asserting that 
for fixed n, and for every polynomial-time deterministic algorithm give a convex 
body K in R” specified by a well-founded membership oracle which produces 
both upper and lower bounds vol(K) and vol(K) for vol(K), there is some convex 
body K in R® for which 


vol(K) = (cen) vol(K). 


Thus Dyer et al. (1989) provide an example where a probabilistic algorithm is 
theoretically superior to all deterministic ones. 
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8. Connections to other areas of mathematics 


Lattices are ubiquitous. The following references give starting points for study of 
some of these other areas of mathematics. 

(a) Quadratic forms: Cassels (1978), Conway and Sloane (1988a), Milnor and 
Husemoller (1973). 

(b) Perfect and extreme forms: Coxeter (1951), Conway and Sloane (1988b). 

(c) Diophantine approximation: Bombieri and Vaaler (1983), Cassels (1957), 
Schmidt (1980). 

(d) Multi-dimensional continued fractions: Brentjes (1981), Hastad et al. 
(1989), Lagarias (1994). 

(e) Transcendental number theory: Baker (1975), Wustholz (1987). 

(f) Algebraic number theory and class field theory: Kubota (1987), Lang 
(1970). 

(g) Abelian varieties, Jacobian varieties, elliptic curves: Lang (1973, 1978), 
Martens (1978), Mumford (1983, 1984, 1991), Swinnerton-Dyer (1974). 

(h) Toric varieties and integral polyhedra: Danilov (1978), Fulton (1993), 
Kempf et al. (1973), Oda (1985). 

(i) Lattice polyhedra and random walks: Handelman (1987). 

(j) Classification of flat manifolds: Charlap (1986). 

(k) Discrete subgroups of hyperbolic space: Beardon (1983), Mumford (1971), 
Patterson (1975). 

(1) Discrete subgroups of Lie groups and symmetric spaces: Margulis (1991), _ 
Terras (1985, 1988), Zimmer (1984). 

(m) Nonperiodic tilings, quasicrystals: De Bruijn (1981, 1986), Gratias and 
Michel (1986), Janot (1992), Katz (1992), Radin (1991). 

(n) Coding theory: Calderbank (1991), Calderbank and Sloane (1987), Con- 
way and Sloane (1988a), Forney (1988a,b). 

(0) Numerical analysis: Dahmen and Micchelli (1988), Hua and Wang (1981), 
Neumeier and Seidel (1983). 

(p) Optimization: Saaty and Alexander (1975), Schrijver (1986). 

(q) Integral geometry: Santalé (1976, chapter 8). 
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1. Introduction 


What is the cardinality of the largest subset of {1, 2,...,N} that does not 
contain two relatively prime numbers? This is a typical problem in combinatorial 
number theory. That the problem is one in number theory, there is no doubt. But 
someone who leans towards combinatorics might prefer to think of it as a 
question of the largest complete subgraph of that graph on {1, 2,...,N} with 
edges that connect two numbers when they are not coprime. 

The above question illustrates a common theme in combinatorial number 
theory. Namely, what arithmetic properties must a ‘“dense’’ subset of the integers 
possess? One of the greatest theorems of this type, Szemerédi’s theorem, is 
discussed in section 6. But combinatorial number theory also deals with other 
issues. For example, under what conditions is a subset of the natural numbers a 
basis, i.e., for some h, every number can be represented as a sum of h or fewer 
elements from the subset. Such issues are discussed in section 3. Combinatorial 
sieve methods, the subject of section 2, takes its starting point at the inclusion— 
exclusion principle. Its simpler aspects might be described as a device for 
controlling the “combinatorial explosion” in the number of terms involved in an 
inclusion—exclusion argument. 

Combinatorial number theory can also deal with some classical problems of 
number theory when the methods used have a strong combinatorial flavor. In 
section 7 we present a proof of Wirsing’s theorem on perfect numbers. This gem 
uses nothing but simple counting arguments from elementary combinatorics. 

Combinatorial number theory is a relatively young field. In 1850, P. L. 
Chebyshev proved that 


x 
—— < a(x) <c, 


c; log x forx2=2, (1.1) 


x 
log x 
where 7(x) denotes the number of primes up to x, and c,, c, are positive 
constants. This result constituted important progress towards the prime number 
theorem, 2(x) = (1+ 0(1))x/log.x, which was not proved until some 45 years 
later. Chebyshev’s proof of (1.1) (which was later analyzed and simplified by 
Landau, Erdés and Diamond) had a certain combinatorial flavor. 

In the period 1915-1924, V. Brun essentially single-handedly began the subject 
of combinatorial sieve methods. In 1927, B.L. van der Waerden proved his 
famous theorem that whenever the set of natural numbers is partitioned into two 
sets, then one set contains arbitrarily long arithmetic progressions. L.G. Schnirel- 
mann, in 1930, used both Brun’s results and his own ideas on the relationship 
between density and bases to prove that the set consisting of one and the primes is 
a basis. Inspired by these works, combinatorial number theory came into full 
flower in the 1930s and 1940s. Certainly the most significant force to shape and 
define the subject both then and now has been P. Erdés. We are much indebted to 
him for his generous help with this chapter. 

In writing a chapter such as this, certain hard choices were necessarily forced 
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upon us. The subject is very broad and does not have clearly delineated 
boundaries. It soon became clear that we had no chance of covering it all. 
Moreover, our philosophy for this chapter mandated the inclusion of representa- 
tive proofs. Thus even the few areas that we do cover are not done encyclopaedi- 
cally. Fortunately there are several excellent books that treat various aspects of 
combinatorial number theory with great thoroughness, books that we refer to 
frequently throughout for further problems, details, and references. These are 
Sequences by Halberstam and Roth (1983), Old and New Problems and Results in 
Combinatorial Number Theory by Erdés and Graham (1980), and Unsolved 
Problems in Number Theory, 2nd edition, by Guy (1994). 

Some may consider the subject of integer partitions an important part of 
combinatorial number theory. Unfortunately, though, it is a subject we complete- 
ly ignore. The interested reader is referred to the excellent monograph of 
Andrews (1976). 

We now say a word about notation. If n € N (the set of positive integers), then 
t(n) is the number of positive divisors of nm, v(m) is the number of prime divisors 
of n, and Q(n) is the number of prime and prime power divisors of n. For 
example, 7(12) = 6, v(12) =2, and 2(12) =3. We say n EN is squarefree if n has 
no square factor exceeding 1. We define the Mébius function p(n) by p(n) = 
(-1)’” if n is squarefree, and «(n) =0 if n is not squarefree. The sum of the 


positive divisors of n is denoted o(n). The number of integers in {1, 2,...,2} 
that are coprime to n is denoted ¢y(7); this is Euler’s function, of course. 
The symbols #, @,... are reserved for sets of non-negative integers. If & is 


such a set, then A(x) denotes the number of members of # not exceeding x. By 
3 + B® we mean the set of numbers representable as a + b with aE 4, b © B. By 
2% we mean & + , by 38 we mean 2x + Xf, etc. By of — 8 we mean the set of 
numbers a — a’, where a, a’ € &. By || we mean the cardinality of «. 

The letters p, g shall always denote primes. The function log x is the natural 
logarithm. When we say f(x) ~ g(x) as x >, we mean f(x) = (1 + 0(1))g(x). 

So as not to have too long a reference section, we often give only one or a few 
later references on a particular problem so that an interested reader may begin a 
literature search. We do not mean to imply that the articles for which we give 
bibliographic data are necessarily the most important ones. Sometimes we defer 
all references to the extensive listings in Erdds and Graham (1980) or Guy 
(1994). 

Have you solved the problem at the start of the introduction? Suppose N > 1. If 
sf is the set of even numbers in {1, 2,...,N}, then |] = L$N]| and no two 
members of sf are coprime. Moreover, if a set @ C {1, 2,...,N} has more than 
L4N] members, then either 1 € & or B contains two consecutive numbers (which 
are Clearly coprime). Thus the answer is [5] if N>1; the case N = 1 is clearly 
degenerate. See section 7.4 for more on this problem. 


2. Combinatorial sieve methods 


Many number-theoretic problems can be reduced to a problem of the following 
type. A finite set w CN and a finite set P of prime numbers are given. Estimate 
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the number of of members of sf that are not divisible by any primes belonging to 
¥. In other words, we “sift out” the multiples of the prime numbers belonging to 
Y from # leaving the residual set whose cardinality S(, P) we wish to estimate. 
As an illustration, we are going to consider the following three problems: 

(i) Estimate a(x); 

(ii) Estimate the number of prime twins q, q’ with g'=q +2 and q' <x; 
(iii) For each x EN, estimate the number of prime pairs g, r with g+r=x. 
Problem (ii) is connected with the famous twin prime conjecture which asserts 
that there are infinitely many such pairs g, ¢ + 2. Problem (iii) is connected with 
Goldbach’s conjecture which asserts that if x is an even integer at least 4, then x is 
a sum of two primes. These are among the most famous unsolved problems in 

mathematics. 
For any set CN, let 


A(d)={nEA:d|n}. (2.1) 
First we are going to study problem (i). Let x =1, and let us write 

A={nENin<x}, (2.2) 
so that 

|sé(d)| = |x/d] . (2.3) 


Furthermore, let us write 
P ={p prime: p< vx}. (2.4) 


Consider the following simple fact: an integer n with Vx <n <x is a prime if and 
only if there is no prime p € with p|n. Thus if we start out from the set in 
(2.2) and we sift by the primes in the set # in (2.4), then the set left after the 
sifting procedure consists of the number 1 and the primes q with Vx <q <x [so 
that Si, FP) =1+ w(x) — a( vx). 

On the other hand, the number of integers left after the sifting procedure can 
be computed by the well-known inclusion—exclusion principle of elementary 
combinatorics (cf. chapter 21). In this way, we get the following formula: 


|{1} U {q prime: Vx <q <x}| 
m(vx) 
=|fl+ 2 (yi dll papa Ped (2.5) 


PY SPs <P ex 


In fact, to prove this identity, we have to show two facts: 

(a) the contribution of 1 and of each prime g with Vx <q <x to the right-hand 
side of (2.5) is 1; 

(b) if nm <x and it is divisible by at least one p & Y, then its contribution to the 
right-hand side of (2.5) is 0. 

a1 that only positive integers n<x contribute to the right-hand side of 
(2.5). 


We have (a) immediately since in the first term, ||, every positive integer 
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n <x is counted exactly once, while 1 and the primes g with Vx <q <x are not 
multiples of any prime p<=Vx, so are counted in none of the terms 
(-1)' slp, P2*** PadI- 

To show (b), assume that 1<n <x and p;, p;,..., p, are all the distinct prime 
divisors not exceeding Vx of n, where /=1. Then the first term |] on the 
right-hand side of (2.5) contributes with a weight 1. Any other term 
(-1)*|x¢(p, p2*++ p,)| contributes with a weight (—1)* if and only if p,, 
Po,---, P, ate chosen from p;, p3,..., p;; for a fixed k there are (}) such terms 
with this property. Thus the total contribution of this 7 to the right-hand side of 
(2.5) is 


1 > (-1)* (4 


By the identity 


! H 

> cy (,)=a-v!=0, (2.6) 
k=0 k 

this contribution is 0, which completes the proof of (2.5). 

Writing 

IT p=Pe), (2.7) 
pz 

we rewrite (2.5) in the following equivalent form: 
1+ m(x)—a(vx)= 2 w(d)|sé(d)|, (2.8) 

d|P(Vx) 


where yw is defined section 1. In view of (2.3), we obtain: 


Theorem 2.9. If x =1, then 
1+n(x)-a(vx)= » pwld)lx/d] . 


d|P( Vx) 
In fact, Legendre used this formula in his numerical studies of a(x). The sieve 
method described above is called the sieve of Eratosthenes. 
By choosing the set # in an appropriate way, problems (ii) and (iii) can be 
studied similarly. For example, in the case of problem (ii) we choose 


A ={n(n+2):nEN,n<x-2}, P={pprime: ps<vx}. (2.10) 


Then by using the inclusion—exclusion principle, one may similarly derive the 
following formula analogous to (2.8), where is now defined by (2.10) rather 
than (2.2): 


I{q: ¢, q+ 2are primes, Vx<q<x—2}}= > p(d)l(d)|. (2.11) 
d|P( Vx) 
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Here we have 
|sé(d)| = |{n(n +2): n EN, n<x—2,d|n(n+2)}|. 
It is easy to see that |sé(d)| ~ w(d)x/d, where 
w(d) = |{n EN: 0<n <d, n(n + 2) =0 (mod d)}} . (2.12) 


Clearly, #(p) =1 for p=2, w(p)=2 for any odd prime p, and by the Chinese 
Remainder Theorem, the function w(n) is multiplicative [i.e., w(mn) = o(m)w(n) 
when (m, n) = 1]. Thus for 4(d) #0, i-e., for d squarefree, we have 


|| s#(@)| — o(d)x/d|< w(a) <2", (2.13) 


where »(d) is defined in section 1. 
Finally, to attack problem (iii), one may choose 


= {n(x -—n): n EN, n <x} (2.14) 


and as in (2.4) and (2.10). We leave the further details to the reader. 

The main problem with the sieve of Eratosthenes is that as in (2.8) and (2.12), 
it gives the number of integers left after the sifting in the form of a sum, and this 
sum has “‘too many” terms. For example, the sum on the right-hand side of (2.8) 
has 7(P(Vx)) = 27° terms and, in view of (1.1), this is much bigger than the 
number a(x) which is being computed. As a consequence, one cannot use 
Theorem 2.9 for estimating a(x) as x > ©. Indeed, the best we can do is use the 
approximation 


\x/d| ~x/d, 


whose error is less than 1 so that, in view of |4(d)| <1, the total error would be 
bounded by the number of terms, namely 27°””?, 
Ignoring the error, this approximation would lead to the estimate 


x pu(d 1 

mx~ 2 wd) =x > w@) Il (1--). (2.15) 
d|P(Vx) d|P(Vx) pave p 

We now recall Mertens’ theorem, an elementary result in prime number theory: 


MI (1-= 


1 
pez ome ie ere G20) 


where 7 is Euler’s constant. Thus, from (2.15) and (2.16), we get the approxi- 
mation 
x 
a(x) *~c Togx’ (2.17) 
where c=2e %~1.123, but we get this approximation with an error term 


bounded only by 27) which is much greater than the approximating function. 
This error term is certainly not negligible since, by the prime number theorem, we 


974 C. Pomerance and A. Sarkézy 


have 
mx) = (1+ 0(1)) jog 


while the constant c appearing in (2.17) is larger than 1. The heuristic (2.17) does 
give the correct order of magnitude for w(x), a fact that we shall see can be 
expected (at least for upper bounds) from sieve methods. 

The situation is even slightly worse in case of problems (ii) and (iii). For 
example, in the case of problem (ii), if we re lace |sf(d)| by w(d)x/d, our only 
bound for the error in each term of (2. 11) is 2” as given by (2.13). This is worse 
than the error bound of 1 per term in our analysis of (x). [Note that the number 
of terms in (2.8) and in (2.11) are the same.] If we nevertheless make the 
approximation |¢(d)| ~ w(d)x/d in (2.11) we get 


l{q: q,q + 2are primes, g<x—2}|~ >) p(d)w(d)x/d. (2.18) 
d|P(vx) 


Using the fact that 4(d)w(d)/d is a multiplicative function of d, we have for any z 
that 


> u(d)w(d)/d = nt - tp) (2.19) 


4|P@) P 


Then, using Mertens’ theorem (2.16) and the fact that w(2) = 1, and w(p) = 2 for 
p > 2, we have by an easy calculation that 


I (1 - 1) ~e “a/log’z asz>™, (2.20) 
where 
a= 21] (1-1) 7J=1.3202..., (2.21) 


the so-called “twin prime constant”. Putting (2.19) and (2.20) (with z = Vx) into 
the heuristic approximation (2.18), we have the “conclusion”? that the number of 
twin primes up to x is of order of magnitude x/log’x with the “suggestion” that 
the asymptotic constant is 4e° "a. In fact, this is not far from the strong twin 
prime Conjecture which asserts that the number of twin primes up to x is 
(a + 0(1))x/log’x. That is, the above heuristic gives the conjectured order of 
magnitude, but not the conjectured constant. 

In general, the above attempts at using a sieve lead us to the following 
thoughts. There is no hope of giving asymptotics for the number of integers left _ 
after the sifting process if we sieve by a “large” set of primes Y. On the other 
hand, one may hope to get good bounds for the number of these integers (even if 
P is ‘‘large’”’) by some sort of refinement that reduces the number of terms in the 
sum ¥) , #(d)|sd(d)]. 

V. Brun, working in the period 1915-1924 and at least in part basing his work 
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on that of J. Merlin, was the first to succeed in modifying the sieve of 
Eratosthenes to prove highly non-trivial results with a sieve. First we are going to 
discuss a relatively simple version of Brun’s method which is called Brun’s simple 
(or pure) sieve. As we shall see it enables one to derive estimates very close to the 
conjectured best possible one in a quite cheap way. 

Brun’s first idea is to reduce the number of terms in the sum )), 4(d)|é(d)| by 
reducing the number of sifting primes. The simplest way to do this is to replace 
the condition p < Vx in the definitions (2.4) and (2.10) of ? by p <z, where z is 
a parameter much smaller than Vx whose exact value should be fixed as some 
function of x depending on the problem being studied. This means that, for 
example, in the case of the twin prime problem, we sift out only those integers n 
for which n(m + 2) has a small prime factor (i.e., at most z). Since the remaining 
integers include all the twin primes larger than z, in this way we get an upper 
bound for the number of twin primes between z and x. On the other hand we do 
not get any lower bound for this number. We might only hope to get a lower 
bound for the number of twin “almost primes” up to x, where qg in an “almost 
prime” if it has no prime factor up to z. (Usually in sieve methods the term 
“almost prime” is reserved for the case when log z/log x is bounded away from 0, 
so that if g <x is an almost prime, it has a bounded number of prime factors. We 
do not follow this convention here.) 

So let us now study the general sieve problem: given # CN finite and 


Y = {p prime: p<z}, (2.22) 
estimate 
S(A, P):= |{n EA: (n, P(z)) = 13], (2.23) 


where P(z) is defined by (2.7). Then by using the inclusion—exclusion principle, 
we get in the now familiar way that 


S(, P) = ~ w(d)|4(a)} , 


where #(d) is given by (2.1). 
The inclusion—exclusion formula is based on the identity (2.6). This identity is 
only a special case of the following more general identity: 


> (-1)* (‘) =(~1)! Cs ‘) for all j,JEN, (2.24) 


which can be proved easily by induction on j. This identity implies that the sum on 
the left-hand side is =0 for even j and <0 for odd j. 

The second idea of Brun is to utilize this alternation of sign for the sum in 
(2.24). We are going to show that for every t@ N we have both 


S(8, P)< ~, 2(d)|s6(d)| (2.25) 


v(d)62r 
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and 


S(t, P)> DD w(d)ls#(d)}. (2.26) 
| P(z) 
v(d)e2r-1 
To prove (2.25) we will show that if 2 © x, then the contribution of 7 to the 
right-hand side of (2.25) is 
(a) 1 for (7, P(z))=1, 
(b) 20 for (1, P(z)) > 1. 
The assertion (a) is trivial since if m € of and (n, P(z)) = 1, then v is counted only 
in the term d= 1 with weight (1) = 1. To show (b), write (n, P(z)) =p, p.°-: Dp, 
so that p,, P2,..., p, are distinct primes up to z and / 21. Then 7 is counted on 
the right-hand side of (2.25) only for d’s of the form 


d=piP2°** Py\P\P2°** Py» KS 2t. 


For a fixed k <2t, the number of these a’s is (j), and they get counted with 
weight (—1)*. Thus the total contribution of n to the right-hand side of (2.25) is 


Eo(). 


By (2.24), this sum is non-negative, which completes the proof of (b) and (2.25). 
In a similar way we can prove (2.26). 

As an application of Brun’s simple sieve we adapt (2.25) and (2.26) to the twin 
prime problem, getting a relatively sharp bound in a relatively simple way. 


Theorem 2.27, The number N of integers n <x — 2 such that both n and n+ 2 are 
free of prime factors up to z satisfies 


N~e"ex/log’z , 


where a is the constant defined in (2.21), and where the asymptotic relation holds 
as X,z—+& in the region z < x )/(20 log log x) 


Thus Brun’s simple sieve actually gives an asymptotic formula for the number 
of twin “almost primes’ up to x. Moreover, by making the largest choice of z 
allowed, it gives a non-trivial upper estimate for the distribution of twin primes: 


Corollary 2.28. The number of primes psx with p+2 also prime is 
O(x(log log x)*/ log’x). In particular, the sum 3) 1/p for primes p with p +2 prime 
is either convergent or finite. 


The number ¥) 1/p described in the corollary is referred to as Brun’s constant. 
Note that the O-estimate in the corollary is only off by a factor (log log x)” from 
the conjectured order of magnitude. 

On the other hand, Theorem 2.27 does not give any lower bound for the 
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distribution of twin primes. We still do not know if there are infinitely many; 
mentioned above, this is one of the great unsolved problems in mathematics. We 
have witnessed a general pattern with sieve methods: they often give good upper 
bounds, but no or weak lower bounds, unless one is interested in “almost primes” 
of some kind. 


Proof of Theorem 2.27. Define & by (2.10), A by (2.22), and S(, P) by (2.23). 
Then the quantity N in the theorem is just S(x%, ?). In view of (2.19) and (2.20) 
it is thus sufficient to prove 


S(4,P)=x D p(d)w(d)/d + o(x/log’z) . (2.29) 
d|P(z) 


Note that from the upper bound and lower bound for S(x, 7) given by (2.25) 
and (2.26), we have 


S(t, P) = p> p.(d)|s#(d)| + O ( ~ —) 


v(d)=2¢ v(d)=21 


for any choice of EN. Thus by (2.13) we have 


S(A, P)=x Dd way - > a Ol > =) 


alP(z) a|P(z) d|P(z) 
v(d}<2r v(d)=21 v{d)s2r 


=x D uae a ( > ae Ss a 


a|P(z) d|P(z) d|P(z) 
v(d)>2t v(d) S20 
=x > yay + O(E,) + O(E,) , (2.30) 
a} P(z) 


say. 
We shall show that E,, E, are O(x/log°z), O(x'’’), respectively, if we choose 


t= [5 log log z] 


(and z large enough so that f=1). Thus (2.30) will imply a strong version of 
(2.29) and thus prove the theorem. 

To estimate E,, we use a weak form of Mertens’ theorem. In fact, by taking the 
logarithm of (2.16), we have 


1 
> — <log logz+c 
psazP 


for some constant c and all large z. (This inequality is actually a weak form of a 
much older theorem of Euler.) Thus using w(¢@)<2” and the multinomial 
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theorem, 


w(d 1 

E,=x > Derd7 DF 
d | P(z) f>2t a|P(z) 
v(d)F2t v(d)=i 


z Ly 1 
<x) +(5 *\ =x » 77 (2 log log z + 2c)’. 
. fe2e °° 


i=2t pez Pp 


The terms in this last sum are decaying at least geometrically with a common ratio 
bounded below 1, so that the sum is majorized by its first term. Thus 


2e log | + 2ce | 
£, = 0 (Gr; 2 loglog 2 + 2e}") =0(z| oe =e ‘| ) 


= O(x(4e)'?'°8'°87) = O(w/log®z) . 


The majorization of E, is easier. We have 


2 
Ee > 2X4) ch > 1=27" Ss eu <2 q(2)". 
i=0 


d\P(z) d\P(z) 
v(d)=2¢ v(d)=2e 


For large z, we have m(z) <4z (in fact, z=8 will do), so that 
E, <27! ve Oe") : 


This estimate concludes our proof of Theorem 2.27. O 


To eliminate the unwanted (log log x) factor in Corollary 2.28, one needs 
Brun’s sieve in its complete form. To explain the crucial idea of Brun’s sieve we 
start out from the inequalities (2.25) and (2.26). These inequalities can be 
rewritten in the form 


2 x (d)w(d)Iet@)] < S(t, ) <2 ald )u(a))A(a)|, (2.31) 


where 


1 for v(@)<2¢-1 

wld)={9 for v(d) > 21-1, (2.32) 
1 for p(d@) <2, 

x)= {4 for v(d) >2r. 


The idea is to replace these two functions by certain other functions y,(d), x,{d) 
with the following properties: 

(a) xi(@), x(a) satisfy (2.31), 

(b) x,(1) = x2(1) = 1, 

(c) x(d) =0 or 1 if d| P(z) for i =1, 2. 
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The goal is to make the choice for y,, vy, so that we get better upper and lower 
bound estimates than that afforded by (2.32). 

Brun succeeded in constructing functions x,(d), x,(@) with all these properties 
and giving very good estimates for S(s¢, ?). The construction is too complicated 
to describe here; see, e.g., Halberstam and Richert (1974) for further details. 

We now cite one general theorem that may be proved by Brun’s sieve —it is a 
special case of Theorem 2.3 in Halberstam and Richert (1974). 


Theorem 2.33. Let kEN, let a,, b, be pairs of integers for i=1,...,k such that 
each (a,;,b;) =1 and 


e=(Ma)( 


Let 1>e>0, x, yER be arbitrary with 2<y <x, and let z=y*. Let 


II @o,- a,b,)) #0. 


ler<sek 


k 
a ={[](on+b):nEN,x-y<n<at (2.34) 


t=1 


and denote the number of solutions of I, (an + b,)=0 (mod p) by w(p) for 
each prime p. Then if P = {p prime: p <z}, 


1 w(p)-k y 
S(4, P) (1 -+) . 
( yee (u P ) log*y 


pzy 


where the constant c depends only on k and e. 


Note that all of the sets of as in (2.2), (2.10), and (2.14) are of the form (2.34). 
For example, by choosing k = 2, a, = 1, b, = 0, a, =1, b, =2, y=x, and z = Vx 
we obtain: 


Corollary 2.35. There is a positive constant c such that if x =2, then 


x 


\{q: g,q +2 are primes, g <x — 2}|<c ote 


Similarly, if we let k =2, a, =1, 6, = 0, a,=—1, b, =n, x =y =n, and z= Vn, 
we get: 


Corollary 2.36. There is a positive constant c such that if n € N and n is even, then 


1\""\_2 
tp: pand n—p are primes}| =e (TT cs, log’n ” 


Even for the simpler problem (i) where the prime number theorem gives us an 
asymptotic formula for a(x), Theorem 2.33 can tell us something non-trivial when 
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y is small compared to x. By choosing k=1, a,=1, and b, =0, we have the 
following result originally due to Hardy and Littlewood. 


Corollary 2.37. [f 2<y =x, there is an absolute constant c such that 


y 
log y ° 


Brun’s sieve has numerous applications and many of these are due to P. Erdés 
who, perhaps more than any other person, showed that sieve methods are indeed 
a powerful tool in number theory. For example, Erdés used Brun’s sieve to 
estimate the differences between the consecutive primes. Let p, denote the ith 
prime (so that p, =2, p, =3, p,=5S, etc.), writed, =p, —p,_, form >1, and let 
d, =2. It follows easily from the prime number theorem that 


a(x) — w(x — y) Se 


lim inf d, /log n <1. 
We now prove the following result due to Erdés (1940). 


Theorem 2.38. There is a constant c<1 such that ; 
lim inf d, /logn<c. (2.39) 


Proof. Let « >0 be arbitrary, but fixed. Suppose that 
lim inf d,/logn > 1 —te. (2.40) 


Then there is some x, such that for x >x,, if n > a(x/logx) we have d, >(1—- 
e)logx. Let L = a(x) — w(x/log x) and assume 6 = &(x, €) is such that there are 
exactly 6L values of n with a(x/log x) <n < a(x) and d, is between (1 — e)log x 
and (1 + e)log x. Then for x > x, there are (1 — 6)L values of 7 in this range with 
d, =(1 + e)log x. Thus 


6L(1-s)logx +(1—8)L(1+ ellogx< D> d, <x, 


nsm{x) 


so that 


e(1— 28) < =e (2.41) 


a 
L logx 
By the prime number theorem, L log x = (1 + o(1))x, so that (2.41) implies (since 
€ >0 is fixed) there is some x,(e) such that if x 2 x,(e€) we have 6 >4 

We now use Theorem 2.33 to show that 6 = O(e) so that if € is sufficiently 
small, (2.40) cannot hold. This will prove the theorem. For any EN, let D(t, x) 
denote the number of primes p <x with p +t prime. Thus 


6L< > D(t, x). (2.42) 


(L-e)log x<t<(1+e)log x 


But by Theorem 2.33 with k = 2, a, =1, 6, =0, a, =1, 6, =t, y=x, and z = Vx, 
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we have some absolute constant c’ with 


_1\"\_+ 
D(t,x) <c (1 (1 5) Vee (2.43) 


Pex 


From elementary arguments it is not difficult to prove that 


= 
> IT (1-<) ~c"u asu—>o 


ru plr 


for some constant c”. Thus 


D(t, x) <c'(c” + 0(1))2e log x 


(1-8) log x<r<(1t+e)log x log” 
Since L ~ x/log x, (2.42) thus implies for x > x,(e) 
6 <3c'c’e , 


which is what we wanted to prove. 0 


Since 1942, the value of the constant c in (2.39) has been improved by several 
authors. The best estimate (derived by both combinatorial and analytic tools) has 
c <4 and is due to Maier (1988). Of course the twin prime conjecture implies that 
c=0. 

Theorem 2.33 can be proved also by another sieve method of less combinatorial 
nature which is due to A. Selberg. In some applications, Selberg’s sieve is slightly 
superior to Brun’s. 

Returning to the three problems at the beginning of this section, note that in 
problem (i) we counted integers n #0 (mod p) for primes p <z, in problem (ii) 
we counted integers satisfying n ~0 (mod p) and n #2 (mod p), and in problem 
(iii) the excluded classes were n 40 (mod p), n#x (mod p). In other words, 
there are 1, 2 and 2 “forbidden” residue classes, respectively. Brun’s sieve and 
Selberg’s sieve have a common feature: both methods can be used only in the case 
that the number of forbidden residue classes is bounded or it grows only very 
slowly in terms of p. If the number of forbidden residue classes grows rapidly (for 
example, more than cp residue classes for each p), then other sieve methods must 
be used. The most important sieve method of this type is the /arge sieve of Linnik 
and Rényi. 

See Halberstam and Richert (1974) for detailed discussion of “small sieves” 
(Brun’s and Selberg’s sieves) and Montgomery (1971) for the large sieve. The 
former reference also contains proofs of J. Chen’s remarkable theorems that (1) 
there are infinitely many primes p such that p + 2 is either prime or the product of 
two primes, and (2) every sufficiently large even number is the sum of a prime 
and another number which is either prime or the product of two primes. 

Finally we mention Rosser’s sieve, a general principle for a combinatorial small 
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sieve. Iwaniec (1981) has done extensive work developing this general principle 
and has given details for some important special cases. 


3. Bases and density theorems on addition of sets 


As mentioned in the preceding section, it is conjectured that every even number 
exceeding 2 is a sum of two primes. This conjecture, which was stated by 
Goldbach in a letter to Euler in 1742, has the immediate corollary that every 
number exceeding 5 is a sum of three primes and that every number exceeding 1 
is a sum of at most three primes. 

In 1770, Waring stated without proof that for every n there is some number g(n) 
such that every natural number is the sum of at most g(”) positive nth powers. In 
that same year, Lagrange solved Waring’s problem for n = 2, showing that every 
natural number is the sum of at most four squares. In 1909, Waring’s conjecture 
was finally proved by Hilbert using a combinatorial argument. 

Let N, =NU {0} denote the set of non-negative integers. A set of CN, is said 
to be a basis of order k if every natural number can be represented as the sum of 
at most k elements of &. If every sufficiently large integer can be represented as 
the sum of at most k elements of %, then # is said to be an asymptotic basis of 
order k. Thus Goldbach’s conjecture implies that the set of primes is an 
asymptotic basis of order 3 and that the set of primes together with 1 is a basis of 
order 3. Not only has Waring’s problem been settled, the minimal choices for the 
numbers g(n) are “known” for every n. If G(n) is the least number such that the 
positive nth powers form an asymptotic basis of order G(n), then no value of 
G(n) is known except for n = 1, 2, and 4. See Vaughan (1981) and Balasubrama- 
nian et al. (1986) for more details. 

If f CNo, then the lower asymptotic density d(), the upper asymptotic density 
d(sf), and, if it exists, the asymptotic density d(¢) of # are defined by: 


d(x) =lim inf A(n)/n , 
d(x) =lim sup A(n)/n , 
d(sf) =lim A(n)/n , 


respectively. 

Assuming the Riemann hypothesis, Hardy and Littlewood proved in 1922 that 
every sufficiently large odd integer can be represented as the sum of three primes, 
which would imply, of course, that the set of primes is an asymptotic basis of 
order 4. The Hardy and Littlewood theorem was proved unconditionally in 1937 
by Vinogradov. But the first to unconditionally prove that the set of primes is an 
asymptotic basis of some finite order was Schnirelmann in 1930. 

This work of Schnirelmann opened up an important chapter in combinatorial 
number theory. His starting point was the following simple corollary of Brun’s 
sieve and, in particular, Theorem 2.33. 
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Corollary 3.1. The set of integers which can be represented as the sum of two 
primes has positive lower asymptotic density. That is, if ® denotes the set of 
primes, then d(2?)>0. 


Thus to prove that # is an asymptotic basis of finite order, it would be 
sufficient to show that any set of positive lower asymptotic density must 
necessarily be an asymptotic basis of finite order. Unfortunately, this is not so, as 
the set 


a ={0,2,4,...} (3.2) 


of even non-negative integers show. This set has asymptotic density 4, but no odd 
integer is a sum of members of 3. 

However, Schnirelmann was able to save this idea with his concept of 
Schnirelmann density. If & CN, and we write A*(n) for the number of positive 
members of # up to n, then the Schnirelmann density o(#) of of is defined by: 


o(#)= inf A*(n)/n. 


Thus o()>0 holds if and only if both 1€ # and d(#)>0 hold. In addition, 
o()=1 if and only if #@=N or No. Schnirelmann proved the following 
theorems on the Schnirelmann density of sum sets. 


Theorem 3.3. If 4, BCN, with OE 41, then 

a(t + B) = a(s)+ o(B)— a(S )a(B). 
Proof. We may assume o(@)>0. Let n be an arbitrary natural number and 
suppose 

l=a,<a,<---<a,Sn 
are the positive members of #& that do not exceed n. Since 0€ &, we have also 
@,,...,a,E8 + B, What other members in + @ do not exceed n? We now 
count those members of # +9 of the form a,+b, where i<k, b€%, and 
a,<a,;+b<a,,,. This number is B*(a;,,—a@;—1)=(@,., —4@; — 1)o(B). Simi- 


larly the number of a, + 6, where b € B and a, <a, +b <n, is B*(n-a,)2(a- 
a,)o(@). Thus the number of positive members of + @ up to n is at least 


A*(n) + (a ~a,)0(B) + D (G41 - 4, - 1)0(B) 
=A*(a)t+(n-a,—k+ l)o(B) 
= A*(n) + (n— A*(n))o(B) 
=(1—o0(B))A*(n) + no(B) 
2 n(1 — a(B))a(h) + no(B) , 


which proves the theorem. O 
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Theorem 3.4. if ¢, BCN, with OES N& and o(4)+ o(B) = 1, then a(8 + 
B)=1. That is, every non-negative integer can be represented in the form a+b 
with aE 4, and bE @. 


Proof. If nE 4 US, then n€ MS + B. So suppose nEN, and n¢ UB. Then 
n> 1. We have 


n= A*(n) + B*(n) = A*(n—1)+ Bt(n-1). 


Consider the positive integers a€ # for a<n, and n—b for bE B, 0<bd<n. 
There are at least » of these numbers and they all lie in (1, 2,...,2—1}. Thus 
we have a=n—b for some aE #, DEB, sothatn=at+tbExe +B. O 


Corollary 3.5. If & CN, with o()>0, then & is a basis of some finite order. 


Proof. Write 9%, = sf U{0}. Then by Theorem 3.3 and induction, o(kx,) = 1 — 
(1 — a(,))* for every KEN. Thus there is some k with a(k4,) 24. Thus by 
Theorem 3.4, with ¢ = B= ks,, we have 2k, = NU {0}. Thus # is a basis of 

order 2k. O . 


Schnirelmann’s theorem on the set of primes # follows easily from Corollaries 
3.1 and 3.5 as we now see. 


Theorem 3.6. The set of primes is an asymptotic basis of finite order. 


Proof. Write ?, =? U {0,1}. Then by Corollary 3.1, o(2%,)>0, so that by 
Corollary 3.5, 2P, is a basis of some finite order. Thus , is a basis of some order 
k. Thus every n GN may be represented in the form s + v, where s is a sum of at 
most & primes and 0<u <k. 

We now show that every n > 2 may be represented as a sum of at most 3k + 1 
primes. Indeed, n —-2EN, so we may write n —2=s +0, where s is a sum of at 
most k primes and O<u <k. Then n=s+(v+2) and2<v+2<k+2., But it is 
trivial that every integer m > 2 can be represented as a sum of 2’s and 3’s with at 
most $m summands. Thus v + 2 is a sum of at most 4(k + 2) primes, each prime 
being 2 or 3. Thus n =s + (uv +2) is a sum of at most $k +1 primes. O 


In 1942, Mann improved on Schnirelmann’s theorems 3.3 and 3.4 by proving 
the following result which had come to be known as the a + £8 conjecture. 


Theorem 3.7. If @, BCN, with OE NB, then o(sd +B)>min{l, o(d) + 
o(B)}. 


Thus in the case a(.#) > 0, o(@) > 0, and a(x) + o(B) <1, Theorem 3.7 gives 
a sharper result than Theorems 3.3 and 3.4. As we saw above, these latter 
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theorems can be proved relatively easily. However, Mann’s theorem 3.7 is much 
deeper. 

A disadvantage of Schnirelmann’s and Mann’s theorems is that in both cases, 
the statement is formulated in terms of Schnirelmann density which is a fairly 
artificial concept. In particular, if we use these results for estimating the order of a 
basis, then we often get rather poor estimates. Thus in many applications it would 
be preferable to have an addition theorem involving asymptotic (lower) density. 
In 1953, Kneser proved the following (very deep) theorem. 


Theorem 3.8. if %,,..., 4%, CN, are infinite, then either 


d(, + +++ + f,)2lim inf (A(n) + +++ + A,(n))/n 


or there are natural numbers g, a),...,@,, such that 
(i) each sf, is contained in the union 4, of a, distinct congruence classes 
(mod g), 
(ii) there are at most finitely many positive members of 4, +---+ 4, not in 
Hy ts+++ A, 


(iii) d(l,+---+A,) 2 (ay t+: +a, —kKig. 


The following result of Nathanson and Sarkdézy (1989) gives a particularly 
simple application of Kneser’s theorem 3.8. 


Theorem 3.9. If sf CN is an asymptotic basis of order h and if d(sf) > 1/h, then 
for every B with 1/h< B <d(#), there is a set BC with d(B®) = B and with B 
also an asymptotic basis of order h. 


Proof. Let sf be as described and choose 8 with 1/h< 8B <d(#). Let € be any 
subset of of with d(€) = 8. Let H =(h —1)/(AB — 1) and let 4% Co be a finite 
set such that for each j/ <H, &, contains a representative of each residue class 
(mod j) that has at least one representative in &. We claim that B= € US, 
fulfills the conditions of the theorem. 

First, it is clear that d(@) =f, since , is finite. To show @ is an asymptotic 
basis of order h, we apply Kneser’s theorem 3.8 with 4 copies of @. Since 
d(B)>1/h, the first condition cannot hold. Thus there is some number g and a 
set @’ of a residue classes (mod g) such that @C@’, all sufficiently large 
members of 48’ are in AB, and 

1 > ata = OW) 5 ip --=. 


Thus g <H, so that & contains representatives of exactly the same residue classes 

(mod g) as does #. Since 8 C B' and B’ is a union of complete residue classes 

(mod g), we have »«f CB’. Thus B’ is an asymptotic basis of order h, as is B. 
O 
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Kneser’s theorem 3.8 serves a unifying role in additive number theory in that 
certain prior results with longer proofs can also be seen as fairly immediate 
corollaries of Theorem 3.8. For example, it is possible to prove via Schnirel- 
mann’s theorem 3.3 that if #@ CN, d(#) > 0, and »# contains a finite subset that is 
relatively prime, then # is an asymptotic basis of some order — this is essentially 
what is done in the proof of Theorem 3.6. But assuming Kneser’s theorem, the 
result is virtually immediate. 

As one further example, we state the following corollary of Kneser’s theorem, 
leaving the proof for the reader. [The proof is not completely trivial—for help, 
consult Halberstam and Roth (1983, pp. 54-55).] Parts of this result are due 
independently to Cauchy in 1813, Davenport in 1935, and I. Chowla in 1935. It is 
also possible to give a (relatively simple) direct proof, not using Kneser’s 
theorem. 


Theorem 3.10. (The Cauchy—Davenport—Chowla Theorem). [f #, 8 CZ/g with 
|s|=r, |A|=s, OE B, and every other member of B is coprime to g, then 
|sf + B| > min{g, r+s—1}. 


When we are studying a finite set # CN and we need an addition theorem, 
then the only assumption that one might like to use is that # C {1,2,...,N} and 
||/N is large. Improving on a joint theorem with Nathanson, Sarkézy has 
recently proved the following addition theorem of this type, see Sarkézy (1989/ 
1994). 


Theorem 3.11. Assume that @ C {1,2,...,N} and that || >(N/k)+1, where 
KEN. Then there are d, 1EN with d<k, 1<118/k such that lx contains an 
N-term arithmetic progression of multiples of d. 


Bourgain (1990) and Freiman et al. (1992) have recently shown that if of is 
“dense”, then there are “long” arithmetic progressions in 2 and considerably 
longer ones in 3%. 

Using exponential sums and methods from the geometry of numbers, Freiman 
(1973) gave a deep analysis of the structure of sum sets of the form kf for finite 
sets of assuming that |k#| is not much greater than ||. Indeed, suppose 
k,,...,k, are integers at least 2, u, v,,...,v, are integers, and there are k, -+-k, 
distinct integers of the form 


d 
n=ut > xv, where x,€ {1,...,4,} fori=1,...,d. 


t=] 


Then the set ? of such numbers n: is called a d-dimensional arithmetic progression 
of size |P| =k, ---k,. Freiman’s most important result, the so-called ‘doubling 
theorem’’, says that if |2s¢| is not much greater than ||, then |x| can be 
well-covered by a generalized arithmetic progression: 
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Theorem 3.12. For all a > 1 there are constants c, =c¢,(a), ¢, =¢,(a) such that if 
\284|< a|sd|, then there is a generalized arithmetic progression P of dimension 
d<c, with & CP and |P|<c,|A|. 


Many further details, results, and problems on addition theorems and bases can 
be found in Halberstam and Roth (1983), Stohr (1955), and Ostmann (1956). 

A set a CN, is said to be a minimal basis of order k if is a basis of order k, 
but no proper subset of of is a basis of order k. A set & CN, is said to be a 
maximal nonbasis of order k if is not a basis of order k, but A U {a} is a basis 
of order k for every a E No\&. Stohr, Hartter, Erdés, Nathanson and others have 
studied properties of minimal bases and maximal nonbases, see Stohr (1955), 
Erd6s and Nathanson (1987) and Nathanson (1989). 

Similarly one can define the concept of a minimal asymptotic basis or maximal 
asymptotic nonbasis. Note that an immediate corollary of Theorem 3.9 is that if sf 
is a minimal asymptotic basis of order h, then d(#) < 1/h. That this result is sharp 
is shown in Erdés and Nathanson (1988). 

If @ CN, is such that 


x CN, and 0< o()< 1limply o(# + B)>oa(8), 


then @ is said to be an essential component. In 1933, Khintchin proved that the 
set of squares is an essential component. In 1936, Erd6s generalized this theorem 
as follows by proving that every basis is an essential component. 


Theorem 3.13. If @ CN, is @ basis of order h and x4 CN, is arbitrary, then 
1 
a(f + B)=a(a#) +5, 0 ~ a(#))o(#). 


Landau slightly sharpened this result. Using a complicated graph-theoretic 
argument, Pliinnecke improved the conclusion of Theorem 3.13 to the much 
sharper o( + ®)20(sf)'"'. Ruzsa (1990/91) analyzed Pliinnecke’s method 
and gave further applications. 

In 1942, Linnik proved the existence of a “‘thin’’ essential component @ with 


B(x) < exp[(log x)""**]. 


This shows that an essential component need not be a basis. Wirsing improved on 
this estimate and Ruzsa (1987), using exponential sums, proved the following 
theorem (which settles the problem). 


Theorem 3.14. For every e>OQ there is an essential component B with B(x) = 
O((log x)'**). Moreover, if B is any essential component, then there is some c>0 
such that B(x) > (logx)'** for all sufficiently large x. 


It is not known if @ = {2°3’: i, j €No} is an essential component. Note that 
B(x) ~ c log’x for some c>0. 
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4, Other additive problems 


4.1, Sidon sets 


As before, let No=NU {0}. If #CNo, let s(xf,n) denote the number of 
solutions of 


ata’=n witha,a’EA, asa’. 


In 1931, S. Sidon posed the following two problems: 

(i) How “dense” can & be if s(#,2)<1 for all n? 

(ii) What is the slowest growing function f() such that for some , 

1<s(#, n) <f(n) holds for all n EN,? 

The first question motivates the following definition. If a set o# CN, satisfies 
s(4,n) <1 for all n, then it is called a Sidon set. The first remarkable fact about 
Sidon sets is that the greedy algorithm provides a simple way to show the 
existence of relatively dense, finite Sidon sets. 


Theorem 4.1. [f NEN, then there is a Sidon set 4 C{1,2,...,N} with |xt|> 
IN]. 


Proof. Clearly it suffices to show that if {@,, a,,...,4,}C{1, 2,...,N} isa 
Sidon set of cardinality ¢ and 

t<N'O-1, (4.2) 
then there is an integer b such that 

1<b<N, b€{a,,a,,...,a,}, (4.3) 
and 

{a,,@,,...,a,} U {b} is a Sidon set . (4.4) 


To show this, note that if an integer b satisfying (4.3) does not satisfy (4.4), then 
there are a,, a,, a, with 


a,+b=a,+a,, 
or there are a,, a, with 
b+b=a,+a,. 


There are at most ¢° choices for triples a,, a@,, a, and at most t choices for pairs 
@,,, @,, SO there are at most f° + 1° “bad” b’s. Thus in view of (4.2), the number of 
“good” b’s satisfying (4.3) and (4.4) is at least 


N-t-(0+P?)2=N-(t+1) +121, 


so there is at least one “good” b. O 
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This argument can be modified easily to give the existence of a dense infinite 
Sidon set. 


Theorem 4.5. There is a Sidon set sf CN such that A(n)= |n'*| for all nEN. 


In the finite case, the estimate obtained in this way can be improved 
considerably. In fact, by using Singer’s theorem on perfect difference sets, Erdés 
and Chowla independently proved in 1944 the following result. 


Theorem 4.6. For infinitely many NEN, there is a Sidon set 4 C {1, 2,...,N} 
with ||>N"’. For all NEN, there is a Sidon set oC {1, 2,...,N} with 
|.f| > N"? — cN*’!® (for some absolute positive constant c). 


On the other hand, in 1941 Erd6és and Turan had proved the following. 


Theorem 4.7. There is an absolute, positive constant c such that if NEN and 
AC (1, 2,...,N} is a Sidon set, then |sf|<N''? +cN*, 
Erdés conjectures that the expression cN*’'* in Theorem 4.6 can be replaced 
with c and that the expression cN'’* in Theorem 4.7 can be replaced with N°). 
Much less is known in the infinite case. The best-known lower bound, due to 
Ajtai et al. (1981) is annoyingly only slightly better than the near-trivial Theorem 
4.5, 


Theorem 4.8. There is a Sidon set € CN such that A(n) >10-*(n log n)'"? for all 
sufficiently large n. 


We can do much better if we only want an Q-result. Improving on a result of 
Erdés, Kriickeberg showed the following in 1961. 
Theorem 4.9. There is a Sidon set & CN such that 
lim sup A(n)/Va > 1/V2. 
Erd6s conjectures that this lim sup is 1. Note that by Theorem 4.7, it is at most 
1 


We do know the lim sup in Theorem 4.9 cannot be replaced by lim inf, as the 
following result of Erdés in 1955 shows. 


Theorem 4.10. There is an absolute constant c such that if 2 CN is any infinite 
Sidon set, then 


lim inf A(n)/Yn/logn<c. 


The gap between Theorems 4.8 and 4.10 remains an important unsolved 
problem in the subject. 
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We can obtain interesting problems and results if we relax the condition 
(4, n) <1 of a Sidon set to s(#, n) <g. In 1960, using probability theory, Erdés 
and Rényi proved, among other interesting results, the following. 


Theorem 4.11. For every e>0, there is a number g=g(e) and an infinite set 
CN with s(,n)<g for all nEN and 


lim A(n)in'?-* =o, 


In connection with problem (ii), in 1956 Erd6és, using a probabilistic method, 
proved the following. 


Theorem 4.12. There are positive constants c,, c, and a set A CN such that 
c, logn<s(#,n)<c,logn forallnEN. 


It is not known if s(#,n)~clogn for some positive constant c is possible. 
Another attractive problem is whether s(#,n) > 1 for all n © N implies (of, n) is 
unbounded. 

The following result due to Erdés and Fuchs in 1956 involves analytic methods. 
Let r(,n) denote the total number of solutions of a+a’=n with a,a' EX, 
where now, we do not require a <a’. 


Theorem 4.13. /f c>0 and 4 CN, then 


> r(x, n)=cN + o(N"“(log N)~"’?) 


nN 


cannot hold. 


As a corollary one can get an 2-result for the error term in the circle problem; 
that is, for the quantity 


ar — > 1. 


G@pjez? 
G24 j2<r? 


Recently, Montgomery and Vaughan (1990) have shown that the factor . 
(log N < ? in Theorem 4.13 may be dropped. See Halberstam and Roth (1983, 
including the footnote on p. 106) and Erdés (1956) for further information. 

The results and problems discussed so far have been extended and generalized 
by many people. For further references, see Hayashi (1981) and Erdés et al. 
(1986). 
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4.2. The arithmetic structure of sum sets and difference sets 


In 1934, Erdés and Turan proved the following theorem. 


Theorem 4.14. There is a positive constant c such that if {a,, a,,...,a,) CN, 
then 


»( Il (a,+4,)) =clogn. 


l<i,jen 


Thus the set of integers with prime factors coming from a small set cannot 
contain a subset of the form 2. with || large. 

Since then many papers have been written on the arithmetic properties of sum 
sets + B® and difference sets # — xf. For example, in 1978-79, Fiirstenberg and 
Sarkézy (independently) studied the solvability of the equation a — a’ =n’ for a, 
a’ © 4. In these papers written on “hybrid” problems (i.e., problems involving 
both general sets and special sets of integers), combinatorial, analytic, and 
ergodic methods are used. See Sark6zy (1989) for a survey of these results; see 
also Pintz et al. (1988) and Gydry et al. (1988) for further recent efforts. 

Recently it has been proved that if of, @ are “dense’’, then (i) the sum set 
x# +B contains an element a + b all whose prime factors are “small” (Balog and 
Sarkézy), (ii) there is a sum a +b which is ‘‘almost prime” in the strong sense 
that it is the product of a prime and a bounded integer (Sark6ézy and Stewart), 
(iii) there is a sum a + b with “many” distinct prime factors (Erdés et al. 1993), 
(iv) the members of # + 8, weighted with respect to the number of their 
representations, behave like normal integers for the function v(m) (Erdés et al. 
1987). This last result has been sharpened and extended in various directions by 
Elliott and Sarkézy and by Tenenbaum. Several authors have studied the 
structure of the difference set —.# for “dense” sets ; see Stewart and 
Tijdeman (1983) for references. 


4.3. Complete sets and subset sums 


For references, see Erdés and Graham (1980, pp. 53-60). 

A set CN is said to be complete if every large integer can be written as the 
sum of the elements in some finite subset of &. For example, the powers of 2 
form a complete set. It is less well known that the squares form a complete 
set - indeed, every integer greater than 128 can be represented as a sum of distinct 
squares. 

Must a dense enough set be complete? Improving on a result of Erdés, 
Folkman proved in 1966 the following result. 


Theorem 4.15. Suppose of CN is such that A(x) >x''?** for some ¢>0 and all 


large x. Suppose further that the set of subset sums from sf contains a complete 
residue system (mod m) for every mEN. Then & is complete. 
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A somewhat stronger statement, that is still open, was conjectured by Erdés in 
1962. In some sense, though, Theorem 4.15 is best possible, for in 1960 Cassels 
showed that “1/2” cannot be replaced with any smaller number in the theorem. 
By using Theorem 3.11, Sark6zy (1989/1994) proved a finite analog of Theorem 
4.15 which has many applications. (Slightly later, Freiman independently proved 
nearly the same theorem.) 

Let s(x) be the largest number of subsets of «f with the same sum. Improving 
on a result of Erdds and Moser, Sarkézy and Szemerédi proved (by using 
Sperner’s theorem) that if of is a set of positive reals with ||=7, then 
s(f) <¢2"n~*'? for some absolute constant c. This result has since been improved 
by Nicolas, Beck, van Lint, and Stanley. In particular, Stanley showed that s() 
is maximized over all sets of of n positive reals when ©& is an arithmetic 
progression and in this case the most popular subset sum is a subset sum closest to 
the average of the members of s. 

How dense can # C {1, 2,..., N} be if the subset sums from are distinct? 
One of Erdés’s first conjectures is that 


log N 
log 2 


max || = + O(1). 


Towards this conjecture, Erdés and Moser proved that 


log N + log log N 
log 2 2 log 2 

In 1969, Conway and Guy gave the lower bound (log N)/(log 2) + 2 for N = 2* 
which is just 1 better than the trivial example of taking the powers of 2. However, 


no one has succeeded in giving any example that is 2 better than the trivial. 
Ryavec showed that 


1 
Ss —<2 


aca & 


max |.4|< + O(1). 


if & is a finite set of natural numbers with distinct subset sums. 

In 1969, Erdés and Heilbronn showed that if »f C Z/p, where p is prime and 
| 8 | > 3V6p, then the subset sums of & cover all of Z/p. Olson improved on the 
constant 3V6 and Szemerédi (1970) extended the result to arbitrary finite Abelian 
groups. Sarkdzy (1989/1994) studied the case when the elements of are not 
necessarily distinct. 


5. Multiplicative problems 


5.1. Primitive sets 


For many references, see chapter V of Halberstam and Roth (1983) and Hall and 
Tenenbaum (1988). 
Say % CN has the property that whenever b € &, every positive multiple of b 
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is in #. Examples of sets with this property include the set of even natural 
numbers, the set of composites, and the set of natural numbers with a divisor 
between 100 and 200. An example with historic interest is the set of abundant 
numbers, i.e., natural numbers n such that the sum of the positive divisors of n 
(other than 7) exceeds n. 

More generally, if & CN is arbitrary, then the set @(x) of all positive 
multiples of members of # evidently has the property that if b € @(#), then all 
positive multiples of b are in B(x). Is every set @ with this property in the form 
B(x) for some &? The answer is clearly yes. In fact, if @ has this property and 
is the set of primitive elements of Z, i.e., members of @ not divisible by any 
other members of @, then @ = B(x). 

We say a set & CN is primitive if no member of & divides another member of 
4. Thus # — &B(sZ) is a one-to-one correspondence between primitive sets and 
sets B CN such that kb € SB whenever DE &. 

The set of prime numbers is a primitive set. More generally, the set of n EN 
with (n) = & is primitive for any fixed k EN. The case k = 2 is the primitive set 
for the set of composites. If NEN, then 


Iy = {ni 3N<n<N} 


is primitive. Clearly any subset of a primitive set is primitive. 
These considerations suggest several questions: 
(i) Is ¥, the most numerous primitive subset of {1, 2,...,N}? 
(ii) Must a primitive set have asymptotic density of 0? 
(iii) Must a set 2(¢) have asymptotic density? 
At least one of these questions is fairly easy as is seen in the following result. 


Proposition 5.1. %,, is the most numerous primitive subset of {1, 2,..., Ny}. 


Proof. For each n EN, let n’ denote the largest odd divisor of a. Clearly if 
m'=n' and m<n, then m|n. Thus the mapping n—> 7’ must be one-to-one on 
any primitive set of. There are exactly N— |4N] odd integers in {1, 2,...,N}, 
so no primitive subset of {1, 2,...,N} can have more than N— |4N] =|4,| 
members. O 


The sets ¥, can be essentially glued together to get a counter-example to 
question (ii). The key tool is the following, perhaps surprising, result of Erdés 
from 1935. We first note that, concerning question (iii), if « CN is finite, then 
clearly d(@(# )) exists. 


Theorem 5.2. Let ©, =d(B(¥,)); that is, &, is the asymptotic density of the 
integers with a divisor in (4N, N). Then limy_... €y = 9. 


This result can be proved by consideration of the “normal” number of prime 
factors below N of a random integer. We still do not have an asymptotic formula 
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for ey; the best results to date are due to Tenenbaum. Finally, it should be 
remarked that Erdés actually proved a stronger version of Theorem 5.2 where 
(4N, NJ is replaced by (N *°n NJ and 8,—0 arbitrarily slowly. 

The following result is due to Besicovitch in 1934. 


Theorem 5.3. For each ¢>0, there is a primitive set @ with d(s@)>1-. 


Proof. We use Theorem 5.2; ahe original proof of Besicovitch used a weaker form 
of this result. Let N, <N,<--- be a sequence of natural numbers with ey < 
2 ee and such that the saab of integers up to N, divisible by some member of 
Iy U's USy_. is at most 


(2&y, +++ +2ey, )N, <eN,. (5.4) 


Thus if we denote by #, n, the set of members of 4, not divisible by any member 
of $y Urs-UGy, , then IF) > (5 — &)N,. 
Further, “if is the union of all Fy, , then x is evidently primitive and 
A(N:) > |F yl (4 — EDN, 


2 


for each i. Thus d(#)>1—«e. O 


The set just constructed (with 0<¢< {) also answers question (iii) in the 
negative. Indeed, 
d(B(f)) =d(d)z1-e >t, 
and since the number of members of %(<) up to 4N, is by (5.4) at most eN,, we 
have 


d(B(A)) <2e <4. 


Thus @() does not possess asymptotic density. 
It is a simple exercise to show the following. 


Proposition 5.5. If of CN is such that Deg 1/a<~™, then d(B(f)) exists. 


It is a slightly more difficult exercise to show that if furthermore 1 3, then 
d(B(xf)) <1; see Pomerance and Sarkézy (1988). 

As we have seen, a primitive set sf need not have density 0. However, if one 
considers a weaker density than asymptotic density, namely logarithmic density, 
then every primitive set has density 0. The logarithmic density of a set x CN is 
defined by 


5(<d) = lim es p> -, (5.6) 


He 
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should this limit exist. It is not hard to show that if d(s¢) exists, then so does 5(s) 
and it is equal to d(x). 
In 1935, Erdés proved the following result. 


Theorem 5.7. There is an absolute constant c such that 


1 


Glooa 
ae T10R a 


for every primitive set 3 CN except 4 = {1}. 


(Erd6és conjectures that the maximal value of the sum in this theorem is 
attained when # is the set of primes.) It follows immediately that 6(%) =0 for 
every primitive set sf. Since d(s) <6() always (this is easy), another corollary 
is that the lower asymptotic density of any primitive set is 0. 

In 1937, Davenport and Erdés proved the following result. 


Theorem 5.8. (i) [f & CN has positive upper logarithmic density, then x4 contains 
an infinite sequence a, <a,<-+-- with a,la,,, fori=1,2,.... 
(ii) For any & CN, 5(A(aP)) exists. 


Of course, by “‘upper logarithmic density” we mean that the lim in (5.6) is 
replaced with limsup. These results really underline the fact that logarithmic 
density is the ‘correct’? measure when considering primitive sets and sets of 
multiples. 

How large can Ly 1/a be for a primitive set # C {1, 2,...,N}? This question 
is partially answered by the following result of Behrend from 1935. 


Theorem 5.9. There is a positive constant 'c, such that if N =3, then 
1 Z 

> @ ~ei(log N)(log log N) i 

aes 
for any primitive set # C {1, 2,..., N}. 
Proof. Let s(u) denote the cardinality of the largest primitive set made up of 
divisors of u. We begin our proof by using Sperner’s theorem (see chapter 24) to 
compute s(u) when u is squarefree. Indeed, if u is squarefree and »(u) =k, then 


each divisor of u corresponds to a subset of the k primes of u. Thus Sperner’s 
theorem immediately gives 


= (i s)) ; (5.10) 


We now show the connection of s(u) to our problem. Let u‘ denote the largest 
squarefree divisor of u. If sf C {1, 2,..., N} is primitive and every member of 
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is squarefree, then 


Yowy2E Di-d Di->d [4 


usNn “sn alu acd ueNn cen L@ 
act alu 
1 
>N D> =-N. 
atest a 


Thus it will suffice to prove that 


> s(u’) <cN(log N)(log log N)'? (5.11) 


“st 


for some constant c and all N23. 
To do this, we apply Stirling’s formula to (5.10), getting 


s(u) <2” (v(u))"'? ,_ wsquarefree 
for some constant c. Since v(u) = »(u’), we thus have (with / = [log log NJ) 


> s(u')<c p> 2”) (y(u)) 7”? 


4ueNn 


=¢ 5 2° (vu)? + = 2° (y(u)) 1 


uN 
v(ujsl atalel 
<c:2'N+el7'? 5) 2" 
usN ; 
<cN(log N)'°8? + cl"? S su), (5.12) 


uN 


where 7(u) is defined in section 1. The final sum in (5.12) is easily majorized. We 
have 


LL tu= > 21= > > 1 oe (N/d| 


wENn u=N dlu aden ueNn 
dlu 


<N >, 1/d<N(logN+1). 
den 


Putting this estimate in (5.12) gives (5.11) and thus the theorem for the case when 
every member of # is squarefree. 

Now suppose  C {1, 2,...,N} is an arbitrary primitive set. For each AEN, 
let sé, denote the set of a Eo with largest square divisor being k’. Thus {a/k’: 
a Eo.) is a primitive set of squarefree numbers not exceeding N/k’. Thus 


? 


= 1 
= c(log N)(log log N)~'’? ~ = 
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by our theorem for primitive sets of squarefree numbers. Since )) 1/k? is 
convergent, the general case of our theorem follows. O 


That Theorem 5.9 is essentially best possible was shown by Pillai in 1939. 
Namely, Pillai showed that there is a positive constant c, such that for each large 
N there is a primitive set # C{1, 2,...,N} with 


1 = 
yy ] > &2(log N)(log log N)~"”?. (5.13) 


The gap between Theorem 5.9 and eq. (5.13) was eliminated by Erdés, Sarkdzy 
and Szemerédi in 1967. They show that if L(N) is the maximum value of )) alla 
for all primitive sets #@ C {1, 2,...,N}, then 


L(N) = ((2%)7'”? + 0(1))(log N)(log log N)7'” (5.14) 


The lower bound in (5.14) had already been shown by Erdés in 1948 by taking 
to be the set of mE {1, 2,...,N} with O(n) = [log log NI. 
Interestingly, if is an infinite primitive set, then we have 


S77 olllog N)(log log N)~'”?) (5.15) 
a@eNn 


and no statement stronger than (5.15) is true, a result of Erddés, Szemerédi and 
Sark6zy in 1967. For references, see Erdés et al. (1970). 


5.2. Product sets and other multiplicative problems 


If , BCN, we denote by 8 the set of products ab where a € 4, bE B. Also 
we write ¢” for #4, > for SA, etc. 
In 1960, Erdés proved the following remarkable theorem. 


Theorem 5.16. if = {1, 2,...,N}, then |#?|=N?(logN) 7*°), where a = 
1 — (1 + log log 2)/log 2. 


Thus there are only o(N’) distinct integers in the N x N multiplication table! 
This seeming paradox is explained by the fact that a “normal product” of integers 
a,, a,<N has about 2loglogN prime factors, which is quite abnormal for 
integers below N’. The idea of looking at the normal number of prime factors of 
an integer is often fruitful; in fact this idea was mentioned above in connection 
with Theorem 5.2 whose proof is actually quite similar to the proof of Theorem 
5.16. 

What can one say about || if « and @ are just “dense” subjects of {1, 
2,...,N}? This question is addressed in a recent paper of Pomerance and 
Sarkézy (1990). 


Theorem 5.17. If ¢ >0 is arbitrary and x, BC {1,2,...,N} with |#|, |B|> en, 
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then |B | > N?(log Ny)? 18 2+0(1) Moreover, there is a set Sy CG {1,2,...,N} 
with || ~N and i ea — N7(log Ny? 62400), 


We say a set & CN is a multiplicative basis of order k if xf" =N. See Wirsing 
(1957) for a study of density properties of multiplicative bases. 

Following Theorem 4,12 we asked the Sidon problem: if of is a basis (additive) 
of order 2, must s(n), the number of representations of n as a, +a, with a,, 
a, ©, be unbounded? The multiplicative analog of this problem was solved by 
Erdés in 1965 (see Erdés and Graham 1980, p. 100). 


Theorem 5.18. [f # is a multiplicative basis of order 2, then t(n), the number of 
representations of n as a,a, with a,, a, E 4, must be unbounded. 


How large a set # C {1, 2,..., MN} can we choose with all of the products a,a, 
(with a,, @,€o, a,<a,) distinct? If k(N) denotes the maximal cardinality of 
such a set , then Erdés has shown (with graph-theoretic tools) that 


n(N) +¢,N*‘/log??N <k(N) < 2(N) + c,N**/log??N 


for certain positive constants c,, c, and all large N. Erdés and Posa have 
considered the analogous problem where all of the subset products from # are 
distinct. See Erdés and Graham (1980, p. 98) for references and a proof. 

In 1976, Szemerédi proved the following attractive result (see Erdés and 
Graham 1980, pp. 98-99). 


Theorem 5.19. There is a constant c such that if %, 8C{1, 2,...,N} and 
|. 9B| = |.sd||B|, then |tB|<cN/log(N + 1). 


In contrast, it is easy to construct sets @, @ CN such that each n EN has a 
unique representation n = ab with a € 4, bE B. For example, we may choose # 
to be the powers of 2 and & to be the odd natural numbers, or more generally, # 
the natural numbers all of whose primes come from ¥, and @ the natural 
numbers all of whose primes come from ¥,, where ¥,U, is an arbitrary 
partition of the set of primes. Erd6s, Saffari, Vaughan, and Daboussi have studied 
this problem. 

In 1975, Erdés and Selfridge (see Erdds and Graham 1980, p. 66) showed the 
following striking result. 


Theorem 5.20. The product of two or more consecutive positive integers is never a 
non-trivial power, 


There are many other problems and results concerning blocks of consecutive 
integers in Erdés and Graham (1980, section 8). 
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6. Van der Waerden’s theorem and generalizations 


For many references, see chapters 1 and 2 of Graham et al. (1980), and section 2 
of Erdés and Graham (1980). 

How much of the structure of the natural numbers must be preserved in a 
“dense” subset? In 1927, B. L. van der Waerden showed that if the natural 
numbers are partitioned into two subsets, then one subset contains arbitrarily long 
arithmetic progressions. In one sense, this theorem is best possible, since it is an 
easy exercise to partition the natural numbers into two subsets, neither of which 
contains an infinite arithmetic progression. But the theorem still leaves us 
wondering about our opening question, which we now repeat more specifically. 
How dense must a subset of the natural numbers be for it to contain arbitrarily 
long arithmetic progressions? 

In particular, in 1936 Erdos and Turan conjectured that if sf CN has positive 
upper asymptotic density, then s contains arbitrarily long APs (we abbreviate 
“arithmetic progression” as AP). In 1952, Roth used the Hardy—Littlewood circle 
method from analytic number theory to prove the Erdés—Turan conjecture for 
three-term APs. In 1969, via a combinatorial argument, Szemerédi showed the 
conjecture for four-term APs, and in 1974, in what must be one of the most 
complex proofs in combinatorial number theory, he proved the complete 
conjecture. A few years later, Fiirstenberg, using ergodic theory, gave another 
proof (also complicated) of what is now known as Szemerédi’s theorem. 

How much can Szemerédi’s theorem be improved? An old conjecture of Erdés 
is that if of CN satisfies only the weaker hypothesis 


rather than positive upper asymptotic density, then this is enough to force # to 
have arbitrarily long APs. 

A corollary to this conjecture of Erdés is that the set of primes would contain 
arbitrarily long APs. It is unclear, though, that one should think of this prime 
number problem in terms of the Erdés conjecture. That is, Erdés is suggesting 
that the set of prime numbers contains arbitrarily long APs only because the 
prime numbers are fairly numerous and not because of any special properties of 
the prime numbers. This technique of generalizing a hard problem to put it in 
proper perspective is of course an often-successful trick in mathematics. However, 
the only progress we have had so far on showing the set of primes contains 
arbitrarily long APs is through intrinsic properties of the primes. 

For example, Chudakov, Estermann, and van der Corput independently in 
1937-38 used the circle method to show the following strengthening of Corollary 
3.1: for any A>0, the number of even integers up to x not the sum of two 
distinct primes is O(x/log“x). From this we can prove the following result. 


Corollary 6.1. The set of primes contains infinitely many three-term APs. 
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Proof. By the prime number theorem, the number of even integers up to x of the 
form 2p with p prime is ~x/(2log x). Thus by the above-mentioned theorem, 
most of these numbers 2p can be represented as g +r, where g <r are primes. 
But then q, p, r form a three-term AP of prime numbers. O 


It is still unknown if there are infinitely many four-term APs of primes. The 
longest AP of primes ever found has length 22, a result of Pritchard et al. (1995), 

We now look at several “equivalent” formulations of van der Waerden’s 
theorem. The reason for the quotation marks is that logically, all theorems are 
equivalent. Here we mean it in the subjective sense that the proofs of inter- 
dependence are simple and fairly transparent. 


Theorem 6.2. The following statements are equivalent: 

(i) If N is partitioned into two subsets, then one subset contains arbitrarily long 
APs, 

(ii) For each k EN, there is a number W(k) such that if {1, 2,..., W(k)} is 
partitioned into two subsets, then one subset contains a k-term AP. 

(iii) For each k, rEN, there is a number W(k,r) such that if {1, 2,..., 
W(k, r)} ts partitioned into r subsets, then one subset contains a k-term AP. 

(iv) For each rEN, if N is partitioned into r subsets, then one subset contains 
arbitrarily long APs. 

(v) Uf {a,} is an infinite subsequence of N with {a, ,, — 4,} bounded, then {a,} 
contains arbitrarily long APs. 


Proof. We first show that (i) > (ii), which is probably the most difficult of the 
implications. It is an example of the “‘compactness principle” in Ramsey theory 
(cf. chapter 42). Suppose k is such that W(k) does not exist. Thus for every N 
there is a subset of, of {1, 2,...,N} such that neither sf, nor its complement 
contains a k-term AP. Obviously there is an infinite subsequence N,,<N,.<-°* 
of N such that 


$25 hy, AVY = ty Aas 


that is, either 1 is in each btu, or 1 is in no 4 ,; By passing to an infinite 
subsequence N,,<N.,, <-°: of {N,} we have 


F,i= dy 1 {1,2} = ty NEL 2s 


and FY, CY,. eonnnuing: in this fashion we find ¥, C ¥,C---CN and an infinite 
subsequence N,<N,<::- of N (W,=N,,, N2= = Nous ach with J = 4, 9 {1, 
2,..., J} for ech Thus if f= U &, then neither f nor N\SF contairis ys term 
AP, contradicting (i). , 
Now we show (ii) > (iii). We do this by induction on r, (ii) being the 
statement for r = 2 (and the case r = 1 being trivial). Suppose W(k, r) exists for all 
kK for some fixed r22. Say {1, 2,...,N} is partitioned into r+1 sets 
4,,...,8.,,. If N2W(, vr), then one of 4, U,, &,...,%,,, contains an 
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i-term AP, call it 8. If BC #, US, then 8,NB, AH, is a partition of B 
into two parts. Thus if / = W(k, 2), then one part contains a k-term AP. Since 
clearly W(k, 2) =k, if @ is contained in one of ,,...,%,,,, then one of these 
sets contains a k-term AP. Thus, not only have we shown that W(k, r +1) exists, 
but we have shown that the least choice for W(k, 7 + 1) is at most W(W(k, 2), r). 

It is obvious that (iii) > (iv) > (i). 

It is also clear that (v) > (i), since if Z= #4, U#,, 4, NA, =9, and KEN, 
then either x, contains k consecutive integers or the maximal gap between 
consecutive members of #, is at most k. 

Finally we show (iv) > (v), which will complete our proof. Suppose of C N has 
maximal gap r between consecutive members. Let #% = + {i} for i=0, 
1,...,7—1. Then #,U---U_, =N. Although these sets may not be disjoint, 
(iv) still implies that one of them contains a k-term AP. Then so does #, = &. 

0 


Van der Waerden’s theorem, as we originally stated it, is statement (i) of 
Theorem 6.2. We now give a proof of van der Waerden’s theorem. 

We begin with two definitions. If x,, x,...,%, and x;, %3,...,X%,, are two 
sequences where each term is in {0, 1, ... ,2}, we say {x,} is l-equivalent to {x;} 
if either / does not appear in either sequence or there is some k with x; = x; for 
i<k and each x,, x; </ for j >k. That is, {x;} and {x;} agree at least up to the 
last appearance of /. 

Next, we define the statement S(/,m) (where 1,mEN) as the following 
assertion: for each rEN there is a number M(/, m, r) such that whenever (1, 
2,...,N(l, m, r)} is partitioned into r parts #,,...,%, there exist a, 
d,,...,d,, EN such that 


atid, +--+ +d,)<N(l,m,r) (6.3) 
and such that whenever {x;} and {x/} are m-term sequences from {0, 1,..., J} 
that are /-equivalent, a+x,d,+-:-+x,d, andat+xjd,+-:++x/d,, are in the 


same ,. 

Note that the condition (6.3) guarantees that a + i x,d, and a+ )) x/d, are in 
{1, 2,...,NQ, m, nr}. 

Note also that the assertion S(/, 1) is essentially the same as statement (iii) of 
Theorem 6.2, Indeed, two integers x, x'€ {0, 1,...,/}, considered as 1-term 
sequences, are /-equivalent if and only if both x, x’</ or x =x’ =1. The assertion 
S(i, 1) says that there is some number M(/, 1, r) such that if {1, 2,..., N@, 1, r)} 
is partitioned into 7 subsets, then there are positive integers a, d with a, 
at+d,...,a+(/—1)d all in one of the parts. That is, one part contains an /-term 
AP. [Note that S(/, 1) also carries the extra stipulation, not found in statement 
(ili) of Theorem 6.2, that a + /d < NJ, 1, r).] Thus van der Waerden’s theorem 
will follow from the following result. 


Theorem 6.4. For each |, mEN, the assertion S(l, m) is a theorem. 
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Proof. Our plan is as follows. First we show that if S(/,k) is a theorem for 
k=1,...,m, then so too is S(/, m+ 1). Next we show that if S(/, m) is a theorem 
for all ym, then so too is S(/ + 1, 1). Thus our theorem will follow from this double 
induction and the fact that S(1, 1) is trivially true. 

Suppose /, mEN and S(/,k) is a theorem for k=1,...,m. Let rEN be 
arbitrary, let M = M(/, m, r), M'=N(, 1, r™ ). We shall show that we may choose 
Nii, m+1,r)=MM". Let &,,..., %, bea partition of {1, 2,...,MM’} and let 
C be the function that assigns to (© {1, 2,...,MM’} the number /€ {1, 
2,...57} with £€ of. 

Consider now the matrix 


C(1) C(2) vs+  C(M) 
ae C(M +1) C(M +2) C(2M) 
C((M'-1)M+1) C(M'-1)M+2) +. C(M'M) 


Each row of A is one of the r” M-term sequences from {1, 2,...,r}. We now 
partition {1, 2,...,M’} into r™ subsets where i, j are in the same subset if and 
only if the ith row and jth row of A are identical. Since S(/, 1) is true and by our 
choice of M’, one of these subsets contains an /-term arithmetic progression 5, 
b+d,...,b+(i-1)d, where b, d are positive integers with b + Id < M’. That is, 
rows b + id for i=1,...,/—1 of A are identical. 

We now apply S(/, m) to {(6 —-1)M +1, (6 -1)M+2,...,bM} (a translate of 
{1, 2,...,M}) and the partition we already have of {1, 2,...,MM’} restricted 
to this subset. Thus there are natural numbers a, d,,...,d,, such that 

(1) a= (b-1)M+1,at+ld,+--:+4,,)<6M; 

(2) whenever {x,}, {x;} are {-equivalent, m-term sequences from {0, 
1,...,0}, then Ca+ Lix,d,) = Ca t+ Yixid,). 

Let d,,,, =dM. To prove assertion S(/, m+ 1), we will show 

(V') at+l(d,+-:++d,.,)<MM'; 

(2") whenever {x;}, {x;} are /-equivalent, (m+ 1)-term sequences from {0, 
1,...,0}, then Ca+ Vix,d,) =Ca+ Yi x/d,). 

For (1’), note that the left-hand side is 


at+i(d,+---+d,)+ld,,,<bM+ldM 
by (1). But we noted above that b + ld <M’, so we have (1'). 


For (2’) we may clearly assume that the /-equivalent sequences {x;} and {x; } 
are not identical. Thus x,,,,, %4, </. Let 


jza-(b-1)M+D x4,, 9 j'=a-(6-1)M+Dxid,. (6.5) 
I 1 


Then j, j’€ {1, 2,...,M}. We look now at columns j and j’ of matrix A and how 
they intersect rows b, b+ d,...,b+(t—1)d. Of course, column j is constant on 
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these rows, as is column j’. But from (6.5), 
(b-1)Mt+j=a+Dxd,, (b-1)Mt+j'=a+>x!'d,, 
1 1 


and since x,,...,%,, is /-equivalent to x|,...,x,,, (2) implies C((b — 1)M +j) = 
C((b — 1)M + j’). Thus the constant value on the / special rows of column j is the 
same constant value as in column j’. Note that one of these rows is b + x,,,,d, 
whose jth entry is 


m+) 


C((b +X_4,d-1)M+f)= c(a+ > x4,) 
1 
by (6.5). Another special row is b + x/,,,d, whose j’th entry is 
m+} 
C(b +x!,,,d-1)M+j")= c(a +> x/d,) 
1 


Thus (2') holds and we have proved S(/, m +1). 

For our second induction, assume / EN and S(/, m) is a theorem for all m EN. 
Choose r EN, r = 2 (since we clearly can take N(i+ 1, 1, 1) =/+2). Let N=N(l 
r, r). We shall show we may take N(/ + 1, 1, r) =2N. Indeed take any partition of 
{1, 2,...,2N} into r subsets #,,...,%,. Let a, d,,...,d,EN be such that 
a+Ud,+---+d,)<N and such that whenever {x,}, {x;} are /-equivalent, 
m-term sequences from {0, 1,...,/}, a+ x,d, is in the same A, asat Yi xid,. 

Since r =2, we have ('31)>r, so there are integers u<v with OSu<u<r 
and such that 


a+D ld, at+Qld, 
i=l i=] 
are in the same subset from #,,..., %,, say %. Let 
a’=at+>ld,, d’'= > d,. 
é=1 é=utl 


We now claim that the (/ + 1)-term arithmetic progression a’, a’ + d',...,a’ +Id' 
lies wholly in sf, and that a’ + (/ + 1)d'<2N. Indeed, we know already that a’ 
and a’ + /d’ both lie in sf, and if x € {1, 2,...,f- 1}, then 


iccgl Oars OP and! AG rd, Kya eg ts Oe 2g OF 
——ee ee ee ee ee 
u r-u u v-u r-v 


are l-equivalent, so that a’ and a'+ xd’ both lie in &%. The assertion about 
a’ + (i + 1)d’ is trivial since a’ + Id’ <a+U(d,++:++d,)<N. Thus S(i+ 1, 1) is 
proved, as is the theorem. O 


The above proof is a “‘fleshed-out” version of the short proof presented in 
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Graham et al. (1980, p. 32). It shows that the function W(k,r) defined in 
statement (iii) of Theorem 6.2 is recursive, but it does not show it is primitive 
recursive. This has recently been done by Shelah (1988), thus solving a problem 
that had been outstanding for a long time. 

For the record we now formally state Szemerédi’s theorem. 


Theorem 6.6. If @ CN has positive upper asymptotic density, then & contains 
arbitrarily long APs. 


As van der Waerden’s theorem contains several essentially equivalent forms, 
the following result which is superficially stronger than Szemerédi’s theorem is 
easily proved to follow from it. 


Corollary 6.7. Let k € N and ¢ >0. For each sufficiently large N, depending on the 
choice of k and e, if $C {1,2,...,N)} with |A|>eN, then & contains a k-term 
AP. 


Proof. We may assume & = 3. If the corollary is untrue, then there is an infinite 
sequence N,, N;,... and sets fy C {1,...,N,} with |f,| > eN, and sf, does not 
contain any k-term AP. Let By, = ty + QN}, so that (By |>eN,, By C {2N, + 
,3N,} and &, does not contain’ any k-term AP. By passing to an infinite 
subsequence if necessary, we may assume N,,, 25N, for i=1,2,.... Let 


A= U By, 
a1 
Then sf has upper asymptotic density at least +e. Furthermore, # contains no 
k-term AP, since no %, _ does and since # does not contain any 3-term AP that is 
not wholly in some &, . However, Theorem 6.6 denies the existence of any such 
set of, which proves the corollary. O 


An interesting and still unsolved problem that is perhaps connected with these 
considerations is the following old problem of Erdds. Is there a sequence e,, of 1’s 
and —1’s with 


N 
ak, N)= > &, 
n=] 


bounded for all k, NEN? Another form of this problem asks if g(1,N) can be 
bounded for all N for some sequence e, of 1’s and —1’s that is also a 
multiplicative function. 

The integers 


4030, 4131, 4232, 4333, 4434, 4535, 4636, 4737, 4838, 4939 (6.8) 


obviously form an AP of length 10. We generalize this idea as follows. Let cy 


denote the set of N-term sequences from {0, 1,...,#— 1}. Then ¢ distinct points 
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P,, P,,...,P, in C’ are said to form a line if the sequence of jth terms Pi, 
P),...,P' is either constant or is 0, 1,...,¢— 1. Thus the guadruplets formed 
by the digits of the integers in (6.8), viewed as members of C{,, form a line. The 
following result is due to Hales and Jewett in 1963. 


Theorem 6.9. For each r,t EN, there is a number HI(r, t) such that if N= HI(r, 0) 
and C’ is partitioned into r subsets, then one subset contains a line. 


By writing the integers below 1” in base-t notation, we see that if {0, 
1,...,t%-1} is partitioned into r subsets, where N > HJ(r, ¢), then one subset 
contains a ft-term AP. That is, the Hales-Jewett theorem implies van der 
Waerden’s theorem. In fact, Shelah’s recent result mentioned above, that W(k, r) 
is primitive recursive, was obtained by proving the stronger theorem that the 
function HJ(r, t) is primitive recursive. 

In the 1930s, Gallai proved the following generalization of van der Waerden’s 
theorem as a corollary of the latter. It is also possible to prove this result as a 
simple corollary of the Hales-Jewett theorem. 


Theorem 6.10. For all u, r, kEN, if Z” is partitioned into r subsets, then one 
subset contains a set of the form B“ where B CZ is an AP of length k. 


By &” we mean of course all the points of Z“ whose coordinates lie in &. 

It is natural to ask if Theorems 6.9 and 6.10 can be generalized to “‘dense”’ sets. 
For Hales—Jewett, the following theorem was recently announced by Furstenberg 
in the Abstracts of the 1990 International Congress of Mathematicians. 


Theorem 6.11. For each tEN, ¢>0, if N is sufficiently large and & CC™ with 
|sf|> et”, then sf contains a line. 


The proof is by ergodic methods. The following result is a Szemerédi-type 
analog of Gallai’s theorem 6.10. It was first proved by Furstenberg and Katznel- 
son in 1978. However, it now may be viewed as a corollary of Theorem 6.11. 


Theorem 6.12. For each k,u€N and each «>0, if N is sufficiently large and 
&C[-N,N]“NZ" with |sf|>eN“, then & contains a subset B“ where B CZ is 
an AP of length k. 


Another question one may attack is that of sequences of lattice points in Z*. If 
a sequence v,, v,,... in Z” has “small gaps’’, what may be said? We should not 
expect to find long APs as can be seen from the following result of Dekking 
(1979). 


Theorem 6.13. There is an infinite sequence v,, v,,... in Z’ with each 0;,, — 0; = 
(0, 1) or (1, 0) and such that no five of the v’s form an AP of vectors. 
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However, we may expect a large subset to be collinear, at least for Z’ (we mean 
“collinear” in the ordinary, geometric sense). The following result is due to 
Ramsey and Gerver (1979). 


Theorem 6.14. [f v,, v>,.. . is an infinite sequence in Z” with |v,,, — v,| bounded, 
then for every k there are k of the v’s which are collinear. 


As is also shown by Ramsey and Gerver, this result is not true for Z’. 

Theorem 6.14 bears a similarity in appearance to statement (v) of Theorem 6.2 
and thus may be thought of as an analog of van der Waerden’s theorem. The 
following result of Pomerance (1980) would thus be a Szemerédi-analog. 


Theorem 6.15. /f v,, v>,... is an infinite sequence in Z* with 


1 N 
lim inf > |v,,,-v,|/<, 
mint yy D lee, 
then for every k there are k of the v’s that are collinear. 


An old result of Schur says that if N is partitioned into finitely many classes, 
then one class must contain infinitely many triples x, y, z with x + y =z. This 
result has been starting point of many interesting Ramsey-type problems on the 
integers. One particularly interesting result is due to Hindman in 1974 who 
showed that if N is partitioned into finitely many classes, then one class contains 
an infinite set such that all finite subset sums from x belong to the same class. 
For more on problems of this type, see chapter 3 of Graham et al. (1980). 


7. Miscellaneous problems 


As with combinatorics as a whole, combinatorial number theory is rich in 
attractive problems that defy precise classification. It is from the wealth of such 
problems in an area that we are sometimes able to discern patterns that become 
the broad outlines of a more mature branch of mathematics. In this section we 
take a very brief glimpse at a few of these problems. 


7.1. Covering congruences 


For references see section 3 of Erdés and Graham (1980) and section F13 of Guy 
(1994). 

In 1934, Romanoff posed the following problem. Can every sufficiently large 
odd integer be written as a sum of a power of 2 and a prime? In 1950, Erdés and 
van der Corput independently answered this problem in the negative by showing 
that in fact there is an infinite arithmetic progression of positive odd numbers m 
not of the form 2” + p. Erdés’s solution begins with the observation that every 
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integer n is in at least one of the following residue classes: 


O(mod2), 0(mod3), 1 (mod 4) , 


3(mod8),  7(mod12), 23 (mod 24). i) 


From this he was able to deduce that if n is a positive integer and m 
simultaneously satisfies 


m=1(mod 32), = m=2°(mod3), m =2° (mod 7) , 
m =2'(mod 5), (7.2) 
m=2°>(mod17), =m =27(mod13), =m =2” (mod 241), 


then m is odd and not representable as 2” +p. Moreover, by the Chinese 
Remainder Theorem from elementary number theory, the set of integers m which 
satisfy all of the congruences in (7.2) form an infinite arithmetic progression with 
common difference 32-3-7-5-17- 14-241. (It is still not known if a positive odd 
integer m which is not representable as 2” +p must belong to an infinite 
arithmetic progression of such integers.) 

A finite system of residue classes, such as (7.1), which have distinct moduli and 
such that every integer belongs to at least one class, is called a covering system of 
congruences. Another example that is simpler than (7.1) (but that does not have 
relevance to the 2" + p problem) is 


0 (mod 2) , 0 (mod 3) , 3 (mod 4) , 1 (mod 6) , 5 (mod 12) . 
(7.3) 
In both (7.1) and (7.3) the least modulus is 2. The following problem of Erdés 


from 1950 is the major unsolved problem in the area. 


Problem 7.4. It is true that for every k there is a covering system of congruences 
with least modulus at least k? 


An example due to Choi has least modulus 20. There are numerous other 
problems and some results concerning covering systems of congruences. We 
mention two more problems, the first due to Selfridge, the second to Erdés. 


Problem 7.5. Is there a covering system of congruences with all moduli squarefree 
and greater than 2? 


Problem 7.6. Is there a covering system of congruences with all moduli odd and 
greater than 1? 


7.2. Graham’s conjecture 


If SCN, let F(P) = (a/b: a, bE FY}. Graham conjectured in 1970 that if ¥ is 
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finite, then 
F(P) Z F({1, 2, ... , |F| -— 1})-. 


That is, there are a, b € ¥ with a/(a, b)=|F|. This problem has attracted much 
attention and there have been numerous partial results. One of the easier ones is 
due to Szemerédi who has proved Graham’s conjecture in the case |¥|=p, a 
prime. Indeed, we may assume not every member of ¥ is divisible by p (if not, 
replace each a € # with a/p) so that there are two members a, b & ¥ with either 
a=b 0 (mod p) or a#b =0 (mod p). In either case, a/b € F({1, 2,..., p— 
1}). 

y ecently Graham’s conjecture was independently proved by Szegedy (1986) 
and Zaharescu (1987) for all sufficiently large values of |¥|. They were even able 
to describe those sets S with F(Y) = F({1, 2,...,|S|}). 


Theorem 7.7. There is some number n, such that ifn2=n,, SCN, |¥| =n, then 
F(Y) Z F(A, 2,...,2-1}). ff FY) CF({1, 2,...,}), then there is some 
KEN with either 


[Ral 

S = {k,2k,...,nk} or S= TDF 

In Cheng and Pomerance (1994) it is shown that we may take n, = 10°” and 

in Balasubramanian and Soundararajan (1995) it is shown that we may take 

n, =5. Note that the first claim in Theorem 7.7 is true for n <5, but the second 
claim fails for n = 4, since Y = {2, 3, 4, 6} has F(Y) = F({1, 2, 3, 4}). 


7.3. Perfect numbers — Wirsing’s theorem 


Let a(n) denote the sum of the positive divisors of n. If o(n) = 2n, then n is said 
to be perfect. The first few examples are 6, 28, and 496. It has been known since 
Euclid that if 2’ —1 is prime, then 2”~'(2? — 1) is perfect. The three examples 
just cited fit this formula with p = 2, 3, 5. In fact, it has been shown by Euler that 
every even perfect number comes from Euclid’s formula. The two big questions 
are: (1) are there infinitely many perfect numbers?, and (2) are there any odd 
perfect numbers? 

From the Euclid—Euler results, the first question is equivalent to the existence 
of infinitely many Mersenne primes, that is, primes of the form 2? — 1. This is a 
very hard problem about which very little is known. We know 33 values of p for 
which 2” — 1 is prime, the largest being p = 859 433. 

We are still far from solving the second problem too. Numerous partial results 
are known, however. One of the more interesting theorems in the subject is due 
to Wirsing (1959), a paper which extends earlier joint work with Hornfeck. 


Theorem 7.8. There are absolute constants co, X, such that if x =X, and a is any 
rational number, then the number of n <x with a(n) = an is at most x°0!'°8'°8* 
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Proof. We begin with the observation that o(m) is a multiplicative function 
[a(mn) = o(m)a(n) when (m, n) = 1]. Suppose & is given; write a in reduced form 
u/v. Suppose x is large, n <x, and o(n) = an. Write n = ab, where b is the largest 
divisor of all of whose prime factors p satisfy p <logx or p|v. Then ab is an 
integer. The idea of the proof is to use the equation 


a(n) = o(ab) = o(a)a(b) =a-ab. (7.9) 


The plan is to show that we must have o(b)/ ab, and use this to show that b just 
about determines a. 

Suppose the prime factorization of a is p®' p¥?--- p%*. Let 1 be the least integer 
2log x/log log x. Since each p, >logx and since aXn<x, we have 


B,+B&+-°°+8, <l, (7.10) 


so that, in particular, k </. Then 


oa) _7 “1 ee | 1 
<— =[la+e; $254 p, y<I1 (14544) 
i= i=] i 
< (s—4.)< (W374) <2 7.11 
eAP 1 Pi —1 oe (log x) —1 oy 


for x >x,. Thus for x =x, we have a|o(a) if and only if a = 1. Putting this into 
(7.9) we see that for x = x, we have either a = 1 or o(b) ab. In fact we get even 
more. If a’|a, (a', a/a')=1, and a’ <a, then applying (7.11) to a/a’' we have 
a(a'b)ta'- ab. 

Let us see how we can reconstruct the number a given only b and an ordered 
k-tuple (with k =0) of positive integers B,, B,,.-.., 8, satisfying (7.10). First if 
k =0, then a = 1, and we are done. So suppose k >0. Then o(b)/ ab (if this fails 
then there can be no a at all), so let p, be the least pame that divides a(b) to a 
higher power than it divides ab. If k=1, then a=p*}, so suppose k>1. Let 
b' = bp*'. Then as before we may assume o(b’)/ab’; let p, be the least rime 
that divides a(b’) toa pipher) power than it divides abe If k=2, thena=p; 1 p82. 
If k>2, we let b” = bp®! p& and conte as before. This procedure either 
terminates with an integer a= pit ps. ++ pbk Or proves no a can exist satisfying 
(7.9). If @ is constructed, it may or may not satisfy (7.9). But if some a satisfying 
(7.9) does exist, this procedure will find it. 

Thus for x 2x, the number of n <x satisfying a(n) = an is at most BC, where 
B is the number of b<x such that v|b and for every prime p in b we have 
p<logx or p|v and C is the number of ordered tuples of natural numbers 
satisfying (7.10). 

From elementary combinatorics we have C = 2’. 

Neje Mae we have B <B,B,B,, where B, is the number of b, <x of the form 
qi'qz? +++ q;' where 41, 4,-.., 4, are all st the primes in v exceeding log x and 
Yi. Yor--+>% are natural numbers, B, is the number of 6, <x such that every 
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prime in b, is in the interval (log’’*x, logx], and B, is the number of b, < 
divisible by no prime exceeding log sas 

An upper bound for B, is the number of sequences y,, y,..., ¥, of natural 
numbers such that 


ytyHto+y Sl. 


Thus B, <(,/,)<2'. 
The total number of prime factors in a choice for b, is at most (log x)/ 

log(log’’*x) <2/. Say the primes in (log’’“x, logx] are r,, r.,...57_- Then B, is 

at most the number of sequences 6,, 6,,...,6, of non-negative epee with 


8, +8,+°°°+8,<2!. 


Thus again using elementary combinatorics, we have 


B <(” fe <mtu 
7 eas RE TS is . 


However, m = 1(log x) — m(log’’“x), so that from the prime number theorem we 


have m ~ (log x)/log log x. Since we have yet to use such a “‘big gun’’, we could 
rely instead on the more elementary inequality (1.1). To be specific, we use 
a(z) <22/log z, which holds for all z > 1. Thus we have m < 21, so that B, <2“. 
(In fact, the inequality z(z)<2z/logz can be proved by a very easy argument 
involving binomial coefficients, but we suppress the details.) 

If p is a prime and p® divides some choice for b;, then p® <x so that B < (log 
x)/log 2. Thus B, is at most the number of ordered z(log’’*x)-tuples with each 
coordinate a non-negative integer at most (log x)/log 2. Thus 


314 


<(1 + (log x)/log 2)7"'8 Ve a a (log x)/log 2)'8" 4x < 2! 
for x FX. 
From the above, if x>x,, then B<B,B,B, <2". Since C=2', if we have 


x =x, =max{x,,x,}, then the number of 1 <x with o(n)=an is at most 2”, 
proving the theorem. O 


While it is clear that a smaller value of cy may be found from a more careful 
proof, it would be more interesting to replace cy with some function tending to 0, 
perhaps only in the special case a =2 corresponding to perfect numbers. As for 
lower bounds, we know of no a for which we can prove o(n) = an has infinitely 
many solutions. In fact, we cannot even prove that the number of solutions is 
unbounded as a@ varies. It is known that if for some a and k there are infinitely 
many solutions to o(”) = an with v(n) =k, then there are infinitely many even 
perfect numbers, a result due to Kanold in 1956. Pomerance (1977a) proved the 
following effective form of this theorem. 


Theorem 7.12. For any a and k there is an effectively computable constant N(a, k) 
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such that if n> N(a, k), a(n) = an, and v(n) =k, then n= em, where e is an even 
perfect number (e,m)=1, and m< N(a, k). 


The bound Ma, k) is not very friendly, although it is primitive recursive as a 
function of k. In the special case of odd solutions n, there is a somewhat more 
reasonable bound. For example, if n is an odd perfect number with v(m) = k, then 
Heath-Brown (1994), improving on a result in Pomerance (1977a) showed 


k 
n=4*. 


That n is bounded by some function of k (for odd perfect numbers) was first 
shown by L. E. Dickson in 1913. 

In 1932, D. H. Lehmer proposed the following problem that is similar in flavor 
to the odd perfect number problem. Lehmer asked if there are any composite 
natural numbers n with ¢(7)|—1, where ¢ is Euler’s function from elementary 
number theory. This is still unsolved today. We do know that the number of 
composite integers n <x with g(n)|n—1, is O(x'”log**x), and that if g(n) 
divides n—1, »(n) =k, and k>1, then 


2k 
n=k’ , 


see Pomerance (1977b). This can be improved to n <4" using the method of 
Heath-Brown (1994). 

Consider the function s(n) = a(n) — n, the sum of the proper divisors of n. Thus 
a perfect number » satisfies s(n) = n. For any.natural number 1, one may consider 
the aliquot sequence for n: n, s(n), s(s(n)),.... An old conjecture of Catalan and 
Dickson is that any such sequence either terminates at 0 (by hitting a prime and 
becoming 0 two steps later) or becomes periodic. This has been proved for all 
n<275. Guy and Selfridge (1975) instead conjecture that the set of 2 whose 
aliquot sequence is unbounded has positive lower asymptotic density. 

A cycle of length 2 for the function s(n) is called an amicable pair. Namely, this 
is a pair of distinct integers a,b such that s(a)= 6 and s(b) =a. Such numbers 
have been studied since Pythagoras who noted that 220 and 284 are an amicable 
pair. In 1955, Erdés showed that the numbers which belong to an amicable pair 
have asymptotic density 0. In Pomerance (1981) it is shown that the number of 
integers up to x which belong to an amicable pair is at most x - exp(—(log x)'”) if 
x is sufficiently large. 


7.4. Graphs on the integers 


Consider the coprime graph on Z. This is the graph whose vertex set is Z and two 
integers a, b are connected by an edge if (a, b) = 1. 

The problem that opens this chapter can be reworded as follows. What ig the 
largest set f € {1, 2,...,N} such that the induced coprime graph on # contains 
no edges? This is the case k=2 of the following famous problem of Erdés. 
Namely, what is the largest set f C {1, 2,...,N} such that the induced coprime 
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graph on & does not contain a complete graph on & vertices? Of course, the set 
of integers n < N which have a prime factor among the first k — 1 primes is such a 
set, and Erd6s conjecture was that this set gives the maximum. This conjecture is 
fairly easily proved for k =3. The case k =4 was proved by Szabé and Téth 
(1985). Finally this long-standing problem has been very recently settled com- 
pletely by Ahlswede and Khachatrian (1995). First, they showed that there is a 
pair k, N for which the conjecture fails, and their example suggests that probably 
there are infinitely many integers k such that the conjecture fails for these k and 
certain small values of N. On the other hand, they proved that the following 
slightly weaker form of the conjecture is true: for every A there is a number 
Ny = No(k) such that for N>N,(k), up to N the set of multiples of the first k — 1 
primes gives the largest set with " k numbers pairwise coprime. 

If  C{1, 2,...,N} has ||> |4N] +1, then we have seen that the coprime 
graph on # must contain an edge. Must it already contain many edges? The 
answer is yes, for as Erdos et al. (1980) show, there must be some a € ¥ with 
valence at least cN/log log N. Moreover, if lod |=(4 + .e)N, then the coprime 
graph on of contains at least c(e)N* edges. They also show that if |A| > (2 + e)N, 
then the coprime graph on contains at least c(e)N° triangles, i.e. triplets a,, a>, 
a, with (a,, @,) =(@,, 4;)=(@,, a;)=1. 

The coprime graph on Z has many edges so we might expect that if J, and 7, are 
disjoint intervals of n consecutive integers, then the induced coprime graph on 
J, UJ, contains a matching from J, to J,. This is not the case, however. Suppose 
I, = {2, 3, 4} and /, = {8, 9, 10}. Then any one-to-one correspondence between J, 
and J, must have at least one pair of even numbers in the correspondence. 
Another example: /, = {2, 3, 4, 5}, 1, = {30, 31, 32, 33}. Here nothing can 
correspond with 30. 

About 25 years ago, D. J. Newman conjectured that if J, = {1,2,...,} and J, 
is any interval of n consecutive integers, then there is a coprime matching from J, 
to I,. (If J, 1, #®, we mean there is a one-to-one correspondence from /, to J, 
with corresponding numbers coprime. This can still be thought of as a matching in 
the coprime graph if we replace I, by I, + {n!}.) In Pomerance and Selfridge 
(1980), Newman’s conjecture is proved by giving an algorithm for constructing a 
coprime matching and proving it works for every n. The proof involves effective 
estimates for the cardinality of the sets S(x, u) = {n <x: g(n)/n <u}, where ¢ is 
Euler’s function. 

Consider now the divisor graph on N. Here two distinct numbers a, b are 
connected by an edge if either a|b or b|a. There are many attractive problems 
concerning the divisor graph; few of them are completely solved. The divisor 
graph is not as dense with edges as the coprime graph, so in general two 
n-element subsets of N should not be expected to contain a matching. Rather, we 
might consider the following. Let f(m) be the least integer such that the divisor» 
graph contains a matching from {1, 2,...,n} into {n+1,n+2,..., f(n)}. The 
following result is due to Erdés and Pomerance (1980). 


Theorem 7.13. There are positive constants Cy,c, such that for all large n, 


< fn) <c,n(log n) 


1/2 120 


Con((log n)/log log n) 
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Proof. We present only the proof of the upper bound; the lower bound proof is 
much harder and not particularly combinatorial in flavor. We shall show that we 
may take c, as any number larger than 2. 

Let ¢ >0 be arbitrary. The divisor graph clearly contains a matching from the 
integers in (n/ Vlog n, n] into the integers in (n, n [Vlog n]]-indeed, just multiply 
each number in the first interval by [Vlog ”]. It will thus be sufficient to show the 
divisor graph contains a matching from / to J, where 


I=[l,a2/Vlogn]ON, J=(n[Vloga], (2+ e)nVlognrJON. 
We consider in fact only the subgraph where a €/, b EJ are connected by an 
edge if b/a is prime. 


If aeE/J, the number of primes p such that pa€ J is, by the prime number 
theorem, 


m (Z(2+enviogn) ~  (Zntviognl) 
nViog n 


>(i+i 
(1+ 76) a log(a” '‘nviog n) 
logn 
2(1+$e) log logn 


if n = ny, uniformly for al a EJ. On the other hand, if b € J, the maximal number 
of a1 that can correspond to b is at most the number of primes p that divide b 
with p 2 logan. Since b <(2+ e)n\Vlog zn, this number evidently is at most 


log((2 + €)n Vlog n) 
log log x 


logn 
] 2 _ 

<(1 +36) log logn 

for n2=n,. Thus for n>max{n,, 1,}, the Kénig—Hall marriage lemma (see 
chapter 3) implies there is a matching from / into J. O 


What is the length H(n) of the longest simple path in the divisor graph on {1, 
2,...,m}? Hegyvari conjectured H(n) = o(7) and this was proved by Pomerance 
(1983). It would be nice to get an asymptotic formula for H(7). Recently, Saias 
and Tenenbaum have obtained fairly sharp estimates for H(n). Other problems of 
a similar nature are considered in Erdés et al. (1983). 


7.5. Egyptian fractions 


The ancient Egyptians thought fractions 1/a where a€WN were especially nice. 
There is today a wide body of literature and many problems and results 
concerning Egyptian fractions—see section 4 in Erdés and Graham (1980) and 
section D11 of Guy (1994). 

It has been known since Fibonacci that every positive rational r can be 
expressed as a finite sum of distinct Egyptian fractions. In fact, the greedy 
algorithm of choosing 2,4; Minimal with a,,, >a, and 1/a,+---+1/a,,,<r 
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always terminates. However, a representation of r as a sum of distinct Egyptian 
fractions is certainly not unique and this fact leads to many questions. For 
example, what is the fewest number of summands for r? Or, how many ways are 
there to write 7 as a sum of m distinct Egyptian fractions as 1 — ©? For the latter 
question, the case 7 = 1 has special interest. 

It has been conjectured by Erdds and Straus that for every integer n>1, 4/n 
can be written as a sum of three Egyptian fractions. This has been verified 
numerically for small values of n and has been shown true for all n but for a 
possible exceptional set of asymptotic density 0. More generally, Schinzel and 
Sierpinski have conjectured that every positive rational a/b can be expressed as a 
sum of three Egyptian fractions provided the denominator 6 is sufficiently large as 
a function of the numerator a. This is easily seen not to be true for a sum of two 
Egyptian fractions. For example, if p is a prime with p=1 (mod3), then 
3/p =1/x + 1/y is not solvable in integers. 


7.6. Pseudoprimes 


From Fermat’s little theorem, if is prime and a #0 (mod n), then 
a”-'=1(modn). (7.14) 


The congruence (7.14) is very useful for testing large numbers n for primality. 
Indeed, even if n is very large, it is a relatively simple matter to compute the least 
non-negative residue of 2""’ (modn); if this is not 1 (and n>2), then n is 
composite. The residue may be found in O(log) multiplications (mod m) using 
the repeated squaring method. However, this method is not perfect — sometimes 
we come across composite numbers 7 that nevertheless satisfy (7.14) for some a. 
The least example with a = 2 is n = 341, and with a =3 is n=91. We say n is a 
pseudoprime to the base a if n is a composite natural number and (7.14) holds. 

Let P,(x) denote the number of base a pseudoprimes up to x. Can we prove 
that for a fixed a we have P,(x) = o(7(x)); that is, that base a pseudoprimes are 
rare compared with primes? Certainly not for a = +1, since then (7.14) has many 
composite solutions. However, for |a|>1, Erdés showed in 1956 that P,(x) = 
o(7(x)) does in fact hold. The best result in this direction is due to Pomerance in 
1981. 


Theorem 7.15. For each integer a with |a| > 1, there is a number x(a) such that for 
x 2X_(a) we have P,(x)<x'~*”’?, where 


e€(x) = (log log log x)/log log x . (7.16) 
In 1956, Erdés conjectured that P, (x) >x'~©*® for some c > 0 and x sufficiently 


large and where e(x) is defined in (7.16). This conjecture was refined by 
Pomerance to the following. 
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Conjecture 7.17. For each integer a with |a|>1, we have 


P(x) = x(t ea( ee) 
, ; 


One might wonder if a number can be simultaneously a pseudoprime to the 
bases 2 and 3. This in fact can happen, the least example is n = 1105. It is not 
known if there are infinitely many such m. There are numbers n which are a 
pseudoprime to every base a with (a, n) = 1. Such numbers are called Carmichael 
numbers; the smallest example is n = 561. If C(x) is the number of Carmichael 
numbers up to x, it is known that C(x) <x'~*" for x sufficiently large and it is 
conjectured that C(x) =x!" 7°", Alford et al. (1994) recently proved there 
are infinitely many Carmichael numbers. In fact they showed the following. 


Theorem 7.18. For all sufficiently large values of x, C(x)>x?"’. 


The proof, which roughly follows a heuristic argument given by Erdés in 1956, 
has some strong combinatorial elements. 

We remark that any composite number 7 with g()|n—1 must also be a 
Carmichael number, but no such n are known to exist (see the earlier remarks on 
perfect numbers). 
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finitely starlike 850 

firm geometry 651 

FIRST-FIT DECREASING heuristic 1551 

FIRST-FIT heuristic 1551, 1575 

first-moment method 356 

first-order theory of graphs 364 

first-passage percolation 1937 

Fisher’s inequality 696, 1707 

five-color theorem 237, 257, 277, 2179 

fixed point 1862, 1864 

fixed-point-free element 628 

fixed-point property 1824 

fixed-point sets 1824 

fixed points in posets 1824 

FKG inequality 1805 

flag 680, 1452 
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flag geometry 675 

flag-transitive 1452, 1454, 1469, 1504 

flat manifolds 922, 958 

flat of a matroid 484, 490, 497 

flat of deficiency & 852 

flats 497, 859 

flock 661 

flow 124, 599 

2-flow 277 

flow-augmenting path 

4-flow conjecture 515 

flow equivalent tree 207 

Foata's transformation 1024, 1045 

football-pool problem 715, 788 

forbidden graph 1233 

forbidden minor 1833 

forbidden subgraph 1268, 2178 

forbidden subgraph problem 1233, 1256 

forest 10, 115, 389, 488, 491 

forests 489 

formal power series 1114 

four-color problem 56, 237, 910, 2178 

four-color theorem 56, 238, 248, 258, 291, 
295, 332, 541, 2179 

Fourier analysis 930 

Fourier series 922 

Fournier—Fraisse theorem 

fractional blocking 395 

fractional blocking number 
1576, 1577 

fractional blocking set 388, 395 

fractional edge-coloring number 

fractional Helly theorems 854 

fractional matching 219, 388, 395, 412, 1548 

fractional matching number 388, 396, 408 

fractional (nade-)cover 388 

fractional packing 1566, 1567 

fragment 1470 

Fraisse theorem 

framework 1914 

Frankl theorem 

free arrangement 

free group 1479 

free matroid 486, 494, 497, 504 

free monoid 1027, 1029 

free product 1467, 1481 

free subgroup 1753 

Freiman’s (doubling) theorem 986 

frieze patterns 2140 

Fulkersonian hypergraph 395, 402 


1661 


1262 


388, 396, 408, 409, 


1690 


1260 


1274 
2071, 2073 
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Fulkerson’s optimum arborescence theorem 
1674 

full Ramsey-type theorems 1357 

full time-constructibility 1628 

fully polynomial approximation scheme 
1633 

fully space-constructible 1611 

fully submodular function 572 

fully time-constructible function 

functional digraph 1024 

functional equations 929, 1195, 1206 

functional gate 2012 

fundamental circuit 554 

fundamental group 1495, 1846, 2069 

fundamental parallelepiped 923 

fundamental parameters 799 

fundamental theorem of extremal graph theory 
1243 

fundamental theorem of projective geometry 
655 

Fiiredi theorem 1242, 1275 

fusion principle 2138 


1584 


1609 


G-set 614 

G-space 614 

GAB, see Tits geometry 681 

Gale diagram 882, 893, 894 

Gale transform 882 

Gallai hyperplane 814 

Gallai identities 182 

Gallai line 812, 814, 818 

Gallai plane 814, 815 

Gallai theorem 250, 254, 255, 1799 
Gallai-Aschbacher decomposition 1476 
Gallai-Edmonds structure theorem 202, 1471 
Gallai-Milgram theorem 44 
Gallai-Roy theorem 48 

Galois fields 2181 

Galois geometry 657 

Galois group 954 

Galois theorem 649 

game 419 

game theory 628 

I-flow 292 

gamma function 925, 1068, 1076 
gammoid 500 

Gaussian coefficient 650, 1365 
general linear group 655 

generalized arithmetic progression 986 
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generalized assignment problem 1578, 1579, 
1884 

generalized ball 1297 

generalized coloring problem 1624, 1625 

generalized Kempe chain 256 

generalized line graph 1734 

generalized m-gon 761 

generalized polygon 674 

generalized polymatroid 581 

generalized quadrangle (GQ) 665, 684 

generalized Ramsey number 1347 

generating functions 900, 1025, 1027, 1095, 
1962 

generator matrix 777 

generator polynomial 784 

generic arrangement 2067 

genericity assumption 1919 

generously transitive 766 

genetics 1985 

genome 1985 

genus of a graph 320 

genus of a group = 1493 

genus of a surface 1487 

geometric graph 684 

geometric lattice 496-498, 516, 1055, 1713, 
1857 

geometric lattice of flats 830 

geometric realization 1843 

- geometric simplicial complex 1843, 1859 

geometrical dual 2178 

geometry 2180 

geometry of numbers 921-923, 929, 956 

Gewirtz graph 701 

giant component 357 

Gilbert-Varshamov bound 781 

girth 10, 842, 1264, 1503, 1752-1754, 1757, 
1810 

gluing lemma 1848 

Go 2119, 2129 

Gédel’s compactness theorem 2092 

Golay codes 716, 791, 794 

Goldbach’s conjecture 971, 982 

Goldberg conjecture 275 

golden identity 248 

golden ratio 248 

Gomory-—Hu tree 130 

Gorenstein complex 1856 

Gosper’s algorithm 1075 

Gosper's indefinite hypergeometric summation 
algorithm 1210 


GQ, see generalized quadrangle 665 

GR, see graph realization 592 

gradient method 1660 

Graham—Newman problem 820 

Graham-Rothschild theorem 1335 

Graham’s conjecture 1007 

Gram matrix 767 

graph 5, 500, 2177 

— 2-are-transitive 1469 

— 2-bicritical 218 

— 3-colorable 246 

— 2-connected 117, 493 

— 3-connected 151 

— 3-connected planar 891 

— l-extendable 195 

— 2-regular 1165 

- acyclic 10 

— adjacent faces of a planar 55 

~ adjacent vertices of 5 

— algorithms 2187 

— aritical 263, 264 

— antipodal 1505 

~ arc-transitive 1454, 1455, 1503-1507 

— asymmetric 1461 

— automorphism of 64, 1363 

— balanced 362 

— balanced bipartite 6 

— basis 1831 

- bicritical 198 

— bidirected 214 

— binomial random 353 

— bipartite 6, 14, 386, 397, 407, 947 

— bipartite incidence 421 

— bridge 587 

— Catlin 256 

— Cayley 64, 1467-1469, 1473, 1478, 1482- 
1484, 1486, 1493, 1494, 1497, 1500, 
1752-1754, 1756, 1757 


— Chang 768 
~ chemical 1957 
— x-critical 48 


— chordal 117, 329, 399 

— circuit extendable 78 

~ claw-free 66, 183, 225 

— Clebsch 70], 1454 

- closure of 23, 1257 

— color-critical 236, 239, 242-244, 250, 
253-256, 262 

— coloring number of 250, 2104 

— coloring problem 1892 
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graph (cont @) — genus of 320 
— comparability 266, 443 — geometric 684 
-— complete 5, 1268, 2176 — Gewirtz 701 
— complete bipartite 6 — Grinberg 57 
— component of 9 — Hajos constructible 252 
— composition of 43 — Hamiltonian 20, 2176 
— conference 753 ~ Hamming 1463 
— connected 9, 489 ~ Heawood 19, 63, 64, 1452 
- connected labeled 1041, 1114 — hexagonal animal 1973 
— connectivity 1958 — Higman-Sims 701, 729 
— contraction-critical 257, 262 — Hoffman-Singleton 752, 760 
— convex representation of 310 ~ homogeneous 1507, 1508 
— coprime 1011, 1012 — homomorphism 276 
— covering 1455, 1469 — hypo-Hamiltonian 54 
— Coxeter 52, 54, 64, 1452, 1472 — hypomatchable 196 
— critical 156, 196, 2179 — hypotraceable 54 
— critically imperfect 268, 269 — incidence 18, 386, 421, 674 
— cube of 53 — infinite $, 2179 
— cubic 59, 1965 — interaction 1949 
— current 325 — intersection 387 
— d-interval 312 — interval 266, 1994 
— d-polytopal 879, 905-907 ~ invariants 1957, 1973 
— d-regular 7 — isomorphism 1511, 1516, 1518 
— dependency 1798 — isomorphism classes of 1060 
— diameter of 10 — isomorphism problem 1619, 1631 
— dimension of 842 - isospectral 1456 
- directed 15 — joined vertices of 5 
— distance-regular 760, 1506, 1507 — k-chromatic 48 
— distance-transitive 629, 766, 1452, 1455, — k-colorable 48, 246 
1505, 1506 — k-extendable 195 
— divisor 1012 — Kneser 64, 240, 764, 1455, 1825 
- doubly periodic 311 — Konig-Egervary 219 
~- dual 259, 308 — Kuratowski 6 
— dual polar 768 ~ A by A half-bipartite 2105 
— edge-critical 272 — Levi 64, 76, 1449 
— edge-weighted 14 — lexicographic product of 43 
— embedding 1970 — line 65, 183, 274, 387, 392, 1455, 1456, 
— embedding of 303 1734, 2053 
— empty 6 — locally connected 66 
-— end of 1480 — locally finite 1477 
~ Erdés-Rényi 1242 — locally planar 262 
— Eulerian 65, 163, 2076, 2176 — LSG, see Latin square 
- even 7,9 — map 1826 
— evolving 354 — Margulis 1265 
— extremal 12, 1167, 1233 — matching covered 195 
~ factor critical 196 — maximal non-Hamiltonian 1275 
— finite 5 — metrically k-transitive 1508 
— first-order theory of 364 — minimal forbidden unit-distance 844 
— forbidden 1233 — minor of 304 


— generalized line 1734 — molecular 1957 
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graph (contd) — representative 387 
— Moore 63, 760, 1264 — ridge 905 
— non-isomorphism problem 1631 — rigid 1461 
— of groups 1495 : — rigid-circuit 266, 267 
~ order of 5 — rooted 89] 
— oriented 15 — s-arc transitive 629 
- outerplanar 308 - Sachs 1973 
— overlap 58 — Saturated 263 
- Ps-isomorphic 269 - Schrijver 240 
— packing 1276 — self-complementary 1464 
— Paley 1760, 1761 — series—parallel 330, 538 
— panconnected 78 — shrinkable 197 
— pancyclic 24, 25 — signed 2076, 2077 
— partition of 43, 1830 — simple 5 
~ path partition of 43 ~ sizeof 5 
— path-tough 53 — sparse 239 
— perfect 182, 265-267, 269, 270, 1546, 1591, — split 266 
1687 — square lattice 684 
— Petersen 51, 54, 62-64, 75, 89, 92, 189, — square of 52 
211, 295, 296, 377, 515, 548, 701, 750, 752, — string 314 
760, 1452, 1472 — strongly regular 684, 750, 1456, 1462, 1471, 
— planar 55, 153, 237, 259, 277, 306, 494, 1498, 1501, 1513 
1458, 1511, 1564, 2078 — T-contractible 217 
— plane 55, 306 - t-perfect 271 
~ Platonic 5 — ttough 52 
— point 684 ~— theory 2167 
— polynomial 252, 1772 — Thomsen 6 
— powerof 52 — tough 51, 1472 
‘ — process 354 — toughness of 52 
-—- fandom 354 — traceable 20 
~ product — transitively orientable 1476 
- — Cartesian 1463 — triangle-free 239, 240, 250 
— — categorical 1463 — triangular 684, 750, 751 
~ — lexicographic 1463 — Turan 1234 
- -— strong 1463 ~ underlying 15 
— property 354, 355 — uniform random 353 
— property of 1253, 1284 - unit-distance 834 
— pseudo-geometric 684 — unlabeled 1060 
— pseudo-random 1758, 1760 - (6k A) 751 
— quasi-random 376 — vertex-critical 268 
— quotient 1461 — vertex-transitive 64, 1452, 1453, 1464, 
- Rado 1463, 1474, 1508 1468-1473, 1475, 1477, 1480-1482, 1487, 
— Ramsey 384, 418, 419, 1464 1488, 1491, 1493, 1494, 1500, 1504 
- random 353, 1350, 1461, 1932 — very strong perfect 269 
— rank3 766 — weakly saturated 1273 
— Teaction 1957, 1958 — x-path 69 
— realization (GR) 592 graph-like state of matter 1975 
— reconstruction 1730 graphic matroid 491, 494, 505, 516, 518, 532, 
— rectangular representation of 312 547, 594 


— regular 7 graphic notation 1959, 2177 
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graphical enumeration 1195 

graphical regular representation, see GRR 

Grassmann variety 2041, 2062, 2064 

Grassmann—Pliicker relations 2063 

Grassmannian 1510 

Gray codes 1473 

greedoid 521, 582, 1546, 1830, 1857 

— rank 1830 

greedy algorithm 139, 373, 638, 1545-1547, 
2187 

greedy blocking set 1547, 1548 

greedy heuristic 1546, 1547 

greedy set 581 

greedy solution 1546 

Green Hackenbush 2138 

Greene-Kleitman theorem 437, 439 

Griesmer bound 782 

Grinberg graph 57 

Grinberg’s condition 56 

Grétzsch theorem 260 

ground state 1946, 1949 

ground states of spin glasses 1568 

group 

— 2-closed 1502 

— 2*-closed 1502 

— 2-transitive 623, 624 

— action of 614 

- affine 1453, 1454 

- almost simple 618 

— alternating 619, 1473, 1501 

- automorphism 784, 938, 1447, 2054 

— base of a permutation 637, 2057 

~ basic 618 

— Bieberbach 942, 943 

— black box 1519 

— CFSG, see classification of finite simple 

—- Chevalley 619 

— classical 620 

— compact topological 2051 

~— Coxeter 677, 2061 

— crystallographic 1488 

— cyclic 1451 

— dihedral 1451 

— discrete 921 

— doubly transitive 1451, 1504, 1507, 2053 

~ doubly transitive permutation 2057 

—endof 1480 

— exceptional 620 

— free 1479, 1494, 1495 

— fundamental 1495, 1846, 2069 


- Galois 954 

— general linear 655 

— genusof 1493 

— graph of 1495 

- homology 1846 

— isometry 1458 

— Jordan 625, 626, 640 

— Lie 922, 1479 

— Mathieu 622, 625, 716, 2183 

— monster 922 

— multiple transitive 623, 625 

— nilpotent 1478 

— nilpotent-by-finite 1478 

— of Lie type 619, 621 

— of permutations 2169 

— orthogonal 620, 1460 

— permutation 614, 1516, 1963, 2053, 2054 

— polyhedral 1459 

— primitive 616, 1502, 1515, 2054 

— primitive permutation 2053, 2056 

— projective special linear 620 

— rank of permutation 626 

— reduced simplicial homology 1846 

— reduction functions 2172 

~ reflection 1473 

— regular 615 

— representation theory 2174 

— representations 1755, 1756 

— representations of symmetric 1057 

— rotation 1458 

— semiaffine 1453, 1454 

— simple 2056, 2057 

— solvable 1453 

— space 922, 941, 943 

— sporadic 622 

— symmetric 1451, 1473, 2053 

— symmetry point 1965 

— symplectic 620 

— transitive 614, 1482 

— twisted 620 

— uniprimitive 2054 

— uniprimitive permutation 2053 

— unitary 620 

— vertex-stabilizer 1452 

group-divisible design 705 

growth rate 1477 

— exponential 1478 

— polynomial 1478 

GRR (graphical regular representation) 1476, 
1477, 1500 


Grundy number 276 

Grundy’s Game 2136, 2150, 2158 

guess tape 1606 

Gyarfas conjecture 1280 

Gyarfas~Komlés—Szemerédi theorem 1266 

Gyarfas—Prémel-Szemerédi-Voigt theorem 
1267 


H-admissible 1178 

H-admissible function 1179 

h-vector 898-900, 1841 

Haar measure 2041, 2051 

Hadamard code 795, 803 

Hadamard matrix 729, 2183 

Hadamard product 1110 

Hadwiger number 1493, 1494 

Hadwiger theorem 857 

Hadwiger—Nelson problem 238, 841 

Hadwiger's conjecture 238, 257, 258, 262, 
332, 2179 

Haggkvist theorem 1259-1261 

Hajnal-Szegedy theorem 1282 

Hajnal-Szemerédi theorem 248, 1280 

Hajos constructible graph 252 

Hajos construction 252 

Hajos theorem 252 

Hales—Jewett theorem 1005, 1338, 1363 

— density 1366 

* Hall theorem 1052, 1053, 2185 

Hall triple system 654 

Hall's condition 185 

Hall’s marriage theorem 438 

halting problem 1605 

halving lines 865 

ham sandwich theorem 1828, 1865 

Hamada conjecture 674 

Hamilton |-factorable 80 

Hamilton-connected 26 


Subject index 


Hamming bound 781 


LXXVII 


Hamming code 667, 777, 778, 783, 788, 792, 


2047 
Hamming cone 2048 
Hamming cube 1802, 1806 
Hamming distance 1802 
Hamming graph 1463 
Hamming metric 2048, 2049 
Hamming scheme 762 
Hammond operator 2174 
hard hexagon 194] 
hard Lefschetz theorem 1839 
Hardy—Ramanujan Tauberian theorem 
harmonic function 1744 
Hasse diagram 312, 436 
Hasse—Minkowski theory 2183 
hat-check girl problem 1790 
Hauptsatz (of Polya) 1962 
Hayman-admissible 1177 
head of a digraph 15 
head of a walk 16 
heap 2020 
Heawood conjecture 323 
Heawood graph 19, 63, 64, 1452 
Heawood number 261 
Heawood theorem 260 
Heawood’s formula 260, 261 
height function 497 
heights of binary trees 1199 
Heilbronn’s problem 861 
Helly dimension 853 
Helly (homothetic) dimension 853 
Helly number 522 
Helly property 387, 393-395, 407 
Helly theorem 834, 848-852, 856 
— fractional 854 
— multiplied 852 
Helly (translate) dimension 853 


Hamilton cycle 462, 1256, 1275, 1472, 1473, Helly-type theorems 848 


1738 
Hamilton decomposition 86 
Hamilton path 20, 1472 
Hamiltonian 1927, 1928, 1948 
Hamiltonian circuit 20, 1548, 1601, 1602, 


hereditary class 516, 517 
hereditary closure, see dn 581 
hereditary family 1295, 1546, 1547 


hereditary hypergraph 388, 394, 427 


hereditary property 1253 


1607, 1610, 1614, 1617, 1634, 1635, 2028, Hermite normal form 924, 952 


2034, 2035 
Hamiltonian graph 20, 2176 
Hamiltonian path 1790 
Hamiltonicity 365 
Hamiltonicity obstacles 1472 


Hermite’s constant 936 
Hermitian unital 662 

heuristic 1543 

— 2-OPT 1553, 1555 
— Christofides 1550 
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heuristic (cont @) 

~ construction 1553 

— dual 1543, 1550, 1560, 1586 

~ exchange 1553, 1555-1557 

— FIRST-FIT 1551, 1575 

— FIRST-FIT DECREASING 1551 

— greedy 1546, 1547 

— Lin-Kernighan 1555 

— NEAREST INSERTION 1549, 1550 

— NEAREST NEIGHBOR 1549 

— NEXT-FIT 1551 

- on-line 1552 

— r-OPT 1554-1556 

— random node 1577 

— separation 1570 

HEX game 1863 

hexadecimal games 2158 

hexagon 935, 939 

hexagonal animal 1961 

hexagonal animal graph 1973 

hidden network 595 

high vertex 255 

higher surfaces 260, 261 

Higman-Sims graph 701, 729 

Hilbert basis 1673 

Hilbert function 896, 898, 902, 2058, 2065 

Hilbert series 2058 

Hilbert’s basis theorem 1776 

Hironaka decomposition 2058, 2065 

Hirsch conjecture 908, 909, 1656, 1668-1670 

Hirzebruch inequality 828 

Hitchcock-Koopmans transportation problem 
1668 

hitting time 355 

HNN-extention 148} 

Hoffman-Singleton graph 752, 760 

holonomic 1110 

homogencous 2054, 2055 

homogencous configuration 754, 766 

homogeneous difference equations with constant 
coefficients 2169 

homogeneous for / in the color a 2093 

homogeneous set 1334 

homologous series 1959, 1961, 1962 

homology 668, 1846 

homology group 1846 

homomorphism 1451 

homothetic 951 

homotopy 1846 

~ Cohen-Macaulay 1855 


Subject index 


complementation theorem 1852 
— equivalent 1846 

group 1846 

— invariant 1846 

- theorem 1821, 1848 

— type 1846, 2069 

honeycomb 945 

hook formula 1155 

hook lengths 1058 

Hopf trace formula 1863 

Hopping lemma 72 

Hosoya index 1975 

hot game 2128 

HS-admissible 1180, 1181 

Hiickel matrix 1972 

Hungarian forest 193 

Hungarian method 190 
hydrocarbon 1960 

hyperbolic geometry 1751, 1777, 1778 
hyperbolic plane 1487, 1488, 1494 
hyperbolic quadric 665 
hypercycle 387, 395, 420 
hyperedge 18 

hypergraph 18, 241, 381, 385, 846, 1456 
— 2-colorable 246 

- acyclic 390 

— arboreal 384, 395, 398 

— balanced 384, 390, 394, 395, 397 
— bicolorable 397, 421, 422 

— color-critical 241-243 

— complete 388, 426 

— connected 387 

- corank of 387 

— cover of 1547 

- cycle of 387 

— degree in 387 

— discrepancy of 1408, 1414 

— dual 386, 391-393, 397, 405 

— edge coloring of 385 

— factor 418 

- factorization of 385 

— Fano 18, 63 

— Fulkersonian 395, 402 

— hereditary 388, 394, 427 

— induced 386 

— intersecting 387, 391, 394, 411, 412, 425 
— interval 395, 398, 400 

— isomorphic 386 

— k-chromatic 417 

— k-colorable 246, 415, 417, 422 
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hypergraph (con: @) 

— Kneser 424 

— linear 387, 416, 422, 425, 427 

— Mengerian 384, 395, 397, 402, 404 
- minor 390 

— minor of 390 

— multipartite 414 

— normal 384, 395, 401 

— partial 401, 408 

— product of 386, 419 

— r-partite 387, 414, 415 

— random 410 

— rank of 387 

- regular 415 

— regular, uniform 1471 

- regular uniform 1471 

— restriction of 386 

— separating 387, 393 

— simple 387 

- theory 383 

— totally balanced 384, 395, 398, 399 
— trace of 386 

— transversal] 387 

- uniform 387, 392, 408, 415 

— unimodular 395, 397 

hypermetric cone 2049 

hypermetric inequalities 1568, 2041, 2046 
hyperoval 658 

hyperplane 406, 492, 493, 498, 499, 650, 856 
hyperplane arrangement 2065-2067 
hypo-Hamiltonian graph 54 
hypomatchable graph 196 

hypopath 594 

hypotraceable graph 54 


(i,j)-shift 1298 

ice problem 1940 

icosahedron 6, 719, 1460, 1507 

icosian calculus 2176 

ideal relation theorem 1851 

ideal topology 1844 

idempotent 786 

identification 252 

image (of a hypergraph) 386 

immersion conjecture 343 

immersion (of a graph) 384, 391 

impartial games 2127, 2132, 2134, 2135, 2138, 
2141, 2145, 2150, 2151 

implicit functions 1161, 1195 

imprimitive action 617 


inaccessible cardinal 2112 

incentive 2128 

incidence algebra 1052 

incidence graph 18, 386, 421, 674 

incidence matrix 6, 386, 695, 1707, 1739, 
2170, 2183 

incidence poset 469 

incidence relation 5 

incidence vector 1567, 1653 

incident 18, 55 

inclusion and exclusion 2167, 2171 

inclusion-exclusion formula 29 

inclusion-exclusion principle 971, 1044, 1049, 
1051, 1053, 1087 

indecomposable permutations 1030, 1116 

indefinite hypergeometric summation 1075 

indegree 15 

independence number 40, 372, 1258, 1464, 
1471, 1772, 1773 

independence polytope 1678 

independence structure 2178 

independence system 555, 582, 1546 

independent set 40, 483, 485, 488, 489, 491, 
492, 494, 500-503, 512, 956, 1471, 1757 

index of adesign 696 

index of a sublattice 924 

induced character 632, 633 

induced forbidden subgraph 1254 

induced hypergraph 386 

induced matroid 500 

induced subcomplex 1857 

induced subgraph 8 

infinite families 851 

infinite graph 5, 2179 

— locally finite 1470, 1475, 1480, 1484, 1505 

infinite Ramsey theorem 1351, 1352 

infinitesimal rigidity 1915 

information rate 775 

information symbols 777 

Ingham’s Tauberian theorem 1129 

inhomogeneous recurrences 1134 

inner distribution 759 

input gate 2012 

INSERT operation 2019-2021, 2023 

instance 14, 1601 

integer decomposition property 1664 

integer distances 847 

integer hull] 1691 

integer lattice 924, 925, 945 

integer matrix 952 
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integer polytope 949 

integer program feasibility problem 956 

integer programming 921, 952, 956 

integer programming problem 1618 

integral geometry 958 

integral lattice 924 

integral polyhedron 1656 

integral polymatroid 566 

integral quadratic forms 922 

integral solution 397 

integral vector 1652 

interaction graph 1949 

interactive proof 1511, 1631 

interactive proof system 1630, 1631 

interactive protocol 1630 

interior point method 1592 

interlacing 1734 

internal vertices 9, 40 

internally disjoint 34 

intersecting cuts 1566 

intersecting families 1301, 2187 

intersecting hypergraph 387, 391, 394, 411, 
412, 425, 1295 

intersecting submodular function 572 

intersection 7 

intersection-closed family 1296 

intersection graph 387 

intersection matrices 399, 755 

intersection numbers 754 

intertwining conjecture 344 

interval graph 266, 1994 

interval greedoid 1830 

interval hypergraph 395, 398, 400 

interval in a lattice 393 

interval order 459 

interval scheduling 1901 

intractable 598 

intractable matroid problem 557 

intramolecular rotations 1966 

intrinsic r-volume 946 

invariant 1512, 2177 

— complete 1456 

— integration 2051 

~ theory 2175 

inversion 1027, 1105 

inversion law 950 

inversion table 1027 

inversive plane 659, 704 

invertibility 885 

involution 519, 671, 884 


irreducible 502, 815, 1854 
irreducible character 630 
irreducible cyclic code 785 
irreducible representation 630 
irredundant base 637 

irredundant system 1656 

irregular node 227 

Ising model 1210, 1928, 1944, 1950, 1951 
Ising model partition function 1977 
Ising problem 1943 

isolated 934 

isolated singularities 1148 

isolated vertex 7 

isolation of zeros 1158 

isomer enumeration 1960 

isomeric structures 1960 

isomeric synthons 1969 
isomerization reactions 1966, 1967 
isomers 1970 

isometry 884 

isomorphic 5, 486, 877, 878, 907, 2109 
2-isomorphic 594 

isomorphic hypergraphs 386 
isomorphism 5, 392, 1450, 1510, 1514, 1515 
isomorphism classes of graphs 1060 
isoperimetric 1297 

isoperimetric ratio 1481 
isoperimetry 1481 

isospectral 1497 

isotopy theorem 885 

isthmus 292 


Jackson theorem 61, 1259, 1261 
Jacobi polynomials 1988 

Jacobi symbol 2031, 2032 
Jacobian varieties 958 

Jakobsen conjecture 275 
Jensen's inequality 1814 
Jerrum’s filter 637 

job shop scheduling 1904 
Johnson bound 782 

Johnson scheme 763 
join-contractible 1852 

joined vertices of a graph 5 
joins 1843, 1845, 2149, 2150 
Jones polynomial 2041, 2074-2077 
Jordan curve theorem 306 
Jordan group 625, 626, 640 
Jordan measurable 932 


jump 418 
jump value for r-graphs 1248 
jumping arc 575 


k-acyclic 1847 
k-arc-connected 147 
k-blocking 404 

k-blocking number 396 
k-book 303 

k-canonical representation 895 
k-chromatic 388, 406, 1724 
k-chromatic graph 48 
k-chromatic hypergraph 417 
k-cliques 1457 

k-colorable 388 

k-colorable graph 48, 246 
k-colorable hypergraph 246, 415, 417, 422 
k-coloring 48, 388 
k-complete filter 2114 
k-connected 34, 147, 584, 1846 
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